🛠️Requirements

Let's start things off with the programs you'll need.

Grabbing from YouTube

If you wanna make a voice model from a YouTube video, you can go for one of these programs:

4K Video Downloader

Let's start off with 4K Video Downloader. I'd call this software... freeware. It's free, but if you want to download every video in a playlist, you're gonna have to pay for a license. Yeah... I know, you may say "I'm poor, I don't have money", but you don't even have to pay. That's the thing.

yt-dlp

The other option is youtube-dl. If you know how to use the terminal/CMD, this is the kind of tool you should use, and it's what I use. I won't go into detail on how to use it; Google and YouTube tutorials are your best bet. You should probably use yt-dlp instead, though: youtube-dl doesn't get updated much anymore, while yt-dlp is a fork that someone's actively taking care of, and it works better. It works the same way as youtube-dl, don't worry.

Here's the command I usually use with yt-dlp:

yt-dlp -x --audio-format mp3 -f best *youtube url* -o "*location path*\%%(title)s.%%(ext)s"

So, let's go through it one by one.

  • yt-dlp is... DUH, the executable. (Note the name: the program is called yt-dlp, not youtube-dlp.)

  • -x --audio-format mp3 basically tells yt-dlp to turn the video into an audio file (mp3 here, but you can also try typing wav).

  • -f best tells yt-dlp to grab the best-quality file that has both video and audio in it, and -x then pulls the audio out of that.

  • *youtube url* is pretty basic: copy the YouTube URL from your browser.

  • -o is your output, the location to save to. For example: D:\stuff\%%(title)s.%%(ext)s

  • Oh, and %%(title)s.%%(ext)s basically names the file after the video title and gives it the right extension. The doubled %% is for .bat files; if you type the command straight into CMD, use a single % instead.
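If you'd rather kick this off from a script than type it into CMD, here's a minimal Python sketch that builds the same command as a list of arguments. It assumes the executable is installed as yt-dlp (the real name of the program) and is on your PATH; build_ytdlp_command is a made-up helper name, not part of yt-dlp. Since nothing passes through a .bat file here, the template uses single % signs.

```python
from pathlib import PureWindowsPath


def build_ytdlp_command(url, out_dir, audio_format="mp3"):
    """Return the yt-dlp argument list for grabbing audio from one video.

    Hypothetical helper for illustration only; it just assembles the
    same flags as the command shown above.
    """
    # Output template: yt-dlp itself fills in %(title)s and %(ext)s.
    template = str(PureWindowsPath(out_dir) / "%(title)s.%(ext)s")
    return [
        "yt-dlp",
        "-x",                            # extract the audio
        "--audio-format", audio_format,  # mp3 by default, wav also works
        "-f", "best",                    # same format choice as above
        "-o", template,                  # where to save and how to name it
        url,
    ]


if __name__ == "__main__":
    # Placeholder URL and folder; swap in your own.
    print(build_ytdlp_command("https://www.youtube.com/watch?v=VIDEO_ID", r"D:\stuff"))
```

To actually run it, you could pass the list to subprocess.run(cmd, check=True). Keep in mind that -x needs ffmpeg installed to do the audio conversion.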

If you followed along and it worked, congratulations: you've now got an audio file from YouTube.

As for TV shows, movies, and cartoons: you're gonna have to get those yourself.

Audio editor

Alright, here's the one you should use:

Audacity

Yes, that's the one! Audacity is a free, open-source audio editor. If it seems complicated, no worries: the manual, quickstart guides, and YouTube videos can help you with that.

What if I have money to buy things?

If you have enough cash to buy audio editors, here's what you should get:

iZotope RX7/8/9

iZotope RX7/8/9 is the only thing you need. If you have enough money to buy Advanced, you should be good to go. Standard and Elements, though, aren't as good, since they have limits.

FL Studio

FL Studio. I know, I know, you might be saying it's just for "music" production. But you see, it's not just for music: it's a DAW (Digital Audio Workstation), so you can do audio design and grab dialogue with it too.

Vegas Pro

I'm pretty sure you know this software. If not: it's Vegas Pro, a video editor that also works as an audio editor. It works for grabbing dialogue too.

Text Editor

Next up are text editors. Here are two choices:

Notepad

Not a good choice but it'll work.

Notepad++

This one is definitely my recommended text editor. It has cool features and stuff. Once you get used to it, you'll go from a basic user to an advanced one with regular expressions.

Audio Conversion

If you end up with a lot of mp3, ogg, or maybe wav files that aren't all in the same format (mono, 16-bit, 22050 Hz sample rate), you'll need to convert them.

What do you need for audio conversion? Well, I really have only one. Free converters like Freemake will work, but I'd say meh to that one. Here's what I'd use instead:

FFmpeg

This might be complicated for you; it's for advanced users who have experience with commands and stuff. So don't use FFmpeg if you're not that experienced with the terminal/CMD.
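For the curious, here's roughly what an FFmpeg conversion to the training format looks like, sketched in Python as an argument list. It assumes ffmpeg is installed and on your PATH; build_ffmpeg_command is a made-up helper name, and the file names are placeholders. The flags are standard FFmpeg ones: -ac 1 for mono, -ar 22050 for the sample rate, and -c:a pcm_s16le for 16-bit PCM, which is what a regular 16-bit .wav holds.

```python
def build_ffmpeg_command(src, dst):
    """Return the ffmpeg argument list for one file conversion.

    Hypothetical helper for illustration: converts any input that
    ffmpeg can read into a mono, 22050 Hz, 16-bit wav file.
    """
    return [
        "ffmpeg",
        "-i", src,             # input file (mp3, ogg, wav, ...)
        "-ac", "1",            # 1 audio channel = mono
        "-ar", "22050",        # sample rate in Hz
        "-c:a", "pcm_s16le",   # 16-bit PCM audio
        dst,                   # output file, should end in .wav
    ]


if __name__ == "__main__":
    print(" ".join(build_ffmpeg_command("input.mp3", "output.wav")))
```

Printing the list joined with spaces gives you the line you'd type into the terminal; passing the list to subprocess.run(cmd, check=True) would run it.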

FFBatch

This is the easy converter that I usually use. So if you're not an advanced FFmpeg user, go ahead and download this one instead.

Audio Requirements

Right, I forgot to tell you one thing. This may confuse you, but the audio you need for training is a .wav file, and it also needs to be 16-bit, mono, with a 22050 Hz sample rate. Otherwise you'll end up with errors.
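If you want to double-check a file before training, here's a minimal sketch using Python's built-in wave module. check_wav_format is a made-up name, and this only reads plain PCM .wav files; it reports anything that isn't mono, 16-bit, or 22050 Hz.

```python
import wave


def check_wav_format(path):
    """Return a list of problems; an empty list means the file is fine.

    Hypothetical helper for illustration: checks for mono, 16-bit,
    22050 Hz, the format described above.
    """
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getnchannels() != 1:
            problems.append(f"channels: {wav.getnchannels()} (want 1, mono)")
        # Sample width is in bytes, so 2 bytes = 16-bit.
        if wav.getsampwidth() != 2:
            problems.append(f"bit depth: {wav.getsampwidth() * 8} (want 16)")
        if wav.getframerate() != 22050:
            problems.append(f"sample rate: {wav.getframerate()} (want 22050)")
    return problems
```

Call it with the path to one of your clips, for example check_wav_format("clip.wav"), where clip.wav is a placeholder name.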

I also recommend having around 100 dialogue clips or more, because the more dialogue you have, the better it will sound.

One more important thing: each dialogue clip should be 1.5 to 12 seconds long. If you make them longer, it's not gonna work. We'll cover splitting dialogue in the gathering dialogues section.

Remember, your dialogue shouldn't be noisy. If it is, I recommend cleaning up the noise with a program like RX7/8 or with the vocal remover Colab page.
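To spot clips that are too short or too long, here's a small sketch, again with Python's built-in wave module. The function names and the two constants are made up for illustration; the limits are the 1.5 and 12 seconds mentioned above. The length is simply the number of frames divided by the sample rate.

```python
import wave

MIN_SECONDS = 1.5   # shortest clip length mentioned above
MAX_SECONDS = 12.0  # longest clip length mentioned above


def clip_seconds(path):
    """Return the length of a .wav file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def is_good_length(path):
    """True if the clip is between 1.5 and 12 seconds long."""
    return MIN_SECONDS <= clip_seconds(path) <= MAX_SECONDS
```

Anything that comes back False from is_good_length is a candidate for splitting or dropping.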

End here

Well, now that we've covered the requirements, let's get started!
