Uberduck Written Tutorial
  • ‼️BEFORE YOU ASK ANYTHING, READ THIS FIRST!
  • 😊Introduction
  • 🗒️Plans and Updates
  • Discord Server Usage
    • Uberduck's Discord server walkthrough 100%
  • Requirements
    • 🤔Which Sources Should I Start?
    • 🛠️Requirements
  • Gathering Dialogs
    • Intro
      • Audacity Tutorial (WIN/MAC/Linux Users)
      • RX7/8 Tutorial (WIN/MAC Users)
      • FL Studio Tutorial (WIN/MAC Users) (WIP)
      • Vegas Pro Tutorial (Windows Only)
      • FFmpeg/FFBatch Tutorial (WIN/MAC/Linux Only)
      • Web version (Any platform)
  • Transcribing and fixing audios
    • Transcribing Dialogues
    • Transcribing Dialogues (Web Version)
    • Templates
    • Vocal Remover Colab Page Guide
  • Dataset sharing
    • Compressing Dataset
  • 🌮Installing Tacotron 2 Locally
    • 🐍Python Clean Install
    • 💻Miniconda Setup
  • Training time
    • Easy or Hard?
    • Training Audio With Tacotron 2 Colab Page
      • Simplified Training Page
      • Advanced Training Page
    • Setting Up for Local Training (WIP)
  • Testing Audio model
    • Testing Voice Model
    • Testing Voice Model (CPU)
  • Troubleshoot and bugs
    • Troubleshooting
  • Extra Stuff
    • Talknet usage
  • Wrapping up
    • The end
Powered by GitBook
On this page
  • Usage synthesis page
  • Ending this section

Was this helpful?

  1. Testing Audio model

Testing Voice Model (CPU)

PreviousTesting Voice ModelNextTroubleshooting

Last updated 3 years ago

Was this helpful?

Alright now that you've done training your voice model it's time to test your voice model

Be sure to copy to your drive.

Usage synthesis page

Alright so in this main page, you might be confused on where to's. Don't worry, we'll follow everything like it says in a colab page. So let's go for step by step.

This will setup everything it needs (e.g downloading resources needs like tacotron2 and HIFI-GAN, etc)

Now this is where the main part starts. First off, you need get a voice model that you trained, go to your google drive and select this folder:

And after that you right click the model that you made:

After that, copy that ID, NOT THE ENTIRE LINK! THE ONE WITH AN ID ON IT THAT STARTS WITH "https://drive.google.com/file/d/*YOUR ID*/view?usp=sharing"

And then you paste it in tacotron_id: *YOUR ID*

Now in second section you won't need it, it's really that optional, unless if you want to make it sound better, then only if you have fine-tuned model to it, which we'll do that later on.

If you did fine-tuned model, do the same thing with google drive's id link to it.

Otherwise, keep universal.

The third option is pronounciation_dictionary.

If you have ARPABETS in your voice model then better check that, otherwise keep it unchecked if your transcript is english and not ARPABETS.

Fourth option is show_graphs.

Really optional if you wanna see some results yourself (i recommend keeping that on because you may need some of that graph.)

Finally, max_duration.

Depends how long you want your wav file to be, but don't put it too long or else it might synthesis long, or it'll just not play it for you and show an error.

Now once that's all with it click play and wait. Until you'll see this:

Just type something or maybe grab some copypasta until you'll see this:

If you happy with the results then congratulations! You can now upload to uberduck now!

If your model sounds bad nor even spectrogram is just wonky then keep training or grab some more voice files until it gets better.

Ending this section

Congratulations, you have successfully made an voice model. Again you can upload it to uberduck or just keep it private on your own.

If you have some errors go ahead for the next page of Troubleshooting or always ask in Uberduck's discord server.

Colab page
Aw jeez rick.
THE ONES YOU SHOULD COPY IS THE ID ITSELF, NOT THE ENTIRE LINK!
Ignore that python error, it's not that important.