How do you perform voice cloning with this webui? #118

wisplite · 2023-08-22T08:54:43Z

wisplite
Aug 22, 2023

I was looking through the Wiki and the various readme files and none of them seem to detail how you actually do text-to-audio voice cloning with this webui. Is there a way to do this?

Answered by gitmylo

Aug 22, 2023

Text-to-audio voice cloning can be achieved with bark on the text to speech tab, using the clone option in the voice select, then you'll just optionally name and upload your audio file, your next generation will then use the voice of the target speaker.

If it's not like your speaker voice enough, you can usually still improve it with RVC after, but this will require a model of the target speaker.

View full answer

gitmylo · 2023-08-22T11:42:38Z

gitmylo
Aug 22, 2023
Maintainer

Text-to-audio voice cloning can be achieved with bark on the text to speech tab, using the clone option in the voice select, then you'll just optionally name and upload your audio file, your next generation will then use the voice of the target speaker.

If it's not like your speaker voice enough, you can usually still improve it with RVC after, but this will require a model of the target speaker.

1 reply

dangerweenie Aug 8, 2024

Text-to-audio voice cloning can be achieved with bark on the text to speech tab, using the clone option in the voice select, then you'll just optionally name and upload your audio file, your next generation will then use the voice of the target speaker.

If it's not like your speaker voice enough, you can usually still improve it with RVC after, but this will require a model of the target speaker.

Thanks for this info - very helpful for someone just getting started.

Do you know if there is a list of what filetype is used for what? I just downloaded a bunch of .index, .pth, .npy files from an AI voice repo, but after putting them in the models/RVC folder, they're still not showing up in audio-webui.

wisplite · 2023-08-22T21:56:54Z

wisplite
Aug 22, 2023
Author

Now I'm running into a different issue. There seems to be a potential memory leak? When cloning it loads the Hubert model and tokenizer into memory just fine, but the moment it starts extracting semantics it fills up 20gb of system memory (the rest of my 32gb of memory after bark is loaded), causing the Linux kernel to kill tons of processes to prevent a crash, including the webui.

0 replies

gitmylo · 2023-08-22T22:03:51Z

gitmylo
Aug 22, 2023
Maintainer

how big is your audio file? you're supposed to use at most 15 seconds

0 replies

wisplite · 2023-08-22T22:05:03Z

wisplite
Aug 22, 2023
Author

Oh, I was not aware of that, I was training it with 28 minutes of audio lmao. I assumed it would train similar to how RVC does. Thanks for all your help with this!

2 replies

gitmylo Aug 22, 2023
Maintainer

Yeah, it doesn't train, it actually extracts the speech and translates it so bark can continue it

wisplite Aug 22, 2023
Author

That explains a lot of the issues I was having. I was under the assumption it was training, but that's my bad. Again, thanks for helping with this!

gitmylo · 2023-09-29T13:26:31Z

gitmylo Sep 29, 2023
Maintainer

Please don't advertise paid software, thanks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How do you perform voice cloning with this webui? #118

{{title}}

Replies: 5 comments 4 replies

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

This comment was marked as off-topic.

{{title}}

Select a reply

How do you perform voice cloning with this webui? #118

wisplite Aug 22, 2023

Replies: 5 comments · 4 replies

gitmylo Aug 22, 2023 Maintainer

dangerweenie Aug 8, 2024

wisplite Aug 22, 2023 Author

gitmylo Aug 22, 2023 Maintainer

wisplite Aug 22, 2023 Author

gitmylo Aug 22, 2023 Maintainer

wisplite Aug 22, 2023 Author

This comment was marked as off-topic.

gitmylo Sep 29, 2023 Maintainer

wisplite
Aug 22, 2023

Replies: 5 comments 4 replies

gitmylo
Aug 22, 2023
Maintainer

wisplite
Aug 22, 2023
Author

gitmylo
Aug 22, 2023
Maintainer

wisplite
Aug 22, 2023
Author

gitmylo Aug 22, 2023
Maintainer

wisplite Aug 22, 2023
Author

gitmylo Sep 29, 2023
Maintainer