Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Using a Generator Network Before the Vocoder #41

Open
saeedzou opened this issue Aug 16, 2024 · 1 comment
Open

Using a Generator Network Before the Vocoder #41

saeedzou opened this issue Aug 16, 2024 · 1 comment

Comments

@saeedzou
Copy link

Hi,

First of all, great work on this project!

I have a question regarding the architecture. What would happen if you introduced a generator network before the vocoder to generate mel spectrograms, and then trained the generator while using a pre-trained vocoder? I'm curious about how this approach might affect the performance and quality of the generated audio.

Looking forward to your thoughts on this.

@RF5
Copy link
Collaborator

RF5 commented Aug 17, 2024

Hi @saeedzou

That's an interesting idea! It definitely sounds like it should work fairly well, but I don't have a good feeling for the tradeoffs it might have. It might be that it makes the task of the generator way easier and increases overall performance, or it might be that error accumulated by adding an additional model to the pipeline makes it worse on average. I suspect it might improve things a bit. Regardless, a very cool idea, if you try it out, do be sure to let us know how it goes!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants