Hi,

First of all, great work on this project!

I have a question regarding the architecture. What would happen if you introduced a generator network before the vocoder to produce mel spectrograms, and then trained that generator while keeping a pre-trained vocoder fixed? I'm curious how this approach might affect the performance and quality of the generated audio.

Looking forward to your thoughts on this.
That's an interesting idea! It sounds like it should work fairly well, but I don't have a good feel for the tradeoffs. It might make the generator's task much easier and improve overall performance, or the error accumulated by adding another model to the pipeline might make things worse on average. My suspicion is that it would improve things a bit. Regardless, it's a very cool idea; if you try it out, do be sure to let us know how it goes!
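For anyone who wants to experiment with this, here is a minimal PyTorch sketch of the proposed setup: a generator producing mel spectrograms, followed by a frozen pre-trained vocoder, with gradients flowing through the vocoder back into the generator. The `Generator` and `Vocoder` modules, their shapes, and the loss are all illustrative stand-ins (not this project's actual code); in practice you would load real vocoder weights (e.g. a HiFi-GAN checkpoint) and use a proper audio reconstruction loss.

```python
import torch
import torch.nn as nn

N_MELS = 80  # assumed mel-spectrogram height

class Generator(nn.Module):
    """Hypothetical network mapping a latent code to a mel spectrogram."""
    def __init__(self, latent_dim=128, frames=32):
        super().__init__()
        self.frames = frames
        self.net = nn.Sequential(
            nn.Linear(latent_dim, 256), nn.ReLU(),
            nn.Linear(256, N_MELS * frames),
        )

    def forward(self, z):
        out = self.net(z)
        return out.view(z.size(0), N_MELS, self.frames)  # (B, N_MELS, T)

class Vocoder(nn.Module):
    """Stand-in for a pre-trained mel -> waveform model."""
    def __init__(self, hop=256):
        super().__init__()
        self.proj = nn.Linear(N_MELS, hop)

    def forward(self, mel):                   # mel: (B, N_MELS, T)
        wav = self.proj(mel.transpose(1, 2))  # (B, T, hop)
        return wav.reshape(mel.size(0), -1)   # (B, T * hop)

generator = Generator()
vocoder = Vocoder()  # in practice: load pre-trained weights here

# Freeze the vocoder: only the generator receives parameter updates,
# but gradients still flow *through* the vocoder to the generator.
for p in vocoder.parameters():
    p.requires_grad_(False)
vocoder.eval()

opt = torch.optim.Adam(generator.parameters(), lr=1e-4)

z = torch.randn(4, 128)                  # latent batch (illustrative)
target_wav = torch.randn(4, 32 * 256)    # stand-in ground-truth audio

mel = generator(z)                       # (4, 80, 32)
wav = vocoder(mel)
loss = nn.functional.l1_loss(wav, target_wav)

opt.zero_grad()
loss.backward()
opt.step()
```

One design note: because the vocoder is frozen, any mismatch between the generator's mel statistics and the mels the vocoder was trained on goes uncorrected, which is one place the error accumulation mentioned above could show up.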