Facodec and training #7
I tried FACodec training, but haven't succeeded. In general I am not satisfied with FACodec's performance.
Thanks for your answer.
I am also discouraged by FACodec's performance.
I would like to ask if you can describe in detail your impressions of the results after testing it.
If this question is for me: my main problem is that it is frame-based, and predicting these tokens would be challenging because of how many of them you need. Also, my tests didn't show what was promised in the paper: the codes are not disentangled. You still need the residual codes to get a nice voice, the content codes still depend on speaker identity, etc.
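To put a rough number on "how many of them you need": a back-of-envelope sketch of the token count for a frame-based codec. The frame rate and codebook count below are assumptions based on FACodec's reported configuration (roughly 80 frames per second across 6 codebooks); adjust for the actual model.

```python
# Back-of-envelope token count for a frame-based codec, illustrating why
# autoregressively predicting these tokens is costly. Values are
# assumptions (~80 frames/s, 6 codebooks), not confirmed specs.
frames_per_second = 80
num_codebooks = 6        # e.g. prosody + content + residual streams
seconds = 10

tokens = frames_per_second * num_codebooks * seconds
print(tokens)  # 4800 tokens for 10 s of audio
```

Even a short utterance yields thousands of tokens, which is why a language-model-style predictor over these codes becomes expensive quickly.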
I think he has emphasized that the key to NS3 is FACodec, but the number of parameters added and used in NS3 is indeed quite large.
@yiwei0730 I have the same conclusion as @ex3ndr: Gradient Reversal is not a 100% reliable mechanism, and FACodec relies mostly on it, which results in information leakage between the codes. That is why the codes are not properly disentangled. It might work within the NS3 architecture, because all codes are ultimately combined before being fed to the decoder, but if you plan to use the codec separately it won't work properly and will result in poor quality.
Well, I don't think the results are good enough to slam diffusion on top of it either: the problem with Voicebox is that it is too versatile and you need to control it, but these tokens are probably too tied to specific speech styles to be useful.
I saw that there is another library called super-gpt-facodec. Is there any chance I can connect them in series?
I also want to ask about training: should super-gpt-facodec and supervoice be trained separately? Is there a step-by-step guide I can follow?