-
Notifications
You must be signed in to change notification settings - Fork 213
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to extract alignment from tacotron2? #88
Comments
Why are alignments used for after all? Tacotron-2 paper will not mention alignments. |
I found this in FastSpeech2 paper:
This speaks clearly that you need another trained model to work with FastSpeech custom dataset, which is not so smart. Or, the alignments are such a big problem, because based on those alignments the the training is possible. No alignments, no training. This paper "FastSpeech" is worth inspecting to understand how is done (in principle), but for some out of the box training possible is not the best choice. You may find the |
Thank you, i found alignments.py previous commit and tried it. In result, synthesis quality not bad, but when i inference long sentence long than five or six words, there was stuttering and missing letters problem in synthesis. Now i try FastSpeech2. Alignments are really such a big problem. |
Hi, i have the same question. I also try to train my language with FastSpeech2, but alignments are really difficult. |
@CanKorkut Hi, i'm using that alignment.py (Commit id: e11b60d) to extract alignments files but the result show different dimension with LJSpeech alignment files (in this source code Fast Speech already had). Can you show me your code to extract exactly alignment files to train another language ? thank you |
Hi,
I want to try fastspeech on different dataset. therefore, can you share how to extract alignment from tacotron2?
I tried this code, but get bad result for synthesis when inference long sentence .
_, _, _, alignments = model.inference(sequence)
d = alignments.float().data.cpu().numpy()[0].T
x = np.zeros(d.shape[0])
for i,y in enumerate(d):
x[i] = y.sum()
np.save("path_to_save_folder"+name+".npy",x.astype(np.dtype('i4')))
Thank you.
The text was updated successfully, but these errors were encountered: