You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you for reporting this issue @paanguin. Actually, we are working on revamping the project with a newer version soon.
Yes, I agree we need to fix this part. There should be a single SOS and EOS tokens. We should not include SOS and EOS tokens in SpectrogramDataset. We will fix this issue in the next update.
I have a question about decoder inputs. I think the following pre-processing adds SOS and EOS token to label y.
end2end-asr-pytorch/models/asr/transformer.py
Line 261 in a22efdd
end2end-asr-pytorch/models/asr/transformer.py
Line 262 in a22efdd
It seems
SpectrogramDataset
also contain a process for adding SOS and EOS to label y.end2end-asr-pytorch/utils/data_loader.py
Line 137 in a22efdd
But I think
SpectrogramDataset
should not do this. I think the decoder currently process the label like this:y= HELLO
seq_in
: SOS, SOS, H, E, L, L, O, EOSseq_out
: SOS, H, E, L, L, O, EOS, EOSI'll be very grateful if you confirm whether this is correct or not.
The text was updated successfully, but these errors were encountered: