The comment in the Bi-LSTM (Attention) model has an issue. #84

tmracy · 2024-09-10T13:49:45Z

The comment # output : [batch_size, len_seq, n_hidden] should indeed be corrected to # output : [batch_size, len_seq, n_hidden*2] because the Bi-LSTM model is bidirectional. In a bidirectional LSTM, the hidden size is effectively doubled, as it concatenates the forward and backward hidden states. Therefore, the correct shape of the output after permutation is [batch_size, len_seq, n_hidden * 2].

The text was updated successfully, but these errors were encountered:

bbzxc · 2025-01-09T15:32:53Z

Could you please tell me where the Bi-LSTM (Attention) method was proposed? Is there a paper or an algorithm introduction available?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The comment in the Bi-LSTM (Attention) model has an issue. #84

The comment in the Bi-LSTM (Attention) model has an issue. #84

tmracy commented Sep 10, 2024

bbzxc commented Jan 9, 2025

The comment in the Bi-LSTM (Attention) model has an issue. #84

The comment in the Bi-LSTM (Attention) model has an issue. #84

Comments

tmracy commented Sep 10, 2024

bbzxc commented Jan 9, 2025