Add/lstm direction #2455

Merged on Sep 30, 2024 (6 commits)

@singjc (Contributor) commented Aug 29, 2024

Hi,

I added a direction field to the LSTMConfig, using an enum for the forward and reverse directions. This makes it possible to retrieve the weights of the reversed layers of a pretrained PyTorch model. It also addresses issue #2351.
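
For readers following along, here is a minimal standalone sketch of the idea, not the code merged in this PR. It assumes the direction only needs to select the `_reverse` suffix that PyTorch appends to the parameter names of reversed LSTM layers (for example `weight_ih_l0_reverse`). The names `Direction::Reverse`, `suffix`, and `param_name` are hypothetical and chosen for illustration; the actual definitions in candle-nn/src/rnn.rs may differ.

```rust
/// Direction of an LSTM layer.
/// Hypothetical variant names; the enum in candle-nn may name them differently.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Direction {
    Forward,
    Reverse,
}

impl Direction {
    /// Suffix that PyTorch appends to the parameter names of reversed layers.
    /// Returns a `&'static str` so no allocation is needed for the suffix.
    pub fn suffix(self) -> &'static str {
        match self {
            Direction::Forward => "",
            Direction::Reverse => "_reverse",
        }
    }
}

/// Builds a PyTorch-style LSTM parameter name from the parameter kind
/// (e.g. "weight_ih"), the layer index, and the direction.
/// Hypothetical helper, used here only to show the naming scheme.
pub fn param_name(kind: &str, layer_idx: usize, direction: Direction) -> String {
    format!("{kind}_l{layer_idx}{}", direction.suffix())
}
```

With this sketch, `param_name("weight_ih", 0, Direction::Reverse)` yields the name under which a PyTorch checkpoint stores the input-to-hidden weights of the reversed first layer, so a loader could look up both directions of a bidirectional LSTM separately.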

Two review threads on candle-nn/src/rnn.rs (outdated, resolved)
@LaurentMazare LaurentMazare merged commit aa35bf2 into huggingface:main Sep 30, 2024
8 of 10 checks passed
@LaurentMazare (Collaborator) commented

Thanks!

@LaurentMazare (Collaborator) commented

By the way, if you already have some models that use a bidirectional LSTM, it would be great to have them as potential examples in candle-examples.

EricLBuehler pushed a commit to EricLBuehler/candle that referenced this pull request Nov 26, 2024
* add: direction for lstm layer

* lint: remove unused Error import

* refactor: remove unnecessary int assignment to Direction enum:

* refactor: use &'static str type instead of String for direction_str:

* Run cargofmt.

---------

Co-authored-by: Laurent <[email protected]>