-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Pretrained Models #4
Comments
I downloaded them from huggingface, and I guess you should put them in the directory mentioned in each conf, for example: |
@Hubotcoder @lyutyuh Huggingface contains pretrained T5 but if I understand right there needs to be additional weights (feedforward networks) for the specific tasks like NER? Please correct me if I am wrong. |
You are right. Now I get what you mean. Let's wait for the author. |
@Hubotcoder, tried contacting the authors via mail but no reply even there. How did you advance with this repository? Were you able to train the models? Thank you |
@KawshikManikantan Hello. Not yet. I am still asking my boss to buy me a GPU. |
May be too late, but I've been able to start training t5_large based ere model with flant5_large_conll04 config (except The last results are this
As you may see, It doesn't performs evaluation on test dataset. UPD: it doesn't perform test evaluation on the run, but there is a separate method to run the evaluation.4000 step result (27 epochs)
|
That is a really interesting idea proposed. Thanks for the release of the codebase. Can you please tell me when the pre-trained models would be released?
The text was updated successfully, but these errors were encountered: