Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Get rid of spaCy dependency #1

Open
AntheSevenants opened this issue Apr 7, 2022 · 0 comments
Open

Get rid of spaCy dependency #1

AntheSevenants opened this issue Apr 7, 2022 · 0 comments
Labels
enhancement New feature or request question Further information is requested

Comments

@AntheSevenants
Copy link
Owner

It seems to be possible to use the built-in tokeniser of Huggingface transformers to do word to token mapping:
https://discuss.huggingface.co/t/generate-raw-word-embeddings-using-transformer-models-like-bert-for-downstream-process/2958/2

The question is whether this would be useful. spaCy offers a lot of other information as well that might be interesting...

@AntheSevenants AntheSevenants added enhancement New feature or request question Further information is requested labels Apr 7, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request question Further information is requested
Projects
None yet
Development

No branches or pull requests

1 participant