You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Could you add some unit tests to verify that tokens are encoded differently when they are special or not please? Maybe by using the a SpecialTokenMap that changes the default values of XlmRobertaVocab.
The text was updated successfully, but these errors were encountered:
For example, the sentence "Wondering how this will get tokenized 🤔 ?" is not encoded the same way if the following json is passed as the third parameter of XLMRobertaTokenizer
Hello,
Could you add some unit tests to verify that tokens are encoded differently when they are special or not please? Maybe by using the a
SpecialTokenMap
that changes the default values ofXlmRobertaVocab
.The text was updated successfully, but these errors were encountered: