You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi @0dB ,
Existing documentation is exactly as it"s supposed to be.
Lemma of a token is not always necessarily lower-case. for example Proper nouns like London have lemma_ as London and not london. So suggested change will not be an accurate representation of what the stopwords field expect.
In case user want to omit London also as a stopword, the code will look like
Ok, I understand. In my case it was the token "HGB" (acronym for a set of german laws for the B2B sector) that I had to lowercase to scrub it, so I thought this holds for all tokens. Ok, but I did trip over that 😊 Is it worth mentioning to others? You could point out what you wrote, no?
In Sample Document (https://derwen.ai/docs/ptr/sample/) I propose to update:
to
The text was updated successfully, but these errors were encountered: