-
Notifications
You must be signed in to change notification settings - Fork 48
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
USA not recognized #23
Comments
I'm also not able to extract a country for "USA" using geotext. It looks like "UK" also does not produce a match: import geotext
text = "UK"
geo_text = geotext.GeoText(text)
dict(geo_text.country_mentions) returns |
Check the demo data carefully. GeoText does not use synonyms in its lookup. |
Are you talking about the data in geotext/geotext/data/countryInfo.txt Line 285 in 21a8a7f
geotext/geotext/data/countryInfo.txt Line 8 in 21a8a7f
geotext/geotext/data/countryInfo.txt Line 128 in 21a8a7f
First, isn't |
The data in |
Hi guys, we don't include ISO because the approach used in Geotext (rule based regex) is based on high precision rules (so you can almost be certain that it is correct when it works). The drawback is that we lose some recall. So I prefer the approach of providing basic functionality with high precision and leaving the responsibility of extending recall to users ex what @lisiq did (preprocessing the data). What we could do is improve the API to make it easier to add your own exceptions. That said flashgeotext from @iwpnd looks great. Please try it out and let me know if we should join efforts there. |
"USA" is not being detected. I have to replace "USA" to "United States" in order the country to be detected.
The text was updated successfully, but these errors were encountered: