FAC - Feature: Restore EIN subcode to /general output #2519

Open
2 tasks done
stucka opened this issue Oct 17, 2023 · 0 comments

Comments


stucka commented Oct 17, 2023

Is your feature request related to a problem? Please describe.

The EINSUBCODE field was not used consistently, but it was populated in a thin majority of the cases where EINs overlapped in the historical system. That made it useful for identifying directly comparable companies. Although it was never a truly unique index, it removed about half of the ambiguity that remains once other old-system-specific fields are excluded.

This data item is not present in the current /general API output, but the codebase still appears to handle it, e.g.:
backend/data_distro/mappings/upload_mapping.py: "EINSUBCODE": ["Auditee", "ein_subcode"]

Describe the solution you'd like

Consider bringing back the EIN subcode to better track historical data across years.

Describe alternatives you've considered

No response

Additional context

The attached CSV has examples of historical records that share both an EIN and a fiscal year end. Roughly 24,000 entries overlap in that way, and roughly 13,000 of them have EIN subcodes that could distinguish them.

old-csv-index-attempt1.csv
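
For anyone wanting to reproduce counts like these, here is a minimal sketch of one way to tally them. It is not the script used to build the attached CSV. The column names `EIN`, `FYENDDATE`, and `EINSUBCODE` are assumptions about the export layout, and the inline sample rows are made up for illustration. "Distinguishable" is also an assumed definition: a row counts if its subcode is non-empty and unique within its EIN + fiscal-year-end group.

```python
import csv
import io
from collections import Counter, defaultdict

# Made-up sample rows; the real export's column names may differ (assumption).
SAMPLE = """EIN,FYENDDATE,EINSUBCODE
123456789,2019-06-30,1
123456789,2019-06-30,2
123456789,2019-06-30,
987654321,2019-06-30,
555555555,2020-12-31,7
555555555,2020-12-31,7
"""


def overlap_summary(rows, ein="EIN", fye="FYENDDATE", sub="EINSUBCODE"):
    """Count rows sharing (EIN, fiscal year end) and how many a subcode separates."""
    # Group the subcodes of every row by its (EIN, fiscal year end) pair.
    groups = defaultdict(list)
    for row in rows:
        groups[(row[ein], row[fye])].append((row.get(sub) or "").strip())

    overlapping = 0
    distinguishable = 0
    for subcodes in groups.values():
        if len(subcodes) < 2:
            continue  # only one record for this pair, so nothing is ambiguous
        overlapping += len(subcodes)
        # A row is distinguishable if its subcode is non-empty and
        # no other row in the same group carries the same subcode.
        counts = Counter(s for s in subcodes if s)
        distinguishable += sum(1 for s in subcodes if s and counts[s] == 1)
    return {"overlapping": overlapping, "distinguishable": distinguishable}


if __name__ == "__main__":
    reader = csv.DictReader(io.StringIO(SAMPLE))
    print(overlap_summary(reader))
```

To run it against a real file, pass `csv.DictReader(open(path, newline=""))` instead of the sample, adjusting the column-name arguments to match the file's header.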

Code of Conduct
