Separate out processing permanent counts from countmatch.reader #32
countmatch.reader
Had a rethink of this task, and now realize why Arman was reluctant to start mucking around with data imputation of permanent count stations, especially if he already had PECOUNT working to fill in missing ~2-3 months of data.
Revised class design:
Revised work plan:
Outstanding question: if |
As I'm refactoring to allow for imputation and other multi-year preprocessing of permanent counts, I've created a design where all data from a count location across multiple years are stored in the same object. The amount of data taken at a location can change drastically from year to year, and the permanent count criteria are defined in terms of a single year's data, so each location's data is currently checked year by year. There are a number of issues with this design:
Alternative design:
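For illustration only, the current design described in this comment (one object per count location holding every year of data, with the permanent count check applied one year at a time) might look roughly like the sketch below. The class name `CountLocation`, the method `permanent_years`, and the `min_days` threshold are hypothetical stand-ins, not the names or criteria used in the repository.

```python
import pandas as pd


class CountLocation:
    """Hypothetical container for all daily counts at one location."""

    def __init__(self, location_id, daily_counts):
        # daily_counts: Series of daily volumes with a DatetimeIndex that
        # may span several years.
        self.location_id = location_id
        self.daily_counts = daily_counts.sort_index()

    def permanent_years(self, min_days=274):
        """Return the years that pass an assumed single-year criterion.

        The real permanent count criteria are not reproduced here; this
        sketch only requires at least `min_days` days of data in a year.
        """
        days_per_year = self.daily_counts.groupby(
            self.daily_counts.index.year).count()
        qualifying = days_per_year[days_per_year >= min_days]
        return sorted(int(year) for year in qualifying.index)


# One full year of data followed by a two-week count the next year.
dates = pd.date_range("2015-01-01", "2015-12-31").append(
    pd.date_range("2016-03-01", "2016-03-14"))
loc = CountLocation(1234, pd.Series(1000.0, index=dates))
print(loc.permanent_years())  # only 2015 has enough days
```

Because the check runs per year, the same location can qualify as a permanent count in one year and not in the next.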
Decided to adopt points 1 and 3, but not 2.
Resolved by #37
Based on comments in #25 and discussions with @aharpalaniTO, we'll be revising how we treat permanent counts. To do this, we can no longer identify and process permanent count locations while reading in data year-by-year, since data imputation and outlier detection for permanent count stations require data from other years.
To resolve this:
- […] `AnnualCount`. These will eventually go into […]
- Rename `AnnualCount` to `RawAnnualCount`, since it's only used by `Reader` and gets discarded as soon as multiple years are combined into single multi-index tables.
- Change `Reader` so that it no longer distinguishes between PTCs and STTCs - all locations will be handled the same way.
- Refactor `growthfactor.py` into `permcount.py`, which will handle all permanent count processing, including outlier detection. We'll move methods into their own files in a subsequent issue if `permanentcount.py` gets too long. This will also solve one point in Clean up and expand countmatch.growthfactor #26.
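As a rough illustration of the reading step described above, the sketch below stacks per-year objects into a single multi-index table without deciding whether a location is a PTC or an STTC. Only the names `RawAnnualCount` and `Reader` come from this issue; the constructor arguments, the `combine_years` helper, and the column and index names are assumptions made for the example.

```python
import pandas as pd


class RawAnnualCount:
    """Hypothetical stand-in: one location-year of counts, as read from file."""

    def __init__(self, count_id, year, data):
        self.count_id = count_id
        self.year = year
        self.data = data  # DataFrame with 'Date' and 'Daily Count' columns


def combine_years(raw_counts):
    """Stack per-year objects into one table indexed by ID, year and date.

    Every location is treated identically here; deciding which
    location-years are permanent counts is left to a later step.
    """
    frames = []
    for rc in raw_counts:
        df = rc.data.copy()
        df["Count ID"] = rc.count_id
        df["Year"] = rc.year
        frames.append(df)
    combined = pd.concat(frames, ignore_index=True)
    return combined.set_index(["Count ID", "Year", "Date"]).sort_index()


raw = [
    RawAnnualCount(1234, 2015, pd.DataFrame({
        "Date": pd.date_range("2015-06-01", periods=3),
        "Daily Count": [900, 950, 1010]})),
    RawAnnualCount(1234, 2016, pd.DataFrame({
        "Date": pd.date_range("2016-06-01", periods=2),
        "Daily Count": [980, 1020]})),
]
table = combine_years(raw)
print(table)
```

Once the combined table exists, the per-year objects can be discarded, and multi-year steps such as imputation or outlier detection can work from the table alone.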