Add MOM6 support (om4 025jra ryf) #258

marc-white · 2024-11-18T04:23:28Z

Closes #175 .

This PR adds the data requested from #175 , which required a new builder: MOM6Builder.

PR includes relevant builder, translator, and tests.

…tamp' groups in filename regexp

…om4_025jra_ryf

…list of xfailing tests

codecov · 2024-11-18T04:25:54Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.44%. Comparing base (4fb9856) to head (428f0d2).
Report is 1 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #258      +/-   ##
==========================================
+ Coverage   97.90%   98.44%   +0.54%     
==========================================
  Files          11       11              
  Lines         811      837      +26     
==========================================
+ Hits          794      824      +30     
+ Misses         17       13       -4

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

charles-turner-1 · 2024-11-18T04:29:07Z

All tests passing - just codecov that's not passing.

I guess that must mean that CI environment isn't mirroring Gadi correctly..

charles-turner-1

Couple of minor comments & a bunch of empty files I think got accidentally committed.

Otherwise looks good - the only thing I think that might warrant some extra thought is the EmptyDataError ( inmanager.py) - I've left a comment here, I'm not sure that EmptyDataError is the most appropriate?

charles-turner-1 · 2024-11-19T05:02:28Z

src/access_nri_intake/catalog/manager.py

+                columns_with_iterables=COLUMNS_WITH_ITERABLES,
+            )
+        except EmptyDataError as e:
+            raise EmptyDataError(str(e) + f": {self.path}")


Perhaps an issue for intake-dataframe-catalog rather than here, but I feel like we might want to emit a DfFileCatalogError here rather than than an EmptyDataError?

I think that is an issue for intake-dataframe-catalog, as you suggested. All I was trying to do is re-emit the same error with a slightly more useful message.

I've opened an issue there. Could you update the catch to

except (EmptyDataError, DfFileCatalogError) as e

so it won't break when we update it there?

src/access_nri_intake/source/builders.py

charles-turner-1 · 2024-11-19T05:07:31Z

src/access_nri_intake/source/builders.py

+            return ncinfo_dict
+
+        except Exception:
+            return {INVALID_ASSET: file, TRACEBACK: traceback.format_exc()}


Codecov is complaining that this line isn't tested - tbh I think it's unimportant.

I'm assuming the equivalent lines aren't tested in the other Builders, but I might look into that.

charles-turner-1 · 2024-11-19T05:09:29Z

tests/data/mom6/output000/manifests/input.yaml

Manifest files all empty - probably an accidental git add . instead of git add --update ?

I'll need to check on Gadi tomorrow - I tried to ape the 'real' structure there as much as possible, and there may be empty manifest yamls on there. Whether it's necessary for testing or not is another matter.

I think there's a tool I've used which detects unused test data... I'll see if I can dig it out. I think it would be good to keep unused data out as far as is possible.

OK, let me know if you find the tool. The aim was to give the build system access to 'furphy' files to make sure they weren't accidentally ingested as real data (c.f. the access-om3 test data directory).

I've done some digging and I think the tool I was thinking of was vulture, so I'm not sure that there's a way of automating checking for unused files?

I don't wanna hold up getting this merged into main - maybe we just raise a separate issue that this PR includes a lot of test data, some of it potentially unused, and come back later?

charles-turner-1 · 2024-11-19T05:37:04Z

tests/data/mom6/error_logs/env.28342543.gadi-pbs.yaml

I think this got accidentally committed?

See above comment re: duplicating the file structure on Gadi.

…om4_025jra_ryf

charles-turner-1

Looks good to me.

I still think we might want to look into reducing the amount of added test data if we can figure out whether there are redundant test data files, but I think that can be dealt with in a separate issue.

rbeucher · 2024-11-26T01:03:33Z

I agree with @charles-turner-1. Would be good to fix codecov and pre-commit though.

marc-white · 2024-11-26T05:16:25Z

I've now made enough additional tests to hit the codecov requirements, so I'll merge.

rbeucher · 2024-11-26T05:34:30Z

Great effort @marc-white

This reverts commit 8d18b19. Testing to see whether this restores test stability

This reverts commit 23b3b5f - ie. it restores the mom6 stuff.

charles-turner-1 · 2024-12-06T04:26:17Z

tests/test_builders.py

+                filename="19000101.ice_daily.nc",
+                file_id="XXXXXXXX_ice_daily",
+                filename_timestamp="19000101",
+                frequency="subhr",


@marc-white see frequency bug we missed when merging

Fixes issues with MOM6 testing, and time parsing of same. * Revert "Add MOM6 support (om4 025jra ryf) (#258)" This reverts commit 8d18b19. Testing to see whether this restores test stability * Revert "Revert "Add MOM6 support (om4 025jra ryf) (#258)"" This reverts commit 23b3b5f - ie. it restores the mom6 stuff. * Add xarray complete * Pin dependencies back in time for 3.11 * Fail fast false * Pin a bunch of deps * Added toxfile * tox.ini w/ comments on failures * Revert "Pin dependencies back in time for 3.11" This reverts commit 6fa6676. * These changes are ugly & horrible but mostly seem to resolve the issues with cftime. THey cause some assets to fail because they alter the parser, but I think this is a window into a solution. * Lots of catches for overflow errors: keeping for posterity * Restore 'test_builders.py' to same state as main * Fix mom6 tests - should now all be failing * Ready to replace time info guesses for MOM6 with a subclass * Fixed broken MOM6 builder * re-enable fast fail * Reverted CI environments to main * Removed '_access' from a bunch of function names - now we have GFDL models in the builders, this is misleading * Revert load_dataset => open_dataset * Updating test to fix coverage * Restored to working state * Tests for GenericTimeParser & AccessTimeParser * Improve test coverage * Improve test coverage for GfdlTimeParser * Improve test coverage for GfdlTimeParser * Improve test coverage for GfdlTimeParser * Marc's comments

marc-white added 30 commits August 20, 2024 16:48

Factor out the 'static' frequency tag as a variable

1d040c7

Initial creation of AccessOm4Builder and test data structure

cb8b017

Correct exclude pattern, add Om4Builder to test data suite

7e14a52

Improved OM4 test data (still doesn't work though)

2428c2d

Fixed test data, fixed filename regexp

a13de8b

Add FIXME for timestamp part of PATTERN regex

ed2ba42

Add test_builder_parser tests for OM4

463005f

Add test_parse_access_filename tests for OM4

6829eb0

Final tests for AccessOM4 builder (plus another TODO)

6796f42

Add new pattern for OM4; add ability to have multiple redacted 'times…

138e2a6

…tamp' groups in filename regexp

OM4 test expansion

79763cc

Added test data for panan-01-zstar data

3212a99

Refactor AccessOm4Builder --> Mom6Builder

b6e9d41

Merge remote-tracking branch 'origin/main' into 175-data-request-add-…

2956b7b

…om4_025jra_ryf

Add first MOM6 datasets

0d476c6

Merge remote-tracking branch 'origin/main' into 175-data-request-add-…

24d97eb

…om4_025jra_ryf

Merge branch 'main' into 175-data-request-add-om4_025jra_ryf

b3ab3bd

Merge branch 'main' into 175-data-request-add-om4_025jra_ryf

120d92c

Add test print for debugging

720cad1

Remove asset print debug

3fa73e7

Add better error statement to CatalogManager __init__

fc582ce

Expand out ParserError message in validate_parser

6e7a0e9

Tweak ParserError message

fa89393

Further expand ParserError message

d428726

Add first pass at MOM6 translator (hack job)

bf9450d

Next pass at MOM6 translator

6f31a53

Remembered to change the configured Translator

a95194c

Try MOM6 translator again...

0eb2a97

Merge remote-tracking branch 'origin/main' into 175-data-request-add-…

581e9ff

…om4_025jra_ryf

Fix up MOM6 builder after last merge

b7856aa

marc-white and others added 3 commits November 14, 2024 13:44

Fix MOM6 test variables etc.

3531397

Added docstring to parse_access_ncfile to make issue clear & updated …

4aac81b

…list of xfailing tests

Merge branch 'main' into 175-data-request-add-om4_025jra_ryf

6dfbd89

marc-white linked an issue Nov 18, 2024 that may be closed by this pull request

[DATA REQUEST] Add COSIMA Panantarctic / GFDL_OM4 Builder & Data #175

Closed

5 tasks

Remove redundant MOM6 translator

47166db

marc-white marked this pull request as ready for review November 19, 2024 03:46

marc-white requested review from charles-turner-1 and dougiesquire November 19, 2024 03:46

charles-turner-1 reviewed Nov 19, 2024

View reviewed changes

marc-white changed the title ~~DRAFT: Add MOM6 support (om4 025jra ryf)~~ Add MOM6 support (om4 025jra ryf) Nov 19, 2024

marc-white added 3 commits November 19, 2024 17:50

PR updates to builders.py

4331abc

Update to except statement in CatalogManager

5705d33

Merge remote-tracking branch 'origin/main' into 175-data-request-add-…

97f18db

…om4_025jra_ryf

marc-white requested a review from charles-turner-1 November 25, 2024 00:23

charles-turner-1 approved these changes Nov 25, 2024

View reviewed changes

Merge branch 'main' into 175-data-request-add-om4_025jra_ryf

bdb309b

marc-white added 4 commits November 26, 2024 12:10

Ruff fix

101c948

Improve test coverage of Builder parser exceptions

cc2f6e5

Improve manager test coverage

2637426

Improve builders test coverage

428f0d2

marc-white merged commit 8d18b19 into main Nov 26, 2024
18 checks passed

charles-turner-1 added a commit that referenced this pull request Dec 4, 2024

Revert "Add MOM6 support (om4 025jra ryf) (#258)"

23b3b5f

This reverts commit 8d18b19. Testing to see whether this restores test stability

charles-turner-1 added a commit that referenced this pull request Dec 4, 2024

Revert "Revert "Add MOM6 support (om4 025jra ryf) (#258)""

16c93f9

This reverts commit 23b3b5f - ie. it restores the mom6 stuff.

charles-turner-1 reviewed Dec 6, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add MOM6 support (om4 025jra ryf) #258

Add MOM6 support (om4 025jra ryf) #258

marc-white commented Nov 18, 2024

codecov bot commented Nov 18, 2024 •

edited

Loading

charles-turner-1 commented Nov 18, 2024 •

edited

Loading

charles-turner-1 left a comment

charles-turner-1 Nov 19, 2024

marc-white Nov 19, 2024

charles-turner-1 Nov 19, 2024

charles-turner-1 Nov 19, 2024

marc-white Nov 26, 2024

charles-turner-1 Nov 19, 2024

marc-white Nov 19, 2024

charles-turner-1 Nov 19, 2024 •

edited

Loading

marc-white Nov 19, 2024

charles-turner-1 Nov 19, 2024 •

edited

Loading

charles-turner-1 Nov 19, 2024

marc-white Nov 19, 2024

charles-turner-1 left a comment

rbeucher commented Nov 26, 2024

marc-white commented Nov 26, 2024

rbeucher commented Nov 26, 2024

charles-turner-1 Dec 6, 2024

Add MOM6 support (om4 025jra ryf) #258

Add MOM6 support (om4 025jra ryf) #258

Conversation

marc-white commented Nov 18, 2024

codecov bot commented Nov 18, 2024 • edited Loading

Codecov Report

charles-turner-1 commented Nov 18, 2024 • edited Loading

charles-turner-1 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

charles-turner-1 Nov 19, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

charles-turner-1 Nov 19, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

charles-turner-1 left a comment

Choose a reason for hiding this comment

rbeucher commented Nov 26, 2024

marc-white commented Nov 26, 2024

rbeucher commented Nov 26, 2024

Choose a reason for hiding this comment

codecov bot commented Nov 18, 2024 •

edited

Loading

charles-turner-1 commented Nov 18, 2024 •

edited

Loading

charles-turner-1 Nov 19, 2024 •

edited

Loading

charles-turner-1 Nov 19, 2024 •

edited

Loading