-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Should the cookbook default to stricter requirements when merging/concatenating data? #319
Comments
Thanks for catching this @dougiesquire. I'm not sure this is a cookbook issue. I think it's more an issue with the data itself. I don't think it's a good idea to have output defined on different regions using the same file name. I'd suggest that a good way to deal with this issue is to rename the But I guess even then, it would still be useful to flag it so that the user knows they have to use |
Thanks @rmholmes. I wasn't meaning to suggest that the issue is with the cookbook, but having My guess is that most uses of the cookbook are to query/load datasets that should have consistent indexes. So having |
But yes, for fixing the specific issue with |
Sorry @dougiesquire, I didn't completely take in your comment here as I'd copied my response across from the cosima-recipes issue you'd put up. I'd support a move to the stricter requirements. I think @AndyHoggANU ran that simulation. |
I agree that |
Is there any time penalty with |
I think the checks by |
Agreed. I wouldn't expect any difference in speed for data that can be joined. As @angus-g mentioned, there are other EDIT: see the Note here: https://docs.xarray.dev/en/stable/user-guide/io.html#reading-multi-file-datasets |
Motivating example here: COSIMA/cosima-recipes#229. Files with the same naming in a single experiment are on different domains depending on the
output*
directory. Should the cookbook check whether indexes are the same for data being merged/concatenated?E.g. passing
join="exact"
to theopen_mfdataset()
call within the cosima-cookbook will then return an error when indexes to be aligned are not equal. This could currently be passed through kwargs, but should it be the default?The text was updated successfully, but these errors were encountered: