Reading from csv files #364

williamjameshandley · 2024-03-01T18:43:20Z

Description

Following conversations with @yallup, this PR implements reading csv files which have been written with the .to_csv method. Getting this right for all variations on labels/weights being dropped is a little fiddly, and the current implementation prioritises robustness over speed (reading multiple times in some cases), but will function as a low-latency start point.

For consistency this also moves WeightedLabelledPandas into its own file anesthetic.weighted_labelled_pandas.py rather than anesthetic.samples, which reduces clutter for users just looking at the latter file.

Checklist:

I have performed a self-review of my own code
My code is PEP8 compliant (flake8 anesthetic tests)
My code contains compliant docstrings (pydocstyle --convention=numpy anesthetic)
New and existing unit tests pass locally with my changes (python -m pytest)
I have added tests that prove my fix is effective or that my feature works
I have appropriately incremented the semantic version number in both README.rst and anesthetic/_version.py

codecov · 2024-03-01T18:50:16Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 100.00%. Comparing base (3ea992c) to head (0fec842).

Additional details and impacted files

@@            Coverage Diff            @@
##            master      #364   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           34        36    +2     
  Lines         2979      3043   +64     
=========================================
+ Hits          2979      3043   +64

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

williamjameshandley · 2024-03-04T10:57:45Z

@yallup does this perform as expected?

yallup

Just tested and the functionality seems good. Including read_csv into the read_chains logic chain may be useful and then plugs into the gui automatically to boot?

williamjameshandley · 2024-03-04T16:39:06Z

I agree this is a good idea, but leaves a small choice:

If we want it to be consistent with read_chains (read_polychord etc), then this should really take a root rather than a filename (root.csv), which differs from the pandas default read_csv.

Thoughts/preferences?

williamjameshandley · 2024-03-04T16:39:17Z

(could always accept both)

yallup · 2024-03-04T17:06:02Z

I agree this is a good idea, but leaves a small choice:

If we want it to be consistent with read_chains (read_polychord etc), then this should really take a root rather than a filename (root.csv), which differs from the pandas default read_csv.

Thoughts/preferences?

I was going to remark (before checking that it is actually consistent with pandas as is), that root as a filename arg would be preferable. Although we override pandas I think there is enough local precedent to make this conform to the "root" style chains file reading.

williamjameshandley · 2024-03-05T08:47:04Z

OK, I've implemented both, so you can pass root or root.csv to anesthetic.read.csv.read_csv. The gui and read_chains should now both work with csv files.

yallup

Does everything I was looking for, am not testing any of the intricacies around labels but looks good to me(rge)

williamjameshandley added 8 commits March 1, 2024 18:02

Added read_csv for weighted pandas

01ce2a9

Added labelled pandas testing

365b85c

Added weighted_labelled_pandas read_csv

9708866

Added read_csv for NestedSamples

17505fa

Added read_csv to anesthetic

4ee2b73

Updated pydocstyle

6fc79b5

bump version to 2.7.1

4016ccf

bump version to 2.8.0

43fa01d

williamjameshandley requested a review from yallup March 1, 2024 18:46

williamjameshandley added 5 commits March 1, 2024 23:27

updated documentation

2888a4f

Removed inheritance from documentation

f9d3e68

Merge branch 'master' into read_csv

41ada6c

Merge branch 'master' into read_csv

77e6cd9

Merge branch 'master' into read_csv

0ec4a2c

yallup reviewed Mar 4, 2024

View reviewed changes

Updated to include chain reading

0fec842

williamjameshandley requested a review from yallup March 5, 2024 08:47

yallup approved these changes Mar 5, 2024

View reviewed changes

williamjameshandley merged commit 24cbfa5 into master Mar 5, 2024
20 of 22 checks passed

williamjameshandley deleted the read_csv branch March 5, 2024 10:09

AdamOrmondroyd mentioned this pull request Mar 5, 2024

Conda install ignores pandas version requirements #368

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reading from csv files #364

Reading from csv files #364

williamjameshandley commented Mar 1, 2024 •

edited

Loading

codecov bot commented Mar 1, 2024 •

edited

Loading

williamjameshandley commented Mar 4, 2024

yallup left a comment

williamjameshandley commented Mar 4, 2024

williamjameshandley commented Mar 4, 2024

yallup commented Mar 4, 2024

williamjameshandley commented Mar 5, 2024 •

edited

Loading

yallup left a comment

Reading from csv files #364

Reading from csv files #364

Conversation

williamjameshandley commented Mar 1, 2024 • edited Loading

Description

Checklist:

codecov bot commented Mar 1, 2024 • edited Loading

Codecov Report

williamjameshandley commented Mar 4, 2024

yallup left a comment

Choose a reason for hiding this comment

williamjameshandley commented Mar 4, 2024

williamjameshandley commented Mar 4, 2024

yallup commented Mar 4, 2024

williamjameshandley commented Mar 5, 2024 • edited Loading

yallup left a comment

Choose a reason for hiding this comment

williamjameshandley commented Mar 1, 2024 •

edited

Loading

codecov bot commented Mar 1, 2024 •

edited

Loading

williamjameshandley commented Mar 5, 2024 •

edited

Loading