Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bug fixes h5 #1547

Merged
merged 14 commits into from
Feb 13, 2025
Merged

Bug fixes h5 #1547

merged 14 commits into from
Feb 13, 2025

Conversation

Abellegese
Copy link
Contributor

Thank you for taking your time to contribute to Ersilia, just a few checks before we proceed

  • Have you followed the guidelines in our Contribution Guide
  • Have you written new tests for your core changes, as applicable?
  • Have you successfully ran tests with your changes locally?

Description

Fix for parsing and formatting of h5/csv files. It also has a binary splitting approach over the one by one.

Related to ersilia-os/zairachem-docker#7 (comment)

@DhanshreeA
Copy link
Member

@Abellegese could you look at the failing Circle CI test?

@DhanshreeA
Copy link
Member

I tested this PR on eos3ae6 with the DILI train dataset and can confirm that not only do the None values serialize correctly now (a single None value per column instead of a list in a single column), the binary search approach has also made processing significantly faster.

@DhanshreeA DhanshreeA merged commit a6b9aae into master Feb 13, 2025
53 checks passed
@DhanshreeA DhanshreeA deleted the bug-fixes-h5 branch February 13, 2025 06:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants