Dataset from 24Golden paper Fig 7 #194 #197

Merged
merged 5 commits into from
Aug 14, 2024

bdeadman
Collaborator

* Uploading dataset pbtxt for 24Golden paper Fig 7

https://doi.org/10.1016/j.chempr.2024.04.001

* Add files via upload

This is the revised .pbtxt file reviewed by @bendeadman for Fig. 7 data.

Changes made:

- Elaborated the dataset description
- Added the well identifier to reactions
- added some 'details' to the reaction identifiers and the product LC measurement to give context.
- changed the provenance html to point to the doi link (should be more permanent)

* Delete 24_Golden_Fig_7_dataset.pbtxt

Change summary:

| Filename | Added | Removed | Changed |
| --- | --- | --- | --- |
| data/Fig_7_data_dataset.pbtxt | 1 | 0 | 0 |
| | 1 | 0 | 0 |

@bdeadman
Collaborator Author

@skearnes @connorcoley

I've already reviewed this data from the Stahl group so it should just need a review to confirm it can move from the branch to main.

@connorcoley
Contributor

Is this meant to just be a single reaction?

@bdeadman
Collaborator Author

bdeadman commented Aug 3, 2024

It should be 288 reactions. I've downloaded the file from the open-reaction-database:#194 branch to check it again (see below).

image
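A quick local check of the reaction count can be sketched as follows. This is a rough, standard-library-only approximation, not the repository's count_reactions workflow: it assumes the `.pbtxt` file is a Dataset message in protobuf text format where each reaction starts a top-level `reactions {` block at the beginning of a line. The function name `count_reactions_pbtxt` and the sample text are hypothetical.

```python
import re


def count_reactions_pbtxt(text: str) -> int:
    """Count top-level `reactions { ... }` blocks in Dataset text format.

    Assumption: top-level fields start at column 0, so nested fields
    (which are indented) are not counted.
    """
    # Accept both `reactions {` and `reactions: {`, which text format allows.
    pattern = r"^reactions\s*:?\s*\{"
    return len(re.findall(pattern, text, flags=re.MULTILINE))


# Hypothetical two-reaction dataset, for illustration only.
sample = '''name: "example"
reactions {
  identifiers {
    details: "well A1"
  }
}
reactions {
  identifiers {
    details: "well A2"
  }
}
'''

print(count_reactions_pbtxt(sample))
```

For a real file, read it with `open(path, encoding="utf-8").read()` and pass the text in. A count of 1 where 288 is expected would suggest the reactions are not being recognised as separate top-level entries.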

@bdeadman bdeadman self-assigned this Aug 5, 2024
Count reactions workflow failing so removing the dataset to redo pull.

github-actions bot commented Aug 5, 2024

Change summary:

| Filename | Added | Removed | Changed |
| --- | --- | --- | --- |
| data/a1/ord_dataset-a12fa15d036d489c971b0b514caeae52.pb.gz | 288 | 0 | 0 |
| | 288 | 0 | 0 |

@bdeadman
Collaborator Author

bdeadman commented Aug 5, 2024

The count_reactions workflow was failing and giving the following error:

INFO 2024-08-05 13:06:56,894 reactions.py:40: data/c2/ord_dataset-c2ad1656a3ca4d08888ffb6e3f3a2742.pb.gz: 2701
HEAD detached at 9aa515c
Untracked files:
(use "git add ..." to include in what will be committed)
ord-schema/

nothing added to commit but untracked files present (use "git add" to track)
warning: redirecting to https://github.com/open-reaction-database/ord-data.git/
Uploading LFS objects: 100% (526/526), 1.2 GB | 0 B/s, done.
To https://github.com/open-reaction-database/ord-data.git
! [rejected] HEAD -> #194 (fetch first)
error: failed to push some refs to 'https://github.com/open-reaction-database/ord-data.git'
hint: Updates were rejected because the remote contains work that you do not
hint: have locally. This is usually caused by another repository pushing to
hint: the same ref. If you want to integrate the remote changes, use
hint: 'git pull' before pushing again.
hint: See the 'Note about fast-forwards' in 'git push --help' for details.
Error: Process completed with exit code 1.

To resolve this, I removed the new dataset from my local copy of the open-reaction-database:#194 branch, committed and pushed that change, then added the file back in (the version produced by ORD processing) and committed and pushed again. The workflows now run, and count_reactions shows the correct 288 reactions added.
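The remove-and-re-add sequence described above could be written out roughly as below. This is only a sketch of the steps as described, not the exact commands that were run: the use of `git rm --cached` (which keeps the file on disk so it can be re-added), the remote name `origin`, and the helper name `readd_plan` are assumptions. The commit messages are taken from this pull request's commit list. The plan is printed rather than executed.

```python
import shlex


def readd_plan(dataset_path: str, branch: str, remote: str = "origin") -> list[list[str]]:
    """Return the git commands for removing a dataset and adding it back."""
    return [
        # Stop tracking the file but keep it in the working tree (assumption).
        ["git", "rm", "--cached", dataset_path],
        ["git", "commit", "-m", "Remove dataset to redo pull"],
        ["git", "push", remote, branch],
        # Add the same file back and push again.
        ["git", "add", dataset_path],
        ["git", "commit", "-m", "added the dataset back in"],
        ["git", "push", remote, branch],
    ]


plan = readd_plan(
    "data/a1/ord_dataset-a12fa15d036d489c971b0b514caeae52.pb.gz",
    "#194",
)
for argv in plan:
    # shlex.join quotes arguments such as '#194' that a shell would
    # otherwise treat as the start of a comment.
    print(shlex.join(argv))
```

A branch name beginning with `#` needs quoting when typed into a shell, which is why the printed commands show it in single quotes.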

@bdeadman
Collaborator Author

bdeadman commented Aug 5, 2024

@connorcoley @skearnes The problem with the reaction count has been resolved. How it happened may be worth a discussion at some point, but I'll see if it happens again.

This dataset should be ready for release.

@skearnes skearnes closed this Aug 14, 2024
@skearnes skearnes reopened this Aug 14, 2024
@skearnes skearnes enabled auto-merge (squash) August 14, 2024 21:44

Change summary:

| Filename | Added | Removed | Changed |
| --- | --- | --- | --- |
| data/a1/ord_dataset-a12fa15d036d489c971b0b514caeae52.pb.gz | 288 | 0 | 0 |
| | 288 | 0 | 0 |

@skearnes skearnes merged commit 0bf7868 into main Aug 14, 2024
3 of 4 checks passed
@skearnes skearnes deleted the #194 branch August 14, 2024 21:45
bdeadman added a commit that referenced this pull request Sep 26, 2024
#196)

* 1440 suzuki rxns

1440 suzuki rxns from https://doi.org/10.1038/s44160-023-00351-1.

* added first subset of reductive aminations, and added name for suzuki dataset

* Dataset from 24Golden paper Fig 7 #194 (#197)

* Uploading dataset pbtxt for 24Golden paper Fig 7 (#194)

* Uploading dataset pbtxt for 24Golden paper Fig 7

https://doi.org/10.1016/j.chempr.2024.04.001

* Add files via upload

This is the revised .pbtxt file reviewed by @bendeadman for Fig. 7 data.

Changes made:

- Elaborated the dataset description
- Added the well identifier to reactions
- added some 'details' to the reaction identifiers and the product LC measurement to give context.
- changed the provenance html to point to the doi link (should be more permanent)

* Delete 24_Golden_Fig_7_dataset.pbtxt

* Update submission

* Remove dataset to redo pull

Count reactions workflow failing so removing the dataset to redo pull.

* added the dataset back in

* Update badges

---------

Co-authored-by: spencerheins <[email protected]>
Co-authored-by: github-actions <[email protected]>

* now with all 3 dataset files including updates to suzuki and red am

* Rogue quotation mark in description - fixed

* Delete alkylation_merge.pbtxt

* Fixed tag on core_SMILES outcome

* Added dataset name and descriptor to Suzuki

---------

Co-authored-by: spencerheins <[email protected]>
Co-authored-by: github-actions <[email protected]>
bdeadman added a commit that referenced this pull request Oct 4, 2024
* Uploading dataset pbtxt for 24Golden paper Fig 7 (#194)

* Uploading dataset pbtxt for 24Golden paper Fig 7

https://doi.org/10.1016/j.chempr.2024.04.001

* Add files via upload

This is the revised .pbtxt file reviewed by @bendeadman for Fig. 7 data.

Changes made:

- Elaborated the dataset description
- Added the well identifier to reactions
- added some 'details' to the reaction identifiers and the product LC measurement to give context.
- changed the provenance html to point to the doi link (should be more permanent)

* Delete 24_Golden_Fig_7_dataset.pbtxt

* Update submission

* Remove dataset to redo pull

Count reactions workflow failing so removing the dataset to redo pull.

* added the dataset back in

* Update badges

---------

Co-authored-by: spencerheins <[email protected]>
Co-authored-by: github-actions <[email protected]>
bdeadman added a commit that referenced this pull request Oct 4, 2024
skearnes pushed a commit that referenced this pull request Nov 13, 2024
…istry (#206)

* submission 47465522 (#192)

* submission 47465522

* submission 47465522 - update details

* submission 47465522 - update pd catalyst details

* Dataset from 24Golden paper Fig 7 #194 (#197) (#205)

* Uploading dataset pbtxt for 24Golden paper Fig 7 (#194)

* Uploading dataset pbtxt for 24Golden paper Fig 7

https://doi.org/10.1016/j.chempr.2024.04.001

* Add files via upload

This is the revised .pbtxt file reviewed by @bendeadman for Fig. 7 data.

Changes made:

- Elaborated the dataset description
- Added the well identifier to reactions
- added some 'details' to the reaction identifiers and the product LC measurement to give context.
- changed the provenance html to point to the doi link (should be more permanent)

* Delete 24_Golden_Fig_7_dataset.pbtxt

* Update submission

* Remove dataset to redo pull

Count reactions workflow failing so removing the dataset to redo pull.

* added the dataset back in

* Update badges

---------

Co-authored-by: spencerheins <[email protected]>
Co-authored-by: github-actions <[email protected]>

* Revert "Dataset from 24Golden paper Fig 7 #194 (#197) (#205)"

This reverts commit 3efa3c4.

* Replace dataset with edited version

Added additional device details, and some extra details to the catalyst descriptions.

* Updated the dataset

Hopefully fixed the record created timestamp so it will pass tests.

* add dataset name and description back in

* Update record creation timestamp

Trying to avoid validation error where record created is later than record modified - possibly due to American month format!

* Update submission

* Update badges

* update file with dataset name and description

---------

Co-authored-by: Qianxiang Ai <[email protected]>
Co-authored-by: spencerheins <[email protected]>
Co-authored-by: github-actions <[email protected]>
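The "record created is later than record modified" error mentioned in the commit messages above is consistent with a day/month mix-up, which the following sketch illustrates. The date strings and the helper `created_before_modified` are hypothetical; they are not taken from the dataset or from the ORD validation code.

```python
from datetime import datetime

# Hypothetical values: the author means 11 October 2024 (day first).
created_raw = "11/10/2024"
modified = datetime(2024, 10, 12)

# The same string parsed with month-first (American) and day-first formats.
as_month_first = datetime.strptime(created_raw, "%m/%d/%Y")  # 10 November 2024
as_day_first = datetime.strptime(created_raw, "%d/%m/%Y")    # 11 October 2024


def created_before_modified(created: datetime, modified: datetime) -> bool:
    """Mimic a check that creation must not be later than modification."""
    return created <= modified


print(created_before_modified(as_month_first, modified))
print(created_before_modified(as_day_first, modified))
print(as_day_first.date().isoformat())
```

Writing timestamps in an unambiguous ISO 8601 form such as `2024-10-11` avoids the problem regardless of locale.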
4 participants