Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Des370k monomer lowest energy conformer optimization #421

Merged
merged 6 commits into from
Jan 23, 2025
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 2 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -307,6 +307,8 @@ These are currently used to find a minimum energy conformation of a molecule.
| `OpenFF Organometallics Exploratory Optimization Dataset` | [2024-12-03-OpenFF-Organometallics-Exploratory-Optimization-Dataset](https://github.com/openforcefield/qca-dataset-submission/tree/master/submissions/2024-12-03-OpenFF-Organometallics-Exploratory-Optimization-Dataset) | Optimization training data for organometallic molecules | F, P, O, C, Zn, N, Ni, Pt, S, Pd, Mg, Br, Rh, Fe, H, Cl, B, Li | |
| `OpenFF NAGL2 Training Optimization Dataset v4.0` | [2024-12-09-OpenFF-NAGL2-Training-Optimization-Dataset-v4.0](https://github.com/openforcefield/qca-dataset-submission/tree/master/submissions/2024-12-09-OpenFF-NAGL2-Training-Optimization-Dataset-v4.0) | Optimization dataset for NAGL2 training, combined and filtered | Si, B, O, I, S, Cl, N, H, C, P, F, Br | |
| `SPICE Dipeptides Lowest E Conformer Optimization Dataset v4.0` | [2025-01-08-SPICE-Dipeptides-Lowest-E-Conformer-Optimization-Dataset-v4.0](https://github.com/openforcefield/qca-dataset-submission/tree/master/submissions/2025-01-08-SPICE-Dipeptides-Lowest-E-Conformer-Optimization-Dataset-v4.0) | Optimization dataset for the lowest energy conformers of the Dipeptides subset of SPICE | H, S, C, O, N | |
| `SPICE DES370k Monomers Lowest E Conformer Optimization Dataset v4.0` | [2025-01-08-SPICE-DES370k-Monomers-Lowest-E-Conformer-Optimization-Dataset-v4.0](https://github.com/openforcefield/qca-dataset-submission/tree/master/submissions/2025-01-08-SPICE-DES370k-Monomers-Lowest-E-Conformer-Optimization-Dataset-v4.0) | Optimization dataset for the lowest energy conformer of molecules in the DES370k monomer subset of SPICE | S, O, F, P, Br, N, H, C, Cl, I | |


# TorsionDrive Datasets
These are currently used perform a complete rotation of one or more selected bonds, where optimizations are performed over a discrete set of angles.
Expand Down
Original file line number Diff line number Diff line change
@@ -0,0 +1,46 @@
# SPICE DES370k Monomers Lowest E Conformer Optimization Dataset v4.0

## Description
A dataset containing the lowest energy conformer of all molecules from the [`SPICE DES Monomers Single Points Dataset v1.1`](https://github.com/openforcefield/qca-dataset-submission/tree/master/submissions/2021-11-15-QMDataset-DES-monomers-single-points) dataset, optimized at the OpenFF default level of theory (B3LYP-D3BJ/DZVP). Detailed description on how the original dataset is generated can be found at https://github.com/openmm/qmdataset/tree/main/des370k.

## General information
* Date: 2025-01-08
* Class: OpenFF Optimization Dataset
* Purpose: Conformer optimization
* Name: SPICE DES370k Monomers Lowest E Conformer Optimization Dataset v4.0
* Number of unique molecules: 374
* Number of conformers: 374
* Number of conformers (min, mean, max): 1.00, 1.00, 1.00
* Molecular weight (min, mean, max): 16.04, 95.89, 284.78
* Charges: -1.0 0.0 1.0
* Dataset submitter: Alexandra McIsaac
* Dataset generator: Alexandra McIsaac

## QCSubmit generation pipeline
* `des370k_monomers_minEconf.json`: Dataset containing the minimum energy conformers to use as a starting point for the optimization. Needed as an input file to `generate-dataset.ipynb`
* `generate-dataset.ipynb`: Notebook used to generate dataset

## QCSubmit Manifest
* `dataset.json.bz2`: Compressed dataset ready for submission
* `dataset.pdf`: Visualization of dataset molecules
* `dataset.smi`: Smiles strings for dataset molecules
* `des370k_monomers_minEconf.json`: Dataset containing the minimum energy conformers to use as a starting point for the optimization. Needed as an input file to `generate-dataset.ipynb`
* `generate-dataset.ipynb`: Notebook used to generate dataset
* `input_environment.yaml`: Environment file used to create Python environment for the notebook
* `input_environment_full.yaml`: Fully-resolved environment used to execute the notebook.

## Metadata
* Elements: {S, O, F, P, Br, N, H, C, Cl, I}
* Spec: default-mbis
* basis: DZVP
* implicit_solvent: None
* keywords: {}
* maxiter: 200
* method: B3LYP-D3BJ
* program: psi4
* SCF properties:
* dipole
* quadrupole
* mbis_charges
* wiberg_lowdin_indices
* mayer_indices
Git LFS file not shown
Binary file not shown.
Loading
Loading