In addition to Jupyter, IPython and Python 3.6 (or higher) the following libraries are required:
- Matplotlib
- Numpy
- Pandas
- RDKit
- Scikit-Learn
- Seaborn
The monoprotic pKa dataset was derived from the raw data of ChEMBL25 and DataWarrior. The pKa calculator plugin of Marvin[1] was used for pKa prediction and for the determination of the titratable groups.
Every SDF entry has four properties:
- ID - A unique identifier over this dataset
- pKa - The pKa value retrieved from DataWarrior and ChEMBL25 (may be averaged)
- pKa_CX - The pKa value predicted by Marvin[1]
- atom - The atom index of the titratable group
Marcel Baltruschat - GitHub, E-Mail
The content of this folder is licensed under the MIT License - see the LICENSE.md file for details.
[1] Marvin 19.15.0, 2019, ChemAxon, http://www.chemaxon.com