Skip to content

Commit

Permalink
Browse files Browse the repository at this point in the history
…task into main
  • Loading branch information
marcosfelt committed Sep 10, 2021
2 parents 8d27b13 + 22c25f9 commit 0399909
Show file tree
Hide file tree
Showing 54 changed files with 3,989 additions and 1,208 deletions.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -39,7 +39,7 @@ Quantification
- [ ] [Christensen](https://chemrxiv.org/articles/preprint/Data-science_driven_autonomous_process_optimization/13146404): 1 case varying ligand, temperature, catalyst loading and ligand ratio to optimize formation of one product over two others

* C-N Cross Coupling Reaction
- [ ] [Baumgartner C-N](https://pubs.acs.org/doi/10.1021/acs.oprd.9b00236): 4 cases with varying catalysts, bases, temperature, residence time, base equivalents and optimizing yield and TON
- [x] [Baumgartner C-N](https://pubs.acs.org/doi/10.1021/acs.oprd.9b00236): 4 cases with varying catalysts, bases, temperature, residence time, base equivalents and optimizing yield and TON
- [x] [Ahneman](https://science.sciencemag.org/content/360/6385/186): HT screening 15 cases with varying additives, catalysts, bases
- [x] [Buitrago-Santinilla](https://science.sciencemag.org/content/347/6217/49): HT Screening 5 cases varying catalyst, catalyst loading, base, and base loading.
- [ ] [Bédard](https://science.sciencemag.org/content/361/6408/1220.full) - 1 case varying temperature, mol% of catlayst, and flowrates of different pumps to optimize conversion to product (see page 71 of the [SI](https://science.sciencemag.org/content/sci/suppl/2018/09/19/361.6408.1220.DC1/aat0650_Bedard_SM.pdf))
Expand Down
1 change: 1 addition & 0 deletions data/.gitignore
Original file line number Diff line number Diff line change
@@ -1 +1,2 @@
/solvent_vapor_pressures_cosmo.xlsx
/fey_ligand_database.xlsx
Binary file modified data/baumgartner_C-N/baumgartner_C-N.pb
Binary file not shown.
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
{"domain": [{"type": "CategoricalVariable", "is_objective": false, "name": "inputs[\"Catalyst\"].components[0].identifiers[0].value", "description": "Catalyst", "units": null, "levels": ["cycloPd tBuXPhos 4-Chlorotoluene", "cycloPd EPhos 4-Chlorotoluene", "cycloPd AlPhos 4-Chlorotoluene", "cycloPd tBuBrettPhos 4-Chlorotoluene"], "ds": null}, {"type": "CategoricalVariable", "is_objective": false, "name": "inputs[\"Nucleophile\"].components[0].identifiers[0].value", "description": "Nucleophile", "units": null, "levels": ["Aniline", "Benzamide", "Phenethylamine", "Morpholine"], "ds": null}, {"type": "CategoricalVariable", "is_objective": false, "name": "inputs[\"Base\"].components[0].identifiers[0].value", "description": "Base", "units": null, "levels": ["TEA", "Triethylamine", "TMG", "BTMG", "DBU", "MTBD", "BTTP", "P2Et"], "ds": null}, {"type": "CategoricalVariable", "is_objective": false, "name": "inputs[\"Solvent\"].components[0].identifiers[0].value", "description": "Solvent", "units": null, "levels": ["2-MeTHF", "DMSO"], "ds": null}, {"type": "ContinuousVariable", "is_objective": false, "name": "outcomes[0].reaction_time.value", "description": "Residence time in seconds (s)", "units": null, "bounds": [60.0, 6000.0]}, {"type": "ContinuousVariable", "is_objective": false, "name": "conditions.temperature.setpoint.value", "description": "Reactor temperature in degrees Celsius (\u00baC)", "units": null, "bounds": [30.0, 110.0]}, {"type": "ContinuousVariable", "is_objective": true, "name": "outcomes[0].products[0].measurements[0].percentage.value", "description": "Residence time in seconds (s)", "units": null, "bounds": [0.0, 105.0]}], "name": "ExperimentalEmulator", "data": {"index": [0], "columns": [["inputs[\"Catalyst\"].components[0].identifiers[0].value", "DATA"], ["inputs[\"Nucleophile\"].components[0].identifiers[0].value", "DATA"], ["inputs[\"Base\"].components[0].identifiers[0].value", "DATA"], ["inputs[\"Solvent\"].components[0].identifiers[0].value", "DATA"], ["outcomes[0].reaction_time.value", "DATA"], ["conditions.temperature.setpoint.value", "DATA"], ["outcomes[0].products[0].measurements[0].percentage.value", "DATA"], ["computation_t", "METADATA"], ["experiment_t", "METADATA"], ["strategy", "METADATA"]], "data": [["cycloPd EPhos 4-Chlorotoluene", "Aniline", "BTMG", "2-MeTHF", 1324, 65.0, 68.71260833740234, 0.0, 0.024942874908447266, NaN]]}, "experiment_params": {"model_name": "baumgartner_cn", "regressor_name": "ANNRegressor", "n_features": 20, "n_examples": 374, "descriptors_features": [], "output_variable_names": ["outcomes[0].products[0].measurements[0].percentage.value"], "predictors": [{"input_preprocessor": {"num": {"mean_": [746.8027275711743, 73.82276115132801], "var_": [369664.67038623523, 849.8593683038238], "scale_": [608.0005513042198, 29.152347560768135], "n_samples_seen_": 268}}, "output_preprocessor": {"mean_": [38.034795302147074], "var_": [1551.1531993598023], "scale_": [39.38468229349835], "n_samples_seen_": 268}}, {"input_preprocessor": {"num": {"mean_": [736.0598635478534, 73.3940519737045], "var_": [346394.2997632705, 866.8982522099044], "scale_": [588.552716214334, 29.443135909918027], "n_samples_seen_": 269}}, "output_preprocessor": {"mean_": [38.3235550597864], "var_": [1588.4836882468326], "scale_": [39.85578613259099], "n_samples_seen_": 269}}, {"input_preprocessor": {"num": {"mean_": [753.3131704791327, 73.40669142712447], "var_": [373756.4778877171, 869.5969794410122], "scale_": [611.3562610194788, 29.488929777816832], "n_samples_seen_": 269}}, "output_preprocessor": {"mean_": [37.457775622024634], "var_": [1590.6013175027522], "scale_": [39.88234343042986], "n_samples_seen_": 269}}, {"input_preprocessor": {"num": {"mean_": [746.5872545490478, 74.53085501575116], "var_": [346883.3379362028, 866.0571520821815], "scale_": [588.968027940569, 29.428848976509112], "n_samples_seen_": 269}}, "output_preprocessor": {"mean_": [39.98359246302714], "var_": [1639.9363164841907], "scale_": [40.49612717883268], "n_samples_seen_": 269}}, {"input_preprocessor": {"num": {"mean_": [795.664763581797, 75.75947952270508], "var_": [360989.5256245267, 827.417576936784], "scale_": [600.8240388204575, 28.764867059257966], "n_samples_seen_": 269}}, "output_preprocessor": {"mean_": [42.828422761955004], "var_": [1620.5798099404328], "scale_": [40.25642569752601], "n_samples_seen_": 269}}], "clip": true}, "extras": [{}, {}]}
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified data/baumgartner_C-N/op9b00236_si_002.xlsx
Binary file not shown.
1 change: 1 addition & 0 deletions data/baumgartner_C-N/results/SOBO_0.json
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
{"runner": {"num_initial_experiments": null, "max_iterations": 20, "batch_size": 1, "f_tol": 1e-05, "max_restarts": 0}, "strategy": {"name": "SOBO", "transform": {"transform_domain": [{"type": "CategoricalVariable", "is_objective": false, "name": "inputs[\"Catalyst\"].components[0].identifiers[0].value", "description": "Catalyst", "units": null, "levels": ["cycloPd tBuXPhos 4-Chlorotoluene", "cycloPd EPhos 4-Chlorotoluene", "cycloPd AlPhos 4-Chlorotoluene", "cycloPd tBuBrettPhos 4-Chlorotoluene"], "ds": null}, {"type": "CategoricalVariable", "is_objective": false, "name": "inputs[\"Nucleophile\"].components[0].identifiers[0].value", "description": "Nucleophile", "units": null, "levels": ["Aniline", "Benzamide", "Phenethylamine", "Morpholine"], "ds": null}, {"type": "CategoricalVariable", "is_objective": false, "name": "inputs[\"Base\"].components[0].identifiers[0].value", "description": "Base", "units": null, "levels": ["TEA", "Triethylamine", "TMG", "BTMG", "DBU", "MTBD", "BTTP", "P2Et"], "ds": null}, {"type": "CategoricalVariable", "is_objective": false, "name": "inputs[\"Solvent\"].components[0].identifiers[0].value", "description": "Solvent", "units": null, "levels": ["2-MeTHF", "DMSO"], "ds": null}, {"type": "ContinuousVariable", "is_objective": false, "name": "outcomes[0].reaction_time.value", "description": "Residence time in seconds (s)", "units": null, "bounds": [60.0, 6000.0]}, {"type": "ContinuousVariable", "is_objective": false, "name": "conditions.temperature.setpoint.value", "description": "Reactor temperature in degrees Celsius (\u00baC)", "units": null, "bounds": [30.0, 110.0]}, {"type": "ContinuousVariable", "is_objective": true, "name": "outcomes[0].products[0].measurements[0].percentage.value", "description": "Residence time in seconds (s)", "units": null, "bounds": [0.0, 105.0]}], "name": "Transform", "domain": [{"type": "CategoricalVariable", "is_objective": false, "name": "inputs[\"Catalyst\"].components[0].identifiers[0].value", "description": "Catalyst", "units": null, "levels": ["cycloPd tBuXPhos 4-Chlorotoluene", "cycloPd EPhos 4-Chlorotoluene", "cycloPd AlPhos 4-Chlorotoluene", "cycloPd tBuBrettPhos 4-Chlorotoluene"], "ds": null}, {"type": "CategoricalVariable", "is_objective": false, "name": "inputs[\"Nucleophile\"].components[0].identifiers[0].value", "description": "Nucleophile", "units": null, "levels": ["Aniline", "Benzamide", "Phenethylamine", "Morpholine"], "ds": null}, {"type": "CategoricalVariable", "is_objective": false, "name": "inputs[\"Base\"].components[0].identifiers[0].value", "description": "Base", "units": null, "levels": ["TEA", "Triethylamine", "TMG", "BTMG", "DBU", "MTBD", "BTTP", "P2Et"], "ds": null}, {"type": "CategoricalVariable", "is_objective": false, "name": "inputs[\"Solvent\"].components[0].identifiers[0].value", "description": "Solvent", "units": null, "levels": ["2-MeTHF", "DMSO"], "ds": null}, {"type": "ContinuousVariable", "is_objective": false, "name": "outcomes[0].reaction_time.value", "description": "Residence time in seconds (s)", "units": null, "bounds": [60.0, 6000.0]}, {"type": "ContinuousVariable", "is_objective": false, "name": "conditions.temperature.setpoint.value", "description": "Reactor temperature in degrees Celsius (\u00baC)", "units": null, "bounds": [30.0, 110.0]}, {"type": "ContinuousVariable", "is_objective": true, "name": "outcomes[0].products[0].measurements[0].percentage.value", "description": "Residence time in seconds (s)", "units": null, "bounds": [0.0, 105.0]}], "transform_params": {}}, "strategy_params": {"prev_param": [[[2.0, 2.0, 1.0, 1.0, 399.0482197121444, 56.49147371770552], [2.0, 0.0, 1.0, 0.0, 5414.154277163791, 42.280958054266016], [2.0, 1.0, 0.0, 0.0, 1234.2344944494523, 50.32158545595374], [1.0, 2.0, 5.0, 1.0, 3190.2882414082683, 63.333951597200226], [1.0, 3.0, 4.0, 1.0, 4392.097900353618, 95.17589818248152], [2.0, 0.0, 5.0, 1.0, 728.1995418633819, 106.88469793990654], [2.0, 0.0, 7.0, 0.0, 1700.148564326674, 71.51446526963073], [2.0, 0.0, 5.0, 1.0, 2323.9572067700406, 77.90178887642749], [2.0, 0.0, 1.0, 1.0, 4532.255891885944, 69.88644060594295], [2.0, 0.0, 0.0, 0.0, 215.40219615376333, 57.842070376245445], [2.0, 0.0, 4.0, 1.0, 1280.0203078830948, 37.691391309954064], [2.0, 0.0, 3.0, 1.0, 2863.890642678816, 98.72493909290984], [2.0, 0.0, 5.0, 1.0, 2667.379360942511, 43.635876298831555], [2.0, 0.0, 0.0, 1.0, 3563.7693118193156, 109.9752194849526], [2.0, 0.0, 4.0, 1.0, 4609.432262508466, 30.02511609744029], [2.0, 0.0, 5.0, 0.0, 5650.123783417311, 40.62168237073279], [2.0, 0.0, 1.0, 0.0, 5683.12127316485, 73.43528074637385], [2.0, 0.0, 0.0, 0.0, 3647.7256074465645, 70.87152706633289], [2.0, 0.0, 1.0, 1.0, 2561.460111178853, 84.50880645842187]], [[-26.016122817993164], [-66.72112274169922], [-3.6456189155578613], [-10.622346878051758], [-0.0], [-105.0], [-96.91004943847656], [-104.06868743896484], [-78.92799377441406], [-17.2855224609375], [-67.73985290527344], [-104.14290618896484], [-96.22299194335938], [-37.79090118408203], [-61.27240753173828], [-72.12566375732422], [-67.47439575195312], [-34.51268768310547], [-92.35279846191406]]], "use_descriptors": false, "gp_model_type": "GP", "acquisition_type": "EI", "optimizer_type": "lbfgs", "evaluator_type": "random", "kernel": {"input_dim": 6, "active_dims": [0, 1, 2, 3, 4, 5], "name": "Mat52", "useGPU": false, "variance": [1.2206187567048152], "lengthscale": [1.1533204545350713], "ARD": false, "class": "GPy.kern.Matern52"}, "exact_feval": false, "ARD": true, "standardize_outputs": true}}, "experiment": {"domain": [{"type": "CategoricalVariable", "is_objective": false, "name": "inputs[\"Catalyst\"].components[0].identifiers[0].value", "description": "Catalyst", "units": null, "levels": ["cycloPd tBuXPhos 4-Chlorotoluene", "cycloPd EPhos 4-Chlorotoluene", "cycloPd AlPhos 4-Chlorotoluene", "cycloPd tBuBrettPhos 4-Chlorotoluene"], "ds": null}, {"type": "CategoricalVariable", "is_objective": false, "name": "inputs[\"Nucleophile\"].components[0].identifiers[0].value", "description": "Nucleophile", "units": null, "levels": ["Aniline", "Benzamide", "Phenethylamine", "Morpholine"], "ds": null}, {"type": "CategoricalVariable", "is_objective": false, "name": "inputs[\"Base\"].components[0].identifiers[0].value", "description": "Base", "units": null, "levels": ["TEA", "Triethylamine", "TMG", "BTMG", "DBU", "MTBD", "BTTP", "P2Et"], "ds": null}, {"type": "CategoricalVariable", "is_objective": false, "name": "inputs[\"Solvent\"].components[0].identifiers[0].value", "description": "Solvent", "units": null, "levels": ["2-MeTHF", "DMSO"], "ds": null}, {"type": "ContinuousVariable", "is_objective": false, "name": "outcomes[0].reaction_time.value", "description": "Residence time in seconds (s)", "units": null, "bounds": [60.0, 6000.0]}, {"type": "ContinuousVariable", "is_objective": false, "name": "conditions.temperature.setpoint.value", "description": "Reactor temperature in degrees Celsius (\u00baC)", "units": null, "bounds": [30.0, 110.0]}, {"type": "ContinuousVariable", "is_objective": true, "name": "outcomes[0].products[0].measurements[0].percentage.value", "description": "Residence time in seconds (s)", "units": null, "bounds": [0.0, 105.0]}], "name": "ExperimentalEmulator", "data": {"index": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19], "columns": [["inputs[\"Catalyst\"].components[0].identifiers[0].value", "DATA"], ["inputs[\"Nucleophile\"].components[0].identifiers[0].value", "DATA"], ["inputs[\"Base\"].components[0].identifiers[0].value", "DATA"], ["inputs[\"Solvent\"].components[0].identifiers[0].value", "DATA"], ["outcomes[0].reaction_time.value", "DATA"], ["conditions.temperature.setpoint.value", "DATA"], ["outcomes[0].products[0].measurements[0].percentage.value", "DATA"], ["computation_t", "METADATA"], ["experiment_t", "METADATA"], ["strategy", "METADATA"]], "data": [["cycloPd AlPhos 4-Chlorotoluene", "Phenethylamine", "Triethylamine", "DMSO", 399.0482197121444, 56.49147371770552, 26.016122817993164, 0.0, 0.04274415969848633, "Single-objective BayOpt"], ["cycloPd AlPhos 4-Chlorotoluene", "Aniline", "Triethylamine", "2-MeTHF", 5414.154277163791, 42.280958054266016, 66.72112274169922, 0.18682599067687988, 0.03174996376037598, "Single-objective BayOpt"], ["cycloPd AlPhos 4-Chlorotoluene", "Benzamide", "TEA", "2-MeTHF", 1234.2344944494523, 50.32158545595374, 3.6456189155578613, 0.19696807861328125, 0.02604198455810547, "Single-objective BayOpt"], ["cycloPd EPhos 4-Chlorotoluene", "Phenethylamine", "MTBD", "DMSO", 3190.2882414082683, 63.333951597200226, 10.622346878051758, 0.24962091445922852, 0.030314922332763672, "Single-objective BayOpt"], ["cycloPd EPhos 4-Chlorotoluene", "Morpholine", "DBU", "DMSO", 4392.097900353618, 95.17589818248152, 0.0, 0.264434814453125, 0.03364920616149902, "Single-objective BayOpt"], ["cycloPd AlPhos 4-Chlorotoluene", "Aniline", "MTBD", "DMSO", 728.1995418633819, 106.88469793990654, 105.0, 0.3412632942199707, 0.039507150650024414, "Single-objective BayOpt"], ["cycloPd AlPhos 4-Chlorotoluene", "Aniline", "P2Et", "2-MeTHF", 1700.148564326674, 71.51446526963073, 96.91004943847656, 0.35016894340515137, 0.04041600227355957, "Single-objective BayOpt"], ["cycloPd AlPhos 4-Chlorotoluene", "Aniline", "MTBD", "DMSO", 2323.9572067700406, 77.90178887642749, 104.06868743896484, 0.344677209854126, 0.04823803901672363, "Single-objective BayOpt"], ["cycloPd AlPhos 4-Chlorotoluene", "Aniline", "Triethylamine", "DMSO", 4532.255891885944, 69.88644060594295, 78.92799377441406, 0.3300302028656006, 0.03768587112426758, "Single-objective BayOpt"], ["cycloPd AlPhos 4-Chlorotoluene", "Aniline", "TEA", "2-MeTHF", 215.40219615376333, 57.842070376245445, 17.2855224609375, 0.3462550640106201, 0.034728050231933594, "Single-objective BayOpt"], ["cycloPd AlPhos 4-Chlorotoluene", "Aniline", "DBU", "DMSO", 1280.0203078830948, 37.691391309954064, 67.73985290527344, 0.471066951751709, 0.044042110443115234, "Single-objective BayOpt"], ["cycloPd AlPhos 4-Chlorotoluene", "Aniline", "BTMG", "DMSO", 2863.890642678816, 98.72493909290984, 104.14290618896484, 0.3572359085083008, 0.044190168380737305, "Single-objective BayOpt"], ["cycloPd AlPhos 4-Chlorotoluene", "Aniline", "MTBD", "DMSO", 2667.379360942511, 43.635876298831555, 96.22299194335938, 0.2974679470062256, 0.033499956130981445, "Single-objective BayOpt"], ["cycloPd AlPhos 4-Chlorotoluene", "Aniline", "TEA", "DMSO", 3563.7693118193156, 109.9752194849526, 37.79090118408203, 0.31289005279541016, 0.03141593933105469, "Single-objective BayOpt"], ["cycloPd AlPhos 4-Chlorotoluene", "Aniline", "DBU", "DMSO", 4609.432262508466, 30.02511609744029, 61.27240753173828, 0.2619178295135498, 0.03030705451965332, "Single-objective BayOpt"], ["cycloPd AlPhos 4-Chlorotoluene", "Aniline", "MTBD", "2-MeTHF", 5650.123783417311, 40.62168237073279, 72.12566375732422, 0.3048079013824463, 0.043280839920043945, "Single-objective BayOpt"], ["cycloPd AlPhos 4-Chlorotoluene", "Aniline", "Triethylamine", "2-MeTHF", 5683.12127316485, 73.43528074637385, 67.47439575195312, 0.2837979793548584, 0.02958393096923828, "Single-objective BayOpt"], ["cycloPd AlPhos 4-Chlorotoluene", "Aniline", "TEA", "2-MeTHF", 3647.7256074465645, 70.87152706633289, 34.51268768310547, 0.26454877853393555, 0.034379005432128906, "Single-objective BayOpt"], ["cycloPd AlPhos 4-Chlorotoluene", "Aniline", "Triethylamine", "DMSO", 2561.460111178853, 84.50880645842187, 92.35279846191406, 0.2748081684112549, 0.028705120086669922, "Single-objective BayOpt"], ["cycloPd AlPhos 4-Chlorotoluene", "Aniline", "TEA", "2-MeTHF", 655.9685652776594, 31.391605878453596, 21.527576446533203, 0.28366804122924805, 0.02661871910095215, "Single-objective BayOpt"]]}, "experiment_params": {"model_name": "baumgartner_cn", "regressor_name": "ANNRegressor", "n_features": 20, "n_examples": 374, "descriptors_features": [], "output_variable_names": ["outcomes[0].products[0].measurements[0].percentage.value"], "predictors": [{"input_preprocessor": {"num": {"mean_": [746.8027275711743, 73.82276115132801], "var_": [369664.67038623523, 849.8593683038238], "scale_": [608.0005513042198, 29.152347560768135], "n_samples_seen_": 268}}, "output_preprocessor": {"mean_": [38.034795302147074], "var_": [1551.1531993598023], "scale_": [39.38468229349835], "n_samples_seen_": 268}}, {"input_preprocessor": {"num": {"mean_": [736.0598635478534, 73.3940519737045], "var_": [346394.2997632705, 866.8982522099044], "scale_": [588.552716214334, 29.443135909918027], "n_samples_seen_": 269}}, "output_preprocessor": {"mean_": [38.3235550597864], "var_": [1588.4836882468326], "scale_": [39.85578613259099], "n_samples_seen_": 269}}, {"input_preprocessor": {"num": {"mean_": [753.3131704791327, 73.40669142712447], "var_": [373756.4778877171, 869.5969794410122], "scale_": [611.3562610194788, 29.488929777816832], "n_samples_seen_": 269}}, "output_preprocessor": {"mean_": [37.457775622024634], "var_": [1590.6013175027522], "scale_": [39.88234343042986], "n_samples_seen_": 269}}, {"input_preprocessor": {"num": {"mean_": [746.5872545490478, 74.53085501575116], "var_": [346883.3379362028, 866.0571520821815], "scale_": [588.968027940569, 29.428848976509112], "n_samples_seen_": 269}}, "output_preprocessor": {"mean_": [39.98359246302714], "var_": [1639.9363164841907], "scale_": [40.49612717883268], "n_samples_seen_": 269}}, {"input_preprocessor": {"num": {"mean_": [795.664763581797, 75.75947952270508], "var_": [360989.5256245267, 827.417576936784], "scale_": [600.8240388204575, 28.764867059257966], "n_samples_seen_": 269}}, "output_preprocessor": {"mean_": [42.828422761955004], "var_": [1620.5798099404328], "scale_": [40.25642569752601], "n_samples_seen_": 269}}], "clip": true}, "extras": [{}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {}]}}
Loading

0 comments on commit 0399909

Please sign in to comment.