Enantioselective catalytic reactions have a significant impact on chemical synthesis, and they are important components in an experimental chemist's toolbox. However, development of asymmetric catalysts often relies on the chemical intuition and experience of a synthetic chemist, making the process both time-consuming and resource-intensive. The machine-learning-assisted reaction discovery can serve as a very efficient platform for obtaining high-performing catalysts in a time-economical manner without extensive experimentation. Herein, we report a data-driven and machine learning method for reliably predicting enantiomeric excess (%ee) of 211 asymmetric Pauson-Khand reactions (PKR 1-PKR 211) between a variety of 45 unique 1,6-enyne substrates and 12 unique axially chiral biaryl ligands in the presence of different reaction conditions like varying CO gas pressure, temperature, and solvent polarity. Four different machine learning algorithms have been studied: extreme gradient boosting (XGBoost), random forest (RF), light gradient boosting machine (LGBM), and neural network (NN). A fivefold cross validation method was applied to our k-means SMOTE-augmented data set to obtain the optimized hyperparameters for the training set, and subsequently, these parameters were used in the test data set. In the case of the out-of-box set, the XGBoost method is found to be superior among all four machine learning methods investigated. Our out-of-box samples contain a total of 12 unique asymmetric Pauson-Khand reactions (PKR 212-PKR 223) arising from three new 1,3-benzodioxole-based SEGPHOS catalysts, which were never included in the training set. The XGBoost algorithm shows an impressive root mean square error (RMSE) of 7.06 (±1.11) in predicting %ee. The XGBoost-predicted %ee values match reasonably well with the experimental results. The absolute difference between the experimental and XGBoost-calculated %ee values ranges from 0.9 to 7.6 for the majority of the out-of-box Pauson-Khand reactions. The reactions with fluoro-substituted-SEGPHOS ligand shows smaller deviations from the experimental %ee values compared to the reactions with and catalysts where the benzodioxole units do not have fluorine atoms. Finally, we have discovered a library of 3357 lead reactions with excellent %ee (≥99) by engaging the experimentally unknown combinations of the catalysts, substrates, and reaction conditions. The axially chiral biaryl catalysts and enyne substrates present in the library are synthetically accessible. The ligand space in the library is dominated by the presence of tol-BINAP and the DTBM-OMe-BIPHEP ligands. The substrate space is predominantly occupied by NTs-tethered, O-tethered, NBn-tethered, and C(COMe)-tethered 1,6-enynes that have an H or methyl functional group present in the alkyne unit. Our newly discovered library assists a synthetic chemist to develop a highly enantioselective PKR by starting with knowledge without extensive trial-and-error experimentation.

Download full-text PDF

Source
http://dx.doi.org/10.1021/acs.jpca.4c06701DOI Listing

Publication Analysis

Top Keywords

pauson-khand reactions
16
asymmetric pauson-khand
12
machine learning
12
%ee values
12
reactions
8
synthetic chemist
8
reactions pkr
8
axially chiral
8
chiral biaryl
8
reaction conditions
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!