Here we present a machine learning model trained on electron density for the production of host-guest binders. These are read out as simplified molecular-input line-entry system (SMILES) format with >98% accuracy, enabling a complete characterization of the molecules in two dimensions. Our model generates three-dimensional representations of the electron density and electrostatic potentials of host-guest systems using a variational autoencoder, and then utilizes these representations to optimize the generation of guests via gradient descent. Finally the guests are converted to SMILES using a transformer. The successful practical application of our model to established molecular host systems, cucurbit[n]uril and metal-organic cages, resulted in the discovery of 9 previously validated guests for CB[6] and 7 unreported guests (with association constant K ranging from 13.5 M to 5,470 M) and the discovery of 4 unreported guests for [Pd1] (with K ranging from 44 M to 529 M).
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10965440 | PMC |
http://dx.doi.org/10.1038/s43588-024-00602-x | DOI Listing |
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!