Here we present a machine learning model trained on electron density for the production of host-guest binders. These are read out as simplified molecular-input line-entry system (SMILES) format with >98% accuracy, enabling a complete characterization of the molecules in two dimensions. Our model generates three-dimensional representations of the electron density and electrostatic potentials of host-guest systems using a variational autoencoder, and then utilizes these representations to optimize the generation of guests via gradient descent. Finally the guests are converted to SMILES using a transformer. The successful practical application of our model to established molecular host systems, cucurbit[n]uril and metal-organic cages, resulted in the discovery of 9 previously validated guests for CB[6] and 7 unreported guests (with association constant K ranging from 13.5 M to 5,470 M) and the discovery of 4 unreported guests for [Pd1] (with K ranging from 44 M to 529 M).

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10965440PMC
http://dx.doi.org/10.1038/s43588-024-00602-xDOI Listing

Publication Analysis

Top Keywords

host-guest binders
8
electron density
8
unreported guests
8
guests
5
electron density-based
4
density-based gpt
4
gpt optimization
4
optimization suggestion
4
suggestion host-guest
4
binders machine
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!