ChemSpaceAL: An Efficient Active Learning Methodology Applied to Protein-Specific Molecular Generation.

J Chem Inf Model

Department of Chemistry, Yale University, New Haven, Connecticut 06511-8499, United States.

Published: February 2024

The incredible capabilities of generative artificial intelligence models have inevitably led to their application in the domain of drug discovery. Within this domain, the vastness of chemical space motivates the development of more efficient methods for identifying regions with molecules that exhibit desired characteristics. In this work, we present a computationally efficient active learning methodology and demonstrate its applicability to targeted molecular generation. When applied to c-Abl kinase, a protein with FDA-approved small-molecule inhibitors, the model learns to generate molecules similar to the inhibitors without prior knowledge of their existence and even reproduces two of them exactly. We also show that the methodology is effective for a protein without any commercially available small-molecule inhibitors, the HNH domain of the CRISPR-associated protein 9 (Cas9) enzyme. To facilitate implementation and reproducibility, we made all of our software available through the open-source ChemSpaceAL Python package.

Source
http://dx.doi.org/10.1021/acs.jcim.3c01456
