Designing chemically novel and synthesizable ligands from the largest possible chemical space is a major issue in modern drug discovery to identify early hits that are easily amenable to medicinal chemistry optimization. Starting from the sole three-dimensional structure of a protein binding site, we herewith describe a fully automated active learning protocol to propose the commercial chemical reagents and one-step organic chemistry reactions necessary to enumerate target-specific primary hits from ultralarge chemical spaces. When applied in different scenarios (single transform and multiple transforms) addressing chemical spaces of various sizes (from 670 million to 4.5 billion compounds), the method was able to recover up to 98% of virtual hits discovered by an exhaustive docking-based approach while scanning only 5% of the full chemical space. It is therefore applicable to the structure-based screening of trillion-sized chemical spaces at a very high throughput with minimal computational resources.

Download full-text PDF

Source
http://dx.doi.org/10.1021/acs.jcim.4c02097DOI Listing

Publication Analysis

Top Keywords

chemical spaces
16
active learning
8
reagents one-step
8
one-step organic
8
organic chemistry
8
chemistry reactions
8
hits ultralarge
8
ultralarge chemical
8
chemical space
8
chemical
7

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!