We introduce an exploratory active learning (AL) algorithm using Gaussian process regression and marginalized graph kernel (GPR-MGK) to sample chemical compound space (CCS) at minimal cost. Targeting 251,728 enumerated alkane molecules with 4-19 carbon atoms, we applied the AL algorithm to select a diverse and representative set of molecules and then conducted high-throughput molecular simulations on these selected molecules. To demonstrate the power of the AL algorithm, we built directed message-passing neural networks (D-MPNN) using simulation data as the training set to predict liquid densities, heat capacities, and vaporization enthalpies of the CCS. Validations show that D-MPNN models built on the smallest training set considered in this work, which consists of 313 molecules or 0.124% of the original CCS, predict the properties with > 0.99 against the computational data and > 0.94 against the experimental data. The advantage of the presented AL algorithm is that the predicted uncertainty of GPR depends on only the molecular structures, which renders it compatible with high-throughput data generation.

Download full-text PDF

Source
http://dx.doi.org/10.1021/acs.jcim.3c01430DOI Listing

Publication Analysis

Top Keywords

chemical compound
8
compound space
8
active learning
8
alkane molecules
8
training set
8
molecules
5
efficient exploration
4
exploration chemical
4
space active
4
learning prediction
4

Similar Publications

A novel ternary boride, NiPtB ( = 0.5), was obtained by argon-arc melting of the elements followed by annealing at 750 °C. It exhibits a new structure type with the space group ( = 2.

View Article and Find Full Text PDF

Highly Optimized CNS Penetrant Inhibitors of EGFR Exon20 Insertion Mutations.

J Med Chem

January 2025

Pharmaron Beijing Co., Ltd., 6 Taihe Road, BDA, Beijing 100176, P. R. China.

Despite recent advances in the inhibition of EGFR (epidermal growth factor receptor), there remains a clinical need for new EGFR Exon20 insertion (Ex20Ins) inhibitors that spare EGFR WT. Herein, we report the discovery and optimization of two chemical series leading to ether and biaryl as potent, selective, and brain-penetrant inhibitors of Ex20Ins mutants. Building on our earlier discovery of alkyne which allowed access to CNS property space for an Ex20Ins inhibitor, we utilized structure-based design to move to lower lipophilicity and lower CL compounds while maintaining a WT selectivity margin.

View Article and Find Full Text PDF

Although banned long ago in many countries and jurisdictions, the organochlorine pesticide dichlorodiphenyltrichloroethane (DDT) and compounds related to it remain in the aquatic environment, particularly in sediments, and can pose risks to aquatic life. To inform ecological risk assessment of these compounds, we tested the toxicity of six DDT congeners, specifically the p, p' (4,4') forms of DDT, dichlorodiphenyldichloroethylene (DDE), dichlorodiphenyldichloroethane (DDD), and dichlorodiphenylchloroethylene (DDMU), as well as the o, p' (2,4') isomers of DDT and DDD. The epibenthic amphipod, Hyalella azteca, was exposed for 7 days to waterborne chemical and assessed for changes in survival and growth.

View Article and Find Full Text PDF

This study presents T-1-NBAB, a new compound derived from the natural xanthine alkaloid theobromine, aimed at inhibiting VEGFR-2, a crucial protein in angiogenesis. T-1-NBAB's potential to interacts with and inhibit the VEGFR-2 was indicated using in silico techniques like molecular docking, MD simulations, MM-GBSA, PLIP, essential dynamics, and bi-dimensional projection experiments. DFT experiments was utilized also to study the structural and electrostatic properties of T-1-NBAB.

View Article and Find Full Text PDF

Inorganic photochromic materials offer several advantages over organic compounds, including relatively inexpensive and higher thermal stability. However, tuning their color with the same component has remained a significant challenge. In this study, we demonstrate that the photochromic color of Cu-doped ZnS nanocrystals (NCs), which is initially pale yellow before light irradiation, can be tuned from gray to brown by adjusting the surface stoichiometry of Zn and S, which is controlled through the use of thiol and non-thiol ligands.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!