Finding the most potent compounds using active learning on molecular pairs.

Beilstein J Org Chem

Department of Biomedical Engineering, Duke University, Durham, NC 27708, USA.

Published: August 2024

Active learning allows algorithms to steer iterative experimentation to accelerate and de-risk molecular optimizations, but actively trained models might still exhibit poor performance during early project stages where the training data is limited and model exploitation might lead to analog identification with limited scaffold diversity. Here, we present ActiveDelta, an adaptive approach that leverages paired molecular representations to predict improvements from the current best training compound to prioritize further data acquisition. We apply the ActiveDelta concept to both graph-based deep (Chemprop) and tree-based (XGBoost) models during exploitative active learning for 99 Ki benchmarking datasets. We show that both ActiveDelta implementations excel at identifying more potent inhibitors compared to the standard exploitative active learning implementations of Chemprop, XGBoost, and Random Forest. The ActiveDelta approach is also able to identify more chemically diverse inhibitors in terms of their Murcko scaffolds. Finally, deep models such as Chemprop trained on data selected through ActiveDelta approaches can more accurately identify inhibitors in test data created through simulated time-splits. Overall, this study highlights the large potential for molecular pairing approaches to further improve popular active learning strategies in low data regimes by enabling faster and more accurate identification of more diverse molecular hits against critical drug targets.
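The selection step described in the abstract (train on molecular pairs, then rank candidates by their predicted improvement over the current best training compound) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy k-nearest-neighbour regressor stands in for Chemprop or XGBoost, the short numeric lists stand in for molecular representations, and the pairing scheme (all ordered training pairs, concatenated features) and the function names `make_pairs`, `predict_delta`, and `select_next` are assumptions made for illustration.

```python
from itertools import product
from math import dist


def make_pairs(features, potencies):
    """Build paired training examples from all ordered compound pairs.

    Assumption: a pair is represented by concatenating the two feature
    lists, and its label is the potency difference (second minus first).
    """
    labelled = list(zip(features, potencies))
    return [(xa + xb, yb - ya) for (xa, ya), (xb, yb) in product(labelled, repeat=2)]


def predict_delta(pairs, query, k=3):
    """Toy k-NN regressor (a stand-in for Chemprop/XGBoost).

    Averages the potency differences of the k training pairs whose
    concatenated features lie closest to the query pair.
    """
    nearest = sorted(pairs, key=lambda pair: dist(pair[0], query))[:k]
    return sum(delta for _, delta in nearest) / len(nearest)


def select_next(train_x, train_y, pool_x, k=3):
    """Return the index of the pool compound with the largest predicted
    improvement over the most potent compound in the training set."""
    pairs = make_pairs(train_x, train_y)
    best_idx = max(range(len(train_y)), key=train_y.__getitem__)
    best_x = train_x[best_idx]
    # Pair the current best compound with every unlabelled candidate.
    scores = [predict_delta(pairs, best_x + cand, k) for cand in pool_x]
    return max(range(len(pool_x)), key=scores.__getitem__)


# Hypothetical toy data: one descriptor per compound, potency rising with it.
train_x = [[0.0], [1.0], [2.0]]
train_y = [5.0, 6.0, 7.0]
pool_x = [[0.5], [3.0], [1.5]]
chosen = select_next(train_x, train_y, pool_x)
```

In an exploitative active learning loop, the chosen compound would be measured, moved from the pool into the training set, and the pairing and selection repeated. A standard (unpaired) implementation would instead predict each candidate's absolute potency and pick the maximum.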

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11368049
DOI: http://dx.doi.org/10.3762/bjoc.20.185
