Machine Learning-Driven Data Valuation for Optimizing High-Throughput Screening Pipelines.

J Chem Inf Model

Technical University of Munich, TUM School of Natural Sciences, Department of Bioscience, Center for Functional Protein Assemblies (CPA), 85748 Garching bei München, Germany.

Published: November 2024

In the rapidly evolving field of drug discovery, high-throughput screening (HTS) is essential for identifying bioactive compounds. This study introduces a novel application of data valuation, a concept for evaluating the importance of data points based on their impact, to enhance drug discovery pipelines. Our approach improves active learning for compound library screening, robustly identifies true and false positives in HTS data, and identifies important inactive samples in imbalanced HTS training data, all while accounting for computational efficiency. We demonstrate that importance-based methods enable more effective batch screening, reducing the need for extensive HTS. Machine learning models accurately differentiate true biological activity from assay artifacts, streamlining the drug discovery process. Additionally, importance undersampling aids in HTS data set balancing, improving machine learning performance without omitting crucial inactive samples. These advancements could significantly enhance the efficiency and accuracy of drug development.
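
The notion of importance described above can be illustrated with a small sketch. It is not the study's implementation: the abstract does not say which valuation functions or models were used, so leave-one-out valuation with a 1-nearest-neighbor classifier is swapped in here as the simplest stand-in. All names (`loo_values`, `importance_undersample`, the toy `TRAIN` and `VALID` sets) are hypothetical. A sample's value is taken to be the drop in validation accuracy when it is removed, so a mislabeled active (a possible false positive) tends to score negative, while inactives near the decision boundary tend to score highest.

```python
from typing import List, Sequence, Tuple

# (feature vector, label) with label 1 = active, 0 = inactive
Sample = Tuple[Sequence[float], int]


def predict_1nn(train: List[Sample], x: Sequence[float]) -> int:
    """Return the label of the training sample closest to x (squared Euclidean)."""
    nearest = min(train, key=lambda s: sum((a - b) ** 2 for a, b in zip(s[0], x)))
    return nearest[1]


def accuracy(train: List[Sample], valid: List[Sample]) -> float:
    """Fraction of validation samples that a 1-NN model on `train` labels correctly."""
    if not train:
        return 0.0
    return sum(predict_1nn(train, x) == y for x, y in valid) / len(valid)


def loo_values(train: List[Sample], valid: List[Sample]) -> List[float]:
    """Leave-one-out value of each sample: accuracy with it minus accuracy without it."""
    base = accuracy(train, valid)
    return [base - accuracy(train[:i] + train[i + 1:], valid) for i in range(len(train))]


def importance_undersample(
    train: List[Sample], values: List[float], n_inactive: int
) -> List[Sample]:
    """Keep all actives and only the n_inactive highest-valued inactives."""
    actives = [i for i, (_, y) in enumerate(train) if y == 1]
    inactives = [i for i, (_, y) in enumerate(train) if y == 0]
    inactives.sort(key=lambda i: values[i], reverse=True)
    keep = sorted(actives + inactives[:n_inactive])
    return [train[i] for i in keep]


# Toy 1-D "screen" (assumed data): inactives cluster near 0, actives near 1.
TRAIN: List[Sample] = [
    ([0.0], 0), ([0.1], 0), ([0.2], 0),
    ([0.6], 0),             # inactive close to the active region
    ([0.9], 1), ([1.0], 1),
    ([0.15], 1),            # labeled active but located among inactives
]
VALID: List[Sample] = [([0.04], 0), ([0.16], 0), ([0.5], 0), ([0.7], 0), ([0.97], 1)]

values = loo_values(TRAIN, VALID)
suspect_false_positives = [i for i, (_, y) in enumerate(TRAIN) if y == 1 and values[i] < 0]
balanced = importance_undersample(TRAIN, values, n_inactive=1)
```

On this toy data the inactive at 0.6 receives the highest value and is the one kept by undersampling, while the active at 0.15 receives a negative value and is flagged as a suspected false positive. Leave-one-out retrains once per sample, which is only practical for small sets; that cost is one reason computational efficiency matters when choosing a valuation method.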

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11558681
DOI: http://dx.doi.org/10.1021/acs.jcim.4c01547

Similar Publications

Leveraging Structural and Computational Biology for Molecular Glue Discovery.

J Med Chem

January 2025

Experimental Drug Development Centre, Chromos, Agency for Science, Technology and Research, 10 Biopolis Road, #05-01, Singapore 138670.

The discovery of molecular glues has made significant strides, unlocking new avenues for targeted protein degradation as a therapeutic strategy, thereby expanding the scope of drug discovery into territories previously considered undruggable. Pioneering molecules like thalidomide and its derivatives have paved the way for the development of small molecules that can induce specific protein degradation by hijacking the cellular ubiquitin-proteasome system. Recent advancements have focused on expanding the range of E3 ligases and target proteins that can be modulated by molecular glues.

Semi-Synthesis of Dimeric Cannabidiol Derivatives and Evaluation of their Affinity at Neurological Targets.

J Nat Prod

January 2025

Department of Drug Discovery and Biomedical Sciences, College of Pharmacy, University of South Carolina, Columbia, South Carolina 29208, United States.

Cannabidiol (CBD) is a natural product associated with a wide range of biological and therapeutic activities. Despite the widespread cultural acceptance of CBD as a medicinal agent, much remains to be determined regarding its precise mechanism(s) of action in treating multiple conditions. CBD has been shown to promiscuously interact with several neurological targets with varying affinities.

Targeting Protein-Protein Interactions in Hematologic Malignancies.

Annu Rev Pathol

January 2025

Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA.

Over the last two decades, there have been extensive efforts to develop small-molecule inhibitors of protein-protein interactions (PPIs) as novel therapeutics for cancer, including hematologic malignancies. Despite the numerous challenges associated with developing PPI inhibitors, a significant number of them have advanced to clinical studies in hematologic patients in recent years. The US Food and Drug Administration approval of the very first PPI inhibitor, venetoclax, demonstrated the real clinical value of blocking protein-protein interfaces.

Human African trypanosomiasis (HAT) is one of the most lethal of the neglected tropical diseases. While the discovery of a novel antitrypanosomal drug is highly desired, the creation of a superior lead compound is challenging. Herein we report ukabamide (), which was isolated from a marine sp.

Background: The breakthrough discovery of novel biomarkers with prognostic and diagnostic value enables timely medical intervention for the survival of patients diagnosed with gastric cancer (GC). Typically, in studies focused on biomarker analysis, highly connected nodes (hubs) within the protein-protein interaction network (PPIN) are proposed as potential biomarkers. However, this study revealed an unexpected finding following the clustering of network nodes.
