The rapid expansion of readily accessible compounds over the past six years has transformed molecular docking, improving hit rates and affinities. While many millions of molecules may score well in a docking campaign, the results are rarely fully shared, hindering the benchmarking of machine learning and chemical space exploration methods that seek to explore the expanding chemical spaces. To address this gap, we develop a website providing access to recent large library campaigns, including poses, scores, and results for campaigns against 11 targets, with 6.3 billion molecules docked and 3729 compounds experimentally tested. In a simple proof-of-concept study that speaks to the new library's utility, we use the new database to train machine learning models to predict docking scores and to find the top 0.01% scoring molecules while evaluating only 1% of the library. Even in these proof-of-concept studies, some interesting trends emerge: unsurprisingly, as models train on larger sets, they perform better; less expected, models could achieve high correlations with docking scores and yet still fail to enrich the new docking-discovered ligands, or even the top 0.01% of docking-ranked molecules. It will be interesting to see how these trends develop for methods more sophisticated than the simple proof-of-concept studies undertaken here; the database is openly available at lsd.docking.org.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11888352 | PMC |
http://dx.doi.org/10.1101/2025.02.25.639879 | DOI Listing |
JMIR Med Inform
March 2025
LynxCare Inc, Leuven, Belgium.
Background: Processing data from electronic health records (EHRs) to build research-grade databases is a lengthy and expensive process. Modern arthroplasty practice commonly uses multiple sites of care, including clinics and ambulatory care centers. However, most private data systems prevent obtaining usable insights for clinical practice.
View Article and Find Full Text PDFJMIR Res Protoc
March 2025
Institute for Data Science and Informatics, University of Missouri, Columbia, MO, United States.
Background: Amyotrophic lateral sclerosis (ALS) leads to rapid physiological and functional decline before causing untimely death. Current best-practice approaches to interdisciplinary care are unable to provide adequate monitoring of patients' health. Passive in-home sensor systems enable 24×7 health monitoring.
View Article and Find Full Text PDFJ Med Chem
March 2025
State Key Laboratory of Discovery and Utilization of Functional Components in Traditional Chinese Medicine; Shanghai Frontiers Science Center of TCM Chemical Biology; Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Shanghai 201203, China.
The anticancer agent irinotecan often induces severe delayed-onset diarrhea, inhibiting human carboxylesterase 2A (hCES2A) can significantly alleviate irinotecan-triggered gut toxicity (ITGT). This work presents an efficient workflow for design and developing novel efficacious hCES2A inhibitors. A well-training machine learning model identified as a lead compound, while compound was developed as a novel time-dependent hCES2A inhibitor (IC = 0.
View Article and Find Full Text PDFMol Inform
March 2025
Faculty of Information Technology, HUTECH University, Ho Chi Minh City, Vietnam.
Within a recent decade, graph neural network (GNN) has emerged as a powerful neural architecture for various graph-structured data modelling and task-driven representation learning problems. Recent studies have highlighted the remarkable capabilities of GNNs in handling complex graph representation learning tasks, achieving state-of-the-art results in node/graph classification, regression, and generation. However, most traditional GNN-based architectures like GCN and GraphSAGE still faced several challenges related to the capability of preserving the multi-scaled topological structures.
View Article and Find Full Text PDFJ AOAC Int
March 2025
Department of Chemistry, Carleton University, 1125 Colonel By Drive, Ottawa, Ontario, K1S 5B6 Canada.
Background: Plant-based milk alternatives (PBMA) are increasingly popular due to rising lactose intolerance and environmental concerns over traditional dairy products. However, limited efforts have been made to develop rapid authentication methods to verify their biological origin.
Objective: In this study, we developed a rapid, on-site analytical method for the authentication and identification of PBMA made by six different plant species utilizing a portable Raman spectrometer coupled with machine learning.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!