Machine Learning Distinguishes with High Accuracy between Pan-Assay Interference Compounds That Are Promiscuous or Represent Dark Chemical Matter.

J Med Chem

Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Endenicher Allee 19c , Rheinische Friedrich-Wilhelms-Universität, D-53115 Bonn , Germany.

Published: November 2018

Assay interference compounds give rise to false-positives and cause substantial problems in medicinal chemistry. Nearly 500 compound classes have been designated as pan-assay interference compounds (PAINS), which typically occur as substructures in other molecules. The structural environment of PAINS substructures is likely to play an important role for their potential reactivity. Given the large number of PAINS and their highly variable structural contexts, it is difficult to study context dependence on the basis of expert knowledge. Hence, we applied machine learning to predict PAINS that are promiscuous and distinguish them from others that are mostly inactive. Surprisingly accurate models can be derived using different methods such as support vector machines, random forests, or deep neural networks. Moreover, structural features that favor correct predictions have been identified, mapped, and categorized, shedding light on the structural context dependence of PAINS effects. The machine learning models presented herein further extend the capacity of PAINS filters.

Download full-text PDF

Source
http://dx.doi.org/10.1021/acs.jmedchem.8b01404DOI Listing

Publication Analysis

Top Keywords

machine learning
12
interference compounds
12
pan-assay interference
8
context dependence
8
pains
6
learning distinguishes
4
distinguishes high
4
high accuracy
4
accuracy pan-assay
4
compounds promiscuous
4

Similar Publications

Imaging-based spatial transcriptomics (iST), such as MERFISH, CosMx SMI, and Xenium, quantify gene expression level across cells in space, but more importantly, they directly reveal the subcellular distribution of RNA transcripts at the single-molecule resolution. The subcellular localization of RNA molecules plays a crucial role in the compartmentalization-dependent regulation of genes within individual cells. Understanding the intracellular spatial distribution of RNA for a particular cell type thus not only improves the characterization of cell identity but also is of paramount importance in elucidating unique subcellular regulatory mechanisms specific to the cell type.

View Article and Find Full Text PDF

A comprehensive benchmarking for evaluating TCR embeddings in modeling TCR-epitope interactions.

Brief Bioinform

November 2024

Department of Computer Science, City University of Hong Kong, 83 Tat Chee Avenue, Kowloon Tong, Hong Kong, 999077, China.

The complexity of T cell receptor (TCR) sequences, particularly within the complementarity-determining region 3 (CDR3), requires efficient embedding methods for applying machine learning to immunology. While various TCR CDR3 embedding strategies have been proposed, the absence of their systematic evaluations created perplexity in the community. Here, we extracted CDR3 embedding models from 19 existing methods and benchmarked these models with four curated datasets by accessing their impact on the performance of TCR downstream tasks, including TCR-epitope binding affinity prediction, epitope-specific TCR identification, TCR clustering, and visualization analysis.

View Article and Find Full Text PDF

Atomic force microscopy (AFM) has reached a significant level of maturity in biology, demonstrated by the diversity of modes for obtaining not only topographical images but also insightful mechanical and adhesion data by performing force measurements on delicate samples with a controlled environment (e.g., liquid, temperature, pH).

View Article and Find Full Text PDF

Prediction of dry matter intake in growing Black Bengal goats using artificial neural networks.

Trop Anim Health Prod

January 2025

Livestock Production and Management Section, ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly, Uttar Pradesh, 243 122, India.

Dry matter intake (DMI) determination is essential for effective management of meat goats, especially in optimizing feed utilization and production efficiency. Unfortunately, farmers often face challenges in accurately predicting DMI which leads to wastage of feed and an increase in the cost of production. This investigation aimed to predict DMI in Black Bengal goats by using body weight (BW), body condition score (BCS), average daily gain (ADG), and metabolic body weight (MBW) by applying an artificial neural network (ANN) model.

View Article and Find Full Text PDF

Strategies to increase the robustness of microbial cell factories.

Adv Biotechnol (Singap)

March 2024

State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China.

Engineering microbial cell factories have achieved much progress in producing fuels, natural products and bulk chemicals. However, in industrial fermentation, microbial cells often face various predictable and stochastic disturbances resulting from intermediate metabolites or end product toxicity, metabolic burden and harsh environment. These perturbances can potentially decrease productivity and titer.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!