Denoising DNA Encoded Library Screens with Sparse Learning.

ACS Comb Sci

Totient, Inc., Sinđelićeva 9, 11000 Belgrade, Serbia.

Published: August 2020

DNA-encoded libraries (DELs) are large, pooled collections of compounds in which every library member is attached to a stretch of DNA encoding its complete synthetic history. DEL-based hit discovery involves affinity selection of the library against a protein of interest, whereby compounds retained by the target are subsequently identified by next-generation sequencing of the corresponding DNA tags. When analyzing the resulting data, one typically assumes that sequencing output (i.e., read counts) is proportional to the binding affinity of a given compound, thus enabling hit prioritization and elucidation of any underlying structure-activity relationships (SAR). This assumption, though, tends to be severely confounded by a number of factors, including variable reaction yields, presence of incomplete products masquerading as their intended counterparts, and sequencing noise. In practice, these confounders are often ignored, potentially contributing to low hit validation rates, and universally leading to loss of valuable information. To address this issue, we have developed a method for comprehensively denoising DEL selection outputs. Our method, dubbed "deldenoiser", is based on sparse learning and leverages inputs that are commonly available within a DEL generation and screening workflow. Using simulated and publicly available DEL affinity selection data, we show that "deldenoiser" is not only able to recover and rank true binders much more robustly than read count-based approaches but also that it yields scores, which accurately capture the underlying SAR. The proposed method can, thus, be of significant utility in hit prioritization following DEL screens.

Download full-text PDF

Source
http://dx.doi.org/10.1021/acscombsci.0c00007DOI Listing

Publication Analysis

Top Keywords

sparse learning
8
affinity selection
8
hit prioritization
8
denoising dna
4
dna encoded
4
encoded library
4
library screens
4
screens sparse
4
learning dna-encoded
4
dna-encoded libraries
4

Similar Publications

Duchenne muscular dystrophy is a severe neuromuscular disorder, caused by mutations in the DMD gene. Normally, the DMD gene gives rise to multiple dystrophin isoforms, of which multiple are expressed in the brain. The location of the mutation determines the number of dystrophin isoforms affected, and the absence thereof leads to behavioral and cognitive impairments.

View Article and Find Full Text PDF

Personalized cancer drug treatment is emerging as a frontier issue in modern medical research. Considering the genomic differences among cancer patients, determining the most effective drug treatment plan is a complex and crucial task. In response to these challenges, this study introduces the Adaptive Sparse Graph Contrastive Learning Network (ASGCL), an innovative approach to unraveling latent interactions in the complex context of cancer cell lines and drugs.

View Article and Find Full Text PDF

Background: Late-life depression (LLD) is often accompanied by cognitive impairment, which may persist despite antidepressant treatment. Repetitive transcranial magnetic stimulation (rTMS) is an efficacious treatment for depression, with potential benefits on cognitive functioning. However, research on cognitive effects is inconclusive, relatively sparse in LLD, and predominantly focused on group-level cognitive changes.

View Article and Find Full Text PDF

scSMD: a deep learning method for accurate clustering of single cells based on auto-encoder.

BMC Bioinformatics

January 2025

Department of Surgery, Shanghai Key Laboratory of Gastric Neoplasms, Shanghai Institute of Digestive Surgery, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.

Background: Single-cell RNA sequencing (scRNA-seq) has transformed biological research by offering new insights into cellular heterogeneity, developmental processes, and disease mechanisms. As scRNA-seq technology advances, its role in modern biology has become increasingly vital. This study explores the application of deep learning to single-cell data clustering, with a particular focus on managing sparse, high-dimensional data.

View Article and Find Full Text PDF

Simplified self-supervised learning for hybrid propagation graph-based recommendation.

Neural Netw

January 2025

School of Big Data & Software Engineering, Chongqing University, Chongqing, 401331, China. Electronic address:

Recent progress in Graph Convolutional Networks (GCNs) has facilitated their extensive application in recommendation, yielding notable performance gains. Nevertheless, existing GCN-based recommendation approaches are confronted with several challenges: (1) how to effectively leverage multi-order graph connectivity to derive meaningful node embeddings; (2) faced with sparse raw data, how to augment supervision signals without relying on auxiliary information; (3) given that GCNs necessitate the aggregation of neighborhood nodes, and the sparsity of these nodes can exacerbate the impact of noise data, how to mitigate the noise problem inherent in the raw data. For tackling aforementioned challenges, we devise a new hybrid propagation GCN-based method named S3HGN, incorporating a simplified self-supervised learning paradigm for recommendation.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!