Multi-dataset Integration and Residual Connections Improve Proteome Prediction from Transcriptomes using Deep Learning.

bioRxiv

Department of Computational Biomedicine, Cedars Sinai Medical Center, Los Angeles CA 90048.

Published: July 2024

Proteomes are well known to poorly correlate with transcriptomes measured from the same sample. While connected, the complex processes that impact the relationships between transcript and protein quantities remains an open research topic. Many studies have attempted to predict proteomes from transcriptomes with limited success. Here we use publicly available data from the Clinical Proteomics Tumor Analysis Consortium to show that deep learning models designed by neural architecture search (NAS) achieve improved prediction accuracy of proteome quantities from transcriptomics. We find that this benefit is largely due to including a residual connection in the architecture that allows input information to be remembered near the end of the network. Finally, we explore which groups of transcripts are functionally important for protein prediction using model interpretation with SHAP.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11257616	PMC
http://dx.doi.org/10.1101/2024.07.08.602560	DOI Listing

Publication Analysis

Top Keywords

deep learning

multi-dataset integration

integration residual

residual connections

connections improve

improve proteome

proteome prediction

prediction transcriptomes

transcriptomes deep

learning proteomes

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!