Proteomes are well known to correlate poorly with transcriptomes measured from the same sample. Although transcript and protein quantities are connected, the complex processes that shape their relationship remain an open research topic. Many studies have attempted to predict proteomes from transcriptomes, with limited success. Here we use publicly available data from the Clinical Proteomic Tumor Analysis Consortium (CPTAC) to show that deep learning models designed by neural architecture search (NAS) achieve improved accuracy in predicting proteome quantities from transcriptomics. We find that this benefit is largely due to a residual connection in the architecture that allows input information to be remembered near the end of the network. Finally, we use model interpretation with SHAP to explore which groups of transcripts are functionally important for protein prediction.
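To make the role of the residual connection concrete, the following is a minimal sketch of a feed-forward predictor in which the raw transcript input is carried forward and added back just before the output. It is not the architecture found by NAS in this work: the layer sizes, the additive form of the skip path, the random weights, and all names (`init_params`, `predict`, `use_skip`) are illustrative assumptions.

```python
import random


def linear(x, weights, bias):
    """Dense layer: one output value per weight row."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def relu(values):
    return [max(0.0, v) for v in values]


def init_layer(n_in, n_out, rng, scale=0.1):
    """Small random weights and zero biases (hypothetical initialisation)."""
    weights = [[rng.gauss(0.0, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def init_params(n_transcripts, n_hidden, n_proteins, seed=0):
    rng = random.Random(seed)
    return {
        "hidden1": init_layer(n_transcripts, n_hidden, rng),
        "hidden2": init_layer(n_hidden, n_hidden, rng),
        "head": init_layer(n_hidden, n_proteins, rng),
        # Assumed skip path: a linear projection of the raw input
        # straight to the output dimension.
        "skip": init_layer(n_transcripts, n_proteins, rng),
    }


def predict(x, params, use_skip=True):
    """Forward pass from transcript abundances to protein abundances."""
    h = relu(linear(x, *params["hidden1"]))
    h = relu(linear(h, *params["hidden2"]))
    out = linear(h, *params["head"])
    if use_skip:
        # Residual connection: the untransformed input re-enters near
        # the end of the network, bypassing the hidden layers.
        skip = linear(x, *params["skip"])
        out = [o + s for o, s in zip(out, skip)]
    return out


params = init_params(n_transcripts=5, n_hidden=4, n_proteins=3)
x = [1.0, 0.5, -0.2, 0.0, 2.0]  # made-up transcript abundances
y_with_skip = predict(x, params)
y_without_skip = predict(x, params, use_skip=False)
```

Comparing `y_with_skip` and `y_without_skip` isolates the contribution of the skip path, which is the kind of ablation one would run to check whether the residual connection accounts for an accuracy gain.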

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11257616
DOI: http://dx.doi.org/10.1101/2024.07.08.602560
