Deep multi-omics integration by learning correlation-maximizing representation identifies prognostically stratified cancer subtypes.

Bioinform Adv

Department of Biomedical Informatics, Stony Brook Cancer Center, Stony Brook Medicine, Stony Brook University, Stony Brook, NY 11794, USA.

Published: June 2023

Motivation: Molecular subtyping by integrative modeling of multi-omics and clinical data can help the identification of robust and clinically actionable disease subgroups; an essential step in developing precision medicine approaches.

Results: We developed a novel outcome-guided molecular subgrouping framework, called Deep Multi-Omics Integrative Subtyping by Maximizing Correlation (DeepMOIS-MC), for integrative learning from multi-omics data by maximizing correlation between all input -omics views. DeepMOIS-MC consists of two parts: clustering and classification. In the clustering part, the preprocessed high-dimensional multi-omics views are input into two-layer fully connected neural networks. The outputs of individual networks are subjected to Generalized Canonical Correlation Analysis loss to learn the shared representation. Next, the learned representation is filtered by a regression model to select features that are related to a covariate clinical variable, for example, a survival/outcome. The filtered features are used for clustering to determine the optimal cluster assignments. In the classification stage, the original feature matrix of one of the -omics view is scaled and discretized based on equal frequency binning, and then subjected to feature selection using RandomForest. Using these selected features, classification models (for example, XGBoost model) are built to predict the molecular subgroups that were identified at clustering stage. We applied DeepMOIS-MC on lung and liver cancers, using TCGA datasets. In comparative analysis, we found that DeepMOIS-MC outperformed traditional approaches in patient stratification. Finally, we validated the robustness and generalizability of the classification models on independent datasets. We anticipate that the DeepMOIS-MC can be adopted to many multi-omics integrative analyses tasks.

Availability And Implementation: Source codes for PyTorch implementation of DGCCA and other DeepMOIS-MC modules are available at GitHub (https://github.com/duttaprat/DeepMOIS-MC).

Supplementary Information: Supplementary data are available at online.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10328436PMC
http://dx.doi.org/10.1093/bioadv/vbad075DOI Listing

Publication Analysis

Top Keywords

deep multi-omics
8
multi-omics integrative
8
maximizing correlation
8
classification models
8
deepmois-mc
6
multi-omics
5
multi-omics integration
4
integration learning
4
learning correlation-maximizing
4
correlation-maximizing representation
4

Similar Publications

scMDCL: A Deep Collaborative Contrastive Learning Framework for Matched Single-Cell Multiomics Data Clustering.

J Chem Inf Model

March 2025

Qingdao Institute of Software, College of Computer Science and Technology, State Key Laboratory of Chemical Safety, Shandong Key Laboratory of Intelligent Oil & Gas Industrial Software, China University of Petroleum (East China), Qingdao 266580, China.

Single-cell multiomics clustering integrates multiple omics data to analyze cellular heterogeneity and is crucial for uncovering complex biological processes and disease mechanisms. However, existing matched single-cell multiomics clustering methods often neglect the full utilization of intercellular relationships and the interactions and synergy between features from different omics, leading to suboptimal clustering performance. In this paper, we propose a deep collaborative contrastive learning framework for matched single-cell multiomics data clustering, named scMDCL.

View Article and Find Full Text PDF

ALS molecular subtypes are a combination of cellular and pathological features learned by deep multiomics classifiers.

Cell Rep

March 2025

Institute for Systems Genetics, NYU Langone Health, New York, NY 10016, USA; Department of Neuroscience & Neuroscience Institute, NYU Langone Health, New York, NY 10016, USA. Electronic address:

Amyotrophic lateral sclerosis (ALS) is a complex syndrome with multiple genetic causes and wide variation in disease presentation. Despite this heterogeneity, large-scale genomics studies revealed that ALS postmortem samples can be grouped into a small number of subtypes, defined by transcriptomic signatures of mitochondrial dysfunction and oxidative stress (ALS-Ox), microglial activation and neuroinflammation (ALS-Glia), or TDP-43 pathology and associated transposable elements (ALS-TE). In this study, we present a deep ALS neural net classifier (DANCer) for ALS molecular subtypes.

View Article and Find Full Text PDF

Parkinson's disease (PD) is a complex, progressive neurodegenerative disorder with high heterogeneity, making early diagnosis difficult. Early detection and intervention are crucial for slowing PD progression. Understanding PD's diverse pathways and mechanisms is key to advancing knowledge.

View Article and Find Full Text PDF

Cohort Profile: TRacing Etiology of Non-communicable Diseases (TREND): Rationale, Progress and Perspective.

Phenomics

December 2024

Department of Epidemiology & Biostatistics, School of Public Health, Southeast University, Nanjing, 210009 China.

Unlabelled: The TRacing Etiology of Non-communicable Diseases (TREND) cohort is a prospective longitudinal cohort and biobank that is mainly based in Ma'anshan, Anhui Province, China. The primary aim of the study is to decipher comprehensive molecular characterization and deep phenotyping for a broad spectrum of chronic non-communicable diseases (NCDs), which focuses on providing mechanistic insights with diagnostic, prognostic and therapeutic implications. The recruitment was initiated in 2023 and is expected to complete in 2025 with 20,000 participants originated from urban and rural area.

View Article and Find Full Text PDF

Telomere-to-telomere genome and multi-omics analysis of Prunus avium cv. Tieton provides insights into its genomic evolution and flavonoid biosynthesis.

Int J Biol Macromol

March 2025

Key Laboratory of Resource Biology and Biotechnology in Western China, Ministry of Education, Provincial Key Laboratory of Biotechnology, College of Life Sciences, Northwest University, Xi'an 710069, Shaanxi, China. Electronic address:

The European sweet cherry (Prunus avium) is highly valued for its superior quality, delectable taste, and robust stress resistance, leading to its extensive cultivation in the world. However, the previous incomplete genome assemblies have impeded its evolution and genetic regulation studies. In this study, we generated a Telomere-to-Telomere gap-free genome assembly of P.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!