AI Article Synopsis

  • Accurate cancer subtype identification is essential for prognosis and personalized treatment, but traditional clustering methods struggle due to data complexity.
  • A novel contrastive-learning-based approach using deep learning was proposed to better cluster patients based on multi-omics data, yielding superior results in separating survival outcomes across nine datasets.
  • The study identified important cancer-related genes and showed that mRNA is the most influential omics data type for cancer survival, highlighting the method's effectiveness in revealing biologically significant cancer subtypes.

Article Abstract

Background: Accurate identification of cancer subtypes is crucial for disease prognosis evaluation and personalized patient management. Recent advances in computational methods have demonstrated that multi-omics data provides valuable insights into tumor molecular subtyping. However, the high dimensionality and small sample size of the data may result in ambiguous and overlapping cancer subtypes during clustering. In this study, we propose a novel contrastive-learning-based approach to address this issue. The proposed end-to-end deep learning method can extract crucial information from the multi-omics features by self-supervised learning for patient clustering.

Results: By applying our method to nine public cancer datasets, we have demonstrated superior performance compared to existing methods in separating patients with different survival outcomes (p < 0.05). To further evaluate the impact of various omics data on cancer survival, we developed an XGBoost classification model and found that mRNA had the highest importance score, followed by DNA methylation and miRNA. In the presented case study, our method successfully clustered subtypes and identified 14 cancer-related genes, of which 12 (85.7%) were validated through literature review.

Conclusions: Our findings demonstrate that our method is capable of identifying cancer subtypes that are both statistically and biologically significant. The code about COLCS is given at: https://github.com/Mercuriiio/COLCS .

Download full-text PDF

Source
http://dx.doi.org/10.1007/s12539-024-00641-yDOI Listing

Publication Analysis

Top Keywords

multi-omics data
8
cancer subtypes
8
contrastive-learning-based deep
4
deep neural
4
neural network
4
cancer
4
network cancer
4
cancer subtyping
4
subtyping integrating
4
integrating multi-omics
4

Similar Publications

Background: PSEN1, PSEN2, and APP mutations cause Alzheimer's disease (AD) with an early age at onset (AAO) and progressive cognitive decline. PSEN1 mutations are more common and generally have an earlier AAO; however, certain PSEN1 mutations cause a later AAO, similar to those observed in PSEN2 and APP.

Methods: We examined whether common disease endotypes exist across these mutations with a later AAO (~ 55 years) using hiPSC-derived neurons from familial Alzheimer's disease (FAD) patients harboring mutations in PSEN1, PSEN2, and APP and mechanistically characterized by integrating RNA-seq and ATAC-seq.

View Article and Find Full Text PDF

Understanding the molecular landscape of nonmuscle-invasive bladder cancer (NMIBC) is essential to improve risk assessment and treatment regimens. We performed a comprehensive genomic analysis of patients with NMIBC using whole-exome sequencing (n = 438), shallow whole-genome sequencing (n = 362) and total RNA sequencing (n = 414). A large genomic variation within NMIBC was observed and correlated with different molecular subtypes.

View Article and Find Full Text PDF

Background: This study aimed to investigate the prognostic impact of lymph node metastasis (LNM) on patients with colorectal cancer liver metastasis (CRLM) and elucidate the underlying immune mechanisms using multiomics profiling.

Methods: We enrolled patients with CRLM from the US Surveillance, Epidemiology, and End Results (SEER) cohort and a multicenter Chinese cohort, integrating bulk RNA sequencing, single-cell RNA sequencing and proteomics data. The cancer-specific survival (CSS) and immune profiles of the tumor-draining lymph nodes (TDLNs), primary tumors and liver metastasis were compared between patients with and without LNM.

View Article and Find Full Text PDF

Spatially resolved omics (SRO) technologies enable the identification of cell types while preserving their organization within tissues. Application of such technologies offers the opportunity to delineate cell-type spatial relationships, particularly across different length scales, and enhance our understanding of tissue organization and function. To quantify such multi-scale cell-type spatial relationships, we present CRAWDAD, Cell-type Relationship Analysis Workflow Done Across Distances, as an open-source R package.

View Article and Find Full Text PDF

The extensive application of graphene nanosheets (GNSs) has raised concerns over risks to sensitive species in the aquatic environment. The humic acid (HA) corona is traditionally considered to reduce GNSs toxicity. Here, we evaluate the effect of sorbed HA (GNSs-HA) on the toxicity of GNSs to Gram positive Bacillus tropicus.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!