Cancer is a heterogeneous disease, and patients with tumors from different organs can share similar epigenetic and genetic alterations. Therefore, it is crucial to identify the novel subgroups of patients with similar molecular characteristics. It is possible to propose a better treatment strategy when the heterogeneity of the patient is accounted for during subgroup identification, irrespective of the tissue of origin. This work proposes a machine learning (ML) based pipeline for subgroup identification in pan-cancer. Here, mRNA, miRNA, DNA methylation, and protein expression features from pan-cancer samples were concatenated and non-linearly projected to a lower dimension using an ML algorithm. This data was then clustered to identify multi-omics-based novel subgroups. The clinical characterization of these ML subgroups indicated significant differences in overall survival (OS) and disease-free survival (DFS) (p-value<0.0001). The subgroups formed by the patients from different tumors shared similar molecular alterations in terms of immune microenvironment, mutation profile, and enriched pathways. Further, decision-level and feature-level fused classification models were built to identify the novel subgroups for unseen samples. Additionally, the classification models were used to obtain the class labels for the validation samples, and the molecular characteristics were verified. To summarize, this work identified novel ML subgroups using multi-omics data and showed that the patients with different tumor types could be similar molecularly. We also proposed and validated the classification models for subgroup identification. The proposed classification models can be used to identify the novel multi-omics subgroups, and the molecular characteristics of each subgroup can be used to design appropriate treatment regimen.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10586677PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0287176PLOS

Publication Analysis

Top Keywords

subgroup identification
12
machine learning
8
novel subgroups
8
integration pan-cancer
4
pan-cancer multi-omics
4
multi-omics data
4
data novel
4
novel mixed
4
mixed subgroup
4
identification machine
4

Similar Publications

Pharmacogenomics stands as a pivotal driver toward personalized medicine, aiming to optimize drug efficacy while minimizing adverse effects by uncovering the impact of genetic variations on inter-individual outcome variability. Despite its promise, the intricate landscape of drug metabolism introduces complexity, where the correlation between drug response and genes can be shaped by numerous nongenetic factors, often exhibiting heterogeneity across diverse subpopulations. This challenge is particularly pronounced in datasets such as the International Warfarin Pharmacogenetic Consortium (IWPC), which encompasses diverse patient information from multiple nations.

View Article and Find Full Text PDF

Background: Association between dietary factors and the risk of developing inflammatory bowel disease (IBD) has been studied extensively. However, identification of deleterious dietary patterns merits further study.

Aim: To investigate the risk of developing Crohn's disease (CD) and ulcerative colitis (UC) according to the inflammatory score of the diet (ISD) in the multinational European Prospective Investigation into Cancer and Nutrition (EPIC) cohort.

View Article and Find Full Text PDF

Purpose: Research suggests that insulin resistance (IR) is associated with acute ischemic stroke (AIS) and depression. The use of insulin-based IR assessments is complicated. Therefore, we explored the relationship between four non-insulin-based IR indices and post-stroke depression (PSD).

View Article and Find Full Text PDF

Background: The triglyceride glucose-body mass index (TyG-BMI) is considered to be a reliable surrogate marker of insulin resistance (IR). However, limited evidence exists regarding its association with the severity of coronary artery disease (CAD), particularly in hypertensive patients with different glucose metabolic states, including those with H-type hypertension. This study aimed to investigate the relationship between TyG-BMI and CAD severity across different glucose metabolism conditions.

View Article and Find Full Text PDF

Hepatocellular carcinoma (HCC) is a predominant cause of cancer-related mortality globally, noted for its propensity towards late-stage diagnosis and scarcity of effective treatment modalities. The process of metabolic reprogramming, with a specific emphasis on lipid metabolism, is instrumental in the progression of HCC. Nevertheless, the precise mechanisms through which lipid metabolism impacts HCC and its viability as a therapeutic target have yet to be fully elucidated.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!