PartIES: a disease subtyping framework with Partition-level Integration using diffusion-Enhanced Similarities from multi-omics Data.

Brief Bioinform

Department of Biostatistics, Mailman School of Public Health, Columbia University, New York, NY 10027, United States.

Published: November 2024

Integrating multi-omics data helps identify disease subtypes. Many similarity-based methods were developed for disease subtyping using multi-omics data, with many of them focusing on extracting common clustering structures across multiple types of omics data, but not preserving data-type-specific clustering structures. Moreover, clustering performance of similarity-based methods is affected when similarity measures are noisy. Here we proposed PartIES, a Partition-level Integration using diffusion-Enhanced Similarities to perform disease subtyping using multi-omics data. PartIES uses diffusion to reduce noises in individual similarity/kernel matrices from individual omics data types first, and then extract partition information from diffusion-enhanced similarity matrices and integrate the partition-level similarity through a weighted average iteratively. Simulation studies showed that (1) the diffusion step enhances clustering accuracy, and (2) PartIES outperforms competing methods, particularly when omics data types provide different clustering structures. Using mRNA, long noncoding RNAs, microRNAs expression data, DNA methylation data, and somatic mutation data from The Cancer Genome Atlas project, PartIES identified subtypes in bladder urothelial carcinoma, liver hepatocellular carcinoma, and thyroid carcinoma that are most significantly associated with patient survival across all methods. Further investigations suggested that among subtype-associated genes, many of those that are highly interacting with other genes are known important cancer genes. The identified cancer subtypes also have different activity levels for some known cancer-related pathways. The R code can be accessed at https://github.com/yuqimiao/PartIES.git.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11586768PMC
http://dx.doi.org/10.1093/bib/bbae609DOI Listing

Publication Analysis

Top Keywords

multi-omics data
16
disease subtyping
12
clustering structures
12
omics data
12
data
10
partition-level integration
8
integration diffusion-enhanced
8
diffusion-enhanced similarities
8
similarity-based methods
8
subtyping multi-omics
8

Similar Publications

Precision medicine in less-defined subtype diffuse large B-cell lymphoma (DLBCL) remains a challenge due to the heterogeneous nature of the disease. Programmed cell death (PCD) pathways are crucial in the advancement of lymphoma and serve as significant prognostic markers for individuals afflicted with lymphoid cancers. To identify robust prognostic biomarkers that can guide personalized management for less-defined subtype DLBCL patients, we integrated multi-omics data derived from 339 standard R-CHOP-treated patients diagnosed with less-defined subtype DLBCL from three independent cohorts.

View Article and Find Full Text PDF

The lysine-specific demethylase 5 (KDM5) family, a key post-translational modification of chromatin, can shape tumor immune microenvironment. Here, we performed an extensive clinical and bioinformatic analysis to explore the association between KDM5 mutation and tumor immunity and its impact on the outcomes in pan-cancer immunotherapy. In 2943 patients across 12 tumor types treated with immune checkpoint inhibitors, KDM5-mutant tumors were associated with favorable overall survival (hazard ratio, 0.

View Article and Find Full Text PDF

Genome assembly and multi-omics analyses of Isodon lophanthodies provide insights into the distribution of medicinal metabolites induced by exogenous methyl jasmonate.

BMC Plant Biol

December 2024

Guangdong Provincial Key Laboratory of Crops Genetics and Improvement, Crop Research Institute, Guangdong Academy of Agriculture Sciences, Guangzhou, 510640, China.

Background: Isodon lophanthodies is a perennial herb and the whole plant has medicinal value distributed in southern China and southeast Asia. The absence of a reference genome has hindered evolution and genomic breeding research of this species.

Results: In this study, we present a high-quality, chromosome-level genome assembly of I.

View Article and Find Full Text PDF

Multiomics integration and machine learning reveal prognostic programmed cell death signatures in gastric cancer.

Sci Rep

December 2024

Clinical Teaching Hospital of Medical School, Nanjing Children's Hospital, Nanjing University, Nanjing, 210008, China.

Gastric cancer (GC) is characterized by notable heterogeneity and the impact of molecular subtypes on treatment and prognosis. The role of programmed cell death (PCD) in cellular processes is critical, yet its specific function in GC is underexplored. This study applied multiomics approaches, integrating transcriptomic, epigenetic, and somatic mutation data, with consensus clustering algorithms to classify GC molecular subtypes and assess their biological and immunological features.

View Article and Find Full Text PDF

Cardiovascular diseases (CVDs) and cerebrovascular diseases (CeVDs) are closely related vascular diseases, sharing common cardiometabolic risk factors (RFs). Although pleiotropic genetic variants of these two diseases have been reported, their underlying pathological mechanisms are still unclear. Leveraging GWAS summary data and using genetic correlation, pleiotropic variants identification, and colocalization analyses, we identified 11 colocalized loci for CVDs-CeVDs-BP (blood pressure), CVDs-CeVDs-LIP (lipid traits), and CVDs-CeVDs-cIMT (carotid intima-media thickness) triplets.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!