Background: The increased multi-omics information on carefully phenotyped patients in studies of complex diseases requires novel methods for data integration. Unlike continuous intensity measurements from most omics data sets, phenome data contain clinical variables that are binary, ordinal and categorical.

Results: In this paper we introduce an integrative phenotyping framework (iPF) for disease subtype discovery. A feature topology plot was developed for effective dimension reduction and visualization of multi-omics data. The approach is free of model assumption and robust to data noises or missingness. We developed a workflow to integrate homogeneous patient clustering from different omics data in an agglomerative manner and then visualized heterogeneous clustering of pairwise omics sources. We applied the framework to two batches of lung samples obtained from patients diagnosed with chronic obstructive lung disease (COPD) or interstitial lung disease (ILD) with well-characterized clinical (phenomic) data, mRNA and microRNA expression profiles. Application of iPF to the first training batch identified clusters of patients consisting of homogenous disease phenotypes as well as clusters with intermediate disease characteristics. Analysis of the second batch revealed a similar data structure, confirming the presence of intermediate clusters. Genes in the intermediate clusters were enriched with inflammatory and immune functional annotations, suggesting that they represent mechanistically distinct disease subphenotypes that may response to immunomodulatory therapies. The iPF software package and all source codes are publicly available.

Conclusions: Identification of subclusters with distinct clinical and biomolecular characteristics suggests that integration of phenomic and other omics information could lead to identification of novel mechanism-based disease sub-phenotypes.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4642618PMC
http://dx.doi.org/10.1186/s12864-015-2170-4DOI Listing

Publication Analysis

Top Keywords

omics data
12
lung disease
12
data
9
integrative phenotyping
8
phenotyping framework
8
framework ipf
8
disease
8
disease subphenotypes
8
intermediate clusters
8
omics
5

Similar Publications

[Gene coexpression networks: concepts and applications].

Biol Aujourdhui

January 2025

Sorbonne Université, CNRS, Inserm U1156, Institut de Biologie Paris Seine, Laboratoire de Biologie du Développement/UMR7622, 9 Quai St-Bernard, 75005 Paris, France.

The advent of high-throughput omics data and the generation of new algorithms provide the biologists with the opportunity to explore living processes in the context of systems biology aiming at revealing the gene interactions, the networks underlying complex cellular functions. In this article, we discuss two methods for gene network reconstruction, WGCNA (Weighted Gene Correlation Network Analysis) developed by Steve Horvath and collaborators in 2008, and MIIC (Multivariate Information-based Inductive Causation) developed by Hervé Isambert and his team in 2017 and 2024. These two methods are complementary, WGCNA generating undirected networks in which most gene-to-gene interactions are indirect, while MIIC reveals direct interactions and some causal links.

View Article and Find Full Text PDF

Single-omics approaches often provide a limited view of complex biological systems, whereas multiomics integration offers a more comprehensive understanding by combining diverse data views. However, integrating heterogeneous data types and interpreting the intricate relationships between biological features-both within and across different data views-remains a bottleneck. To address these challenges, we introduce COSIME (Cooperative Multi-view Integration and Scalable Interpretable Model Explainer).

View Article and Find Full Text PDF

Serum metabolomic signatures of patients with rare neurogenetic diseases: an insight into potential biomarkers and treatment targets.

Front Mol Neurosci

January 2025

Interdisciplinary Centre for Innovations in Biotechnology and Neuroscience, Faculty of Medical Sciences, University of Sri Jayewardenepura, Nugegoda, Sri Lanka.

Introduction: To further advance our understanding of Muscular Dystrophies (MDs) and Spinocerebellar Ataxias (SCAs), it is necessary to identify the biological patterns associated with disease pathology. Although progress has been made in the fields of genetics and transcriptomics, there is a need for proteomics and metabolomics studies. The present study aimed to be the first to document serum metabolic signatures of MDs (DMD, BMD, and LGMD 2A) SCAs (SCA 1-3), from a South Asian perspective.

View Article and Find Full Text PDF

The G2PDeep-v2 server is a web-based platform powered by deep learning, for phenotype prediction and markers discovery from multi-omics data in any organisms including humans, plants, animals, and viruses. The server provides multiple services for researchers to create deep-learning models through an interactive interface and train these models using an automated hyperparameter tuning algorithm on high-performance computing resources. Users can visualize the results of phenotype and markers predictions and perform Gene Set Enrichment Analysis for the significant markers to provide insights into the molecular mechanisms underlying complex diseases, conditions and other biological phenotypes being studied.

View Article and Find Full Text PDF

Integrative analysis of miRNA expression data reveals a minimal signature for tumour cells classification.

Comput Struct Biotechnol J

December 2024

Interdisciplinary Research Centre on Biomaterials (CRIB), Università degli Studi di Napoli "Federico II", Piazzale Tecchio 80, Naples 80125, Italy.

MicroRNAs (miRNAs) are pivotal biomarkers for cancer screening. Identifying distinctive expression patterns of miRNAs in specific cancer types can serve as an effective strategy for classification and characterization. However, the development of a minimal signature of miRNAs for accurate cancer classification remains challenging, hindered by the lack of integrated approaches that systematically analyse miRNA expression levels of miRNAs alongside their associated biological pathways.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!