Structure-adaptive canonical correlation analysis for microbiome multi-omics data.

Front Genet

Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, United States.

Published: November 2024

AI Article Synopsis

  • Sparse canonical correlation analysis (sCCA) effectively integrates high-dimensional datasets by identifying correlated features, making it relevant for studying host-microbiome interactions at various molecular levels.
  • Previous sCCA methods struggle with the compositional nature of microbiome data, limiting their effectiveness in this context.
  • The proposed new sCCA framework addresses these limitations by incorporating the compositional characteristics of microbiome data and allowing for structure-based constraints, showing improvements through simulations and real-world analysis.

Article Abstract

Sparse canonical correlation analysis (sCCA) has been a useful approach for integrating different high-dimensional datasets by finding a subset of correlated features that explain the most correlation in the data. In the context of microbiome studies, investigators are always interested in knowing how the microbiome interacts with the host at different molecular levels such as genome, methylol, transcriptome, metabolome and proteome. sCCA provides a simple approach for exploiting the correlation structure among multiple omics data and finding a set of correlated omics features, which could contribute to understanding the host-microbiome interaction. However, existing sCCA methods do not address compositionality, and its application to microbiome data is thus not optimal. This paper proposes a new sCCA framework for integrating microbiome data with other high-dimensional omics data, accounting for the compositional nature of microbiome sequencing data. It also allows integrating prior structure information such as the grouping structure among bacterial taxa by imposing a "soft" constraint on the coefficients through varying penalization strength. As a result, the method provides significant improvement when the structure is informative while maintaining robustness against a misspecified structure. Through extensive simulation studies and real data analysis, we demonstrate the superiority of the proposed framework over the state-of-the-art approaches.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11626081PMC
http://dx.doi.org/10.3389/fgene.2024.1489694DOI Listing

Publication Analysis

Top Keywords

canonical correlation
8
correlation analysis
8
data
8
omics data
8
microbiome data
8
microbiome
6
structure
5
structure-adaptive canonical
4
correlation
4
analysis microbiome
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!