Inflammatory bowel disease (IBD) is characterized by complex etiology and a disrupted colonic ecosystem. We provide a framework for the analysis of multi-omic data, which we apply to study the gut ecosystem in IBD. Specifically, we train and validate models using data on the metagenome, metatranscriptome, virome, and metabolome from the Human Microbiome Project 2 IBD multi-omic database, with 1,785 repeated samples from 130 individuals (103 cases and 27 controls). After splitting the participants into training and testing groups, we used mixed-effects least absolute shrinkage and selection operator regression to select features for each omic. These features, with demographic covariates, were used to generate separate single-omic prediction scores. All four single-omic scores were then combined into a final regression to assess the relative importance of the individual omics and the predictive benefits when considered together. We identified several species, pathways, and metabolites known to be associated with IBD risk, and we explored the connections between data sets. Individually, metabolomic and viromic scores were more predictive than metagenomics or metatranscriptomics, and when all four scores were combined, we predicted disease diagnosis with a Nagelkerke's of 0.46 and an area under the curve of 0.80 (95% confidence interval: 0.63, 0.98). Our work supports that some single-omic models for complex traits are more predictive than others, that incorporating multiple omic data sets may improve prediction, and that each omic data type provides a combination of unique and redundant information. This modeling framework can be extended to other complex traits and multi-omic data sets.IMPORTANCEComplex traits are characterized by many biological and environmental factors, such that multi-omic data sets are well-positioned to help us understand their underlying etiologies. We applied a prediction framework across multiple omics (metagenomics, metatranscriptomics, metabolomics, and viromics) from the gut ecosystem to predict inflammatory bowel disease (IBD) diagnosis. The predicted scores from our models highlighted key features and allowed us to compare the relative utility of each omic data set in single-omic versus multi-omic models. Our results emphasized the importance of metabolomics and viromics over metagenomics and metatranscriptomics for predicting IBD status. The greater predictive capability of metabolomics and viromics is likely because these omics serve as markers of lifestyle factors such as diet. This study provides a modeling framework for multi-omic data, and our results show the utility of combining multiple omic data types to disentangle complex disease etiologies and biological signatures.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10805030PMC
http://dx.doi.org/10.1128/msystems.00677-23DOI Listing

Publication Analysis

Top Keywords

multi-omic data
16
omic data
16
inflammatory bowel
12
bowel disease
12
data sets
12
metagenomics metatranscriptomics
12
metabolomics viromics
12
data
10
predict inflammatory
8
disease diagnosis
8

Similar Publications

Fibrolamellar Hepatocellular Carcinoma (FLC) is a rare liver cancer characterized by a fusion oncokinase of the genes DNAJB1 and PRKACA, the catalytic subunit of protein kinase A (PKA). A few FLC-like tumors have been reported showing other alterations involving PKA. To better understand FLC pathogenesis and the relationships among FLC, FLC-like, and other liver tumors, we performed a massive multi-omics analysis.

View Article and Find Full Text PDF

Unlocking the future of complex human diseases prediction: multi-omics risk score breakthrough.

Front Bioinform

December 2024

Department of Immunology and Molecular Biology, College of Health Sciences, School of Biomedical Sciences, Makerere University, Kampala, Uganda.

View Article and Find Full Text PDF

Development and validation of a nomogram for predicting venous thromboembolism risk in post-surgery patients with cervical cancer.

World J Surg Oncol

December 2024

Chongqing Cancer Multiomics Big Data Application Engineering Research Center, Chongqing University Cancer Hospital, Chongqing, 400030, China.

Objective: Postoperative venous thromboembolism (VTE) is a potentially life-threatening complication. This study aimed to develop a predictive model to identify independent risk factors and estimate the likelihood of VTE in patients undergoing surgery for cervical cancer.

Methods: We conducted a retrospective cohort study involving 1,174 patients who underwent surgery for cervical carcinoma between 2019 and 2022.

View Article and Find Full Text PDF

Neurodegenerative diseases present complex genetic architectures, reflecting a continuum from monogenic to oligogenic and polygenic models. Recent advances in multi-omics data, coupled with systems genetics, have significantly refined our understanding of how these data impact neurodegenerative disease mechanisms. To contextualize these genetic discoveries, we provide a comprehensive critical overview of genetic architecture concepts, from Mendelian inheritance to the latest insights from oligogenic and omnigenic models.

View Article and Find Full Text PDF

Background: The mechanisms underlying the complex relationship between autoimmune hypothyroidism and neurological disorders remain unclear. We conducted a comprehensive analysis of associations between alternative splicing, transcriptomics, and proteomics data and autoimmune hypothyroidism.

Methods: Splicing-Wide association studies (SWAS), proteome-wide association studies (PWAS), and transcriptome-wide association studies (TWAS) were used to identify genes and proteins that regulate autoimmune hypothyroidism within the brain axis.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!