MB-SupCon: Microbiome-based Predictive Models via Supervised Contrastive Learning.

J Mol Biol

Department of Statistical Science, Southern Methodist University, Dallas, TX 75275, United States; Quantitative Biomedical Research Center, Department of Population and Data Sciences, University of Texas Southwestern Medical Center, Dallas, TX 75390, United States; Center for Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390, United States. Electronic address:

Published: August 2022

Human microbiome consists of trillions of microorganisms. Microbiota can modulate the host physiology through molecule and metabolite interactions. Integrating microbiome and metabolomics data have the potential to predict different diseases more accurately. Yet, most datasets only measure microbiome data but without paired metabolome data. Here, we propose a novel integrative modeling framework, Microbiome-based Supervised Contrastive Learning Framework (MB-SupCon). MB-SupCon integrates microbiome and metabolome data to generate microbiome embeddings, which can be used to improve the prediction accuracy in datasets that only measure microbiome data. As a proof of concept, we applied MB-SupCon on 720 samples with paired 16S microbiome data and metabolomics data from patients with type 2 diabetes. MB-SupCon outperformed existing prediction methods and achieved high average prediction accuracies for insulin resistance status (84.62%), sex (78.98%), and race (80.04%). Moreover, the microbiome embeddings form separable clusters for different covariate groups in the lower-dimensional space, which enhances data visualization. We also applied MB-SupCon on a large inflammatory bowel disease study and observed similar advantages. Thus, MB-SupCon could be broadly applicable to improve microbiome prediction models in multi-omics disease studies.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10284149PMC
http://dx.doi.org/10.1016/j.jmb.2022.167693DOI Listing

Publication Analysis

Top Keywords

microbiome data
12
microbiome
9
supervised contrastive
8
contrastive learning
8
data
8
metabolomics data
8
datasets measure
8
measure microbiome
8
metabolome data
8
microbiome embeddings
8

Similar Publications

Modern treatment, a healthy diet, and physical activity routines lower the risk factors for metabolic syndrome; however, this condition is associated with all-cause and cardiovascular mortality worldwide. This investigation involved a randomized controlled trial, double-blind, parallel study. Fifty-eight participants with risk factors of metabolic syndrome according to the inclusion criteria were randomized into two groups and given probiotics (Lacticaseibacillus paracasei MSMC39-1 and Bifidobacterium animalis TA-1) (n = 31) or a placebo (n = 27).

View Article and Find Full Text PDF

Exploration missions to Mars rely on landers or rovers to perform multiple analyses over geographically small sampling regions, while landing site selection is done using large-scale but low-resolution remote-sensing data. Utilizing Earth analog environments to estimate small-scale spatial and temporal variation in key geochemical signatures and biosignatures will help mission designers ensure future sampling strategies meet mission science goals. Icelandic lava fields can serve as Mars analog sites due to conditions that include low nutrient availability, temperature extremes, desiccation, and isolation from anthropogenic contamination.

View Article and Find Full Text PDF

Safer chemical alternatives to bisphenol (BP) have been a major pursuit of modern green chemistry and toxicology. Using a chemical similarity-based approach, it is difficult to identify minor structural differences that contribute to the significant changes of toxicity. Here, we used omics and computational toxicology to identify chemical features associated with BP analogue-induced embryonic toxicity, offering valuable insights to inform the design of safer chemical alternatives.

View Article and Find Full Text PDF

Epidemiological studies combining taxonomic and clinical data have been limited globally, particularly Guiyang, the most under-developed economic provincial capital city in southwestern China. A retrospective analysis was performed of dermatophyte epidemiology involving all culture-positive cases received between May 2017 and May 2023 at the Affiliated Hospital of Guizhou Medical University. Phylogenetic analysis was conducted on 391 dermatophyte isolates collected from patients using the rDNA internal transcribed spacer sequences.

View Article and Find Full Text PDF

The microbiomes of host organisms and their direct source environments are closely linked and key for shaping microbial community dynamics. The relationship between these linked dynamics is largely unexplored because source substrates are usually unavailable. To address this current knowledge gap, we employed bacteriovorous nematodes as a unique model system, for which source substrates like rotting apples can be easily collected.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!