Finding phylogeny-aware and biologically meaningful averages of metagenomic samples: L2UniFrac.

Bioinformatics

Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, United States.

Published: June 2023

Motivation: Metagenomic samples have high spatiotemporal variability. Hence, it is useful to summarize and characterize the microbial makeup of a given environment in a way that is biologically reasonable and interpretable. The UniFrac metric has been a robust and widely used metric for measuring the variability between metagenomic samples. We propose that the characterization of metagenomic environments can be improved by finding the average, a.k.a. the barycenter, among the samples with respect to the UniFrac distance. However, it is possible that such a UniFrac-average includes negative entries, making it no longer a valid representation of a metagenomic community.

Results: To overcome this intrinsic issue, we propose a special version of the UniFrac metric, termed L2UniFrac, which inherits the phylogenetic nature of the traditional UniFrac and with respect to which one can easily compute the average, producing biologically meaningful environment-specific "representative samples." We demonstrate the usefulness of such representative samples as well as the extended usage of L2UniFrac in efficient clustering of metagenomic samples, and provide mathematical characterizations and proofs to the desired properties of L2UniFrac.

Availability And Implementation: A prototype implementation is provided at https://github.com/KoslickiLab/L2-UniFrac.git. All figures, data, and analysis can be reproduced at https://github.com/KoslickiLab/L2-UniFrac-Paper.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10311324PMC
http://dx.doi.org/10.1093/bioinformatics/btad238DOI Listing

Publication Analysis

Top Keywords

metagenomic samples
16
biologically meaningful
8
unifrac metric
8
metagenomic
6
samples
6
finding phylogeny-aware
4
phylogeny-aware biologically
4
meaningful averages
4
averages metagenomic
4
samples l2unifrac
4

Similar Publications

Comprehensive analysis of the interaction microbiome and prostate cancer: an initial exploration from multi-cohort metagenome and GWAS studies.

J Transl Med

January 2025

Department and Institute of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, No.1095 Jiefang Avenue, Wuhan, Wuhan, 430030, P.R. China.

Introduction: Prostate cancer is one of the most common cancers in the United States with a high mortality rate. In recent years, the traditional opinion about prostate microbiome was challenged. Although there still are some arguments, an escalating number of researchers are shifting their focus toward the microbiome within the prostate tumor environment.

View Article and Find Full Text PDF

Background: Maintaining gut health is a persistent and unresolved challenge in the poultry industry. Given the critical role of gut health in chicken performance and welfare, there is a pressing need to identify effective gut health intervention (GHI) strategies to ensure optimal outcomes in poultry farming. In this study, across three broiler production cycles, we compared the metagenomes and performance of broilers provided with ionophores (as the control group) against birds subjected to five different GHI combinations involving vaccination, probiotics, prebiotics, essential oils, and reduction of ionophore use.

View Article and Find Full Text PDF

The COVID-19 pandemic has underscored the importance of virus surveillance in public health and wastewater-based epidemiology (WBE) has emerged as a non-invasive, cost-effective method for monitoring SARS-CoV-2 and its variants at the community level. Unfortunately, current variant surveillance methods depend heavily on updated genomic databases with data derived from clinical samples, which can become less sensitive and representative as clinical testing and sequencing efforts decline.In this paper, we introduce HERCULES (High-throughput Epidemiological Reconstruction and Clustering for Uncovering Lineages from Environmental SARS-CoV-2), an unsupervised method that uses long-read sequencing of a single 1 Kb fragment of the Spike gene.

View Article and Find Full Text PDF

Magnesium (Mg) an essential plant nutrient is widespread deficient in the acidic soils of Nilgiris of Tamil nadu, India. The vegetable yield and quality is especially affected due to deficiency of nutrients like Mg. This study investigates soil characteristics and bacterial diversity in the Nilgiris district of Tamil Nadu, India, with respect to Mg deficiency.

View Article and Find Full Text PDF

Population studies provide insights into the interplay between the gut microbiome and geographical, lifestyle, genetic and environmental factors. However, low- and middle-income countries, in which approximately 84% of the world's population lives, are not equitably represented in large-scale gut microbiome research. Here we present the AWI-Gen 2 Microbiome Project, a cross-sectional gut microbiome study sampling 1,801 women from Burkina Faso, Ghana, Kenya and South Africa.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!