Background: Differential abundance testing is an important aspect of microbiome data analysis, where each taxa is fitted with a statistical test or a regression model. However, many models do not provide a good fit to real microbiome data. This has been shown to result in high false positive rates. Permutation tests are a good alternative, but a regression approach is desired for small data sets with many covariates, where stratification is not an option.

Results: We implement an R package 'llperm' where the The Permutation of Regressor Residuals (PRR) test can be applied to any likelihood based model, not only generalized linear models. This enables distributions with zero-inflation and overdispersion, making the test suitable for count regression models popular in microbiome data analysis. Simulations based on a real data set show that the PRR-test approach is able to maintain the correct nominal false positive rate expected from the null hypothesis, while having equal or greater power to detect the true positives as models based on likelihood at a given false positive rate.

Conclusions: Standard count regression models can have a shockingly high false positive rate in microbiome data sets. As they may lead to false conclusions, the guaranteed nominal false positive rate gained from the PRR-test can be viewed as a major benefit.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9743778PMC
http://dx.doi.org/10.1186/s12859-022-05088-wDOI Listing

Publication Analysis

Top Keywords

microbiome data
20
false positive
20
positive rate
12
permutation regressor
8
regressor residuals
8
data analysis
8
high false
8
data sets
8
count regression
8
regression models
8

Similar Publications

Background: Non-human primates (NHP) serve as an important bridge for testing therapeutic agents that have been previously shown to be effective in transgenic mouse models. Our earlier published data using an NHP model of sporadic AD-related pathology that develops abundant cerebral amyloid angiopathy (CAA), squirrel monkeys (SQMs), indicates that chronic treatment with TLR9 agonist, class B CpG ODN, safely ameliorates CAA while promoting cognitive benefits. In the present study, we intended to delineate alterations in brain metabolome induced by chronic CpG ODN administration in order to provide further insight into CpG ODN immunomodulatory capabilities.

View Article and Find Full Text PDF

Background: Spousal care partners to people with dementia (PWD) have a higher rate of depression and anxiety when compared to similar age controls. Previous studies have suggested a role of gut microbiota in the pathophysiology of neuropsychiatric symptoms and Alzheimer's disease (AD). Thus, our study aims to: (1) determine the presence and severity of depression and anxiety in care partners of PWD, and (2) determine the concentrations of short chain fatty acids (SCFA), which are mainly produced by gut microbiota and are important in mediating gut microbiota effects, in the blood of care partners of PWD.

View Article and Find Full Text PDF

Background: Gut microbiota-derived metabolite Trimethylamine-N-oxide (TMAO) is increasingly recognized as a potential novel prognostic biomarker for cardiovascular disease. Our research work aimed to investigate the potential utility of TMAO measurement in patients with STelevation Myocardial Infarction (STEMI).

Methods: We performed a systematic literature search in PubMed from inception to the 1st of February 2024 to identify all studies examining the association between plasma TMAO levels and disease complexity or clinical outcomes in STEMI patients.

View Article and Find Full Text PDF

Global trends and risk factors in gastric cancer: a comprehensive analysis of the Global Burden of Disease Study 2021 and multi-omics data.

Int J Med Sci

January 2025

Medical Oncology Department of Gastrointestinal Cancer, Cancer Hospital of Dalian University of Technology, Liaoning Cancer Hospital & Institute, No.44 Xiaoheyan Road, Dadong District, Shenyang 110042, Liaoning Province, China.

Gastric cancer (GC) remains a significant global health challenge. This study aimed to comprehensively analyze GC epidemiology and risk factors to inform prevention and intervention strategies. We analyzed the Global Burden of Disease Study 2021 data, conducted 16 different machine learning (ML) models of NHANES data, performed Mendelian randomization (MR) studies on disease phenotypes, dietary preferences, microbiome, blood-based markers, and integrated differential gene expression and expression quantitative trait loci (eQTL) data from multiple cohorts to identify factors associated with GC risk.

View Article and Find Full Text PDF

stana: an R package for metagenotyping analysis and interactive application based on clinical data.

NAR Genom Bioinform

March 2025

Division of Health Medical Intelligence, Human Genome Center, The Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639, Japan.

Metagenotyping of metagenomic data has recently attracted increasing attention as it resolves intraspecies diversity by identifying single nucleotide variants. Furthermore, gene copy number analysis within species provides a deeper understanding of metabolic functions in microbial communities. However, a platform for examining metagenotyping results based on relevant grouping data is lacking.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!