A web application for sample size and power calculation in case-control microbiome studies.

Bioinformatics

Department of Mathematical Modelling, Statistics and Bioinformatics, Ghent University, Coupure Links 653, Gent, 9000 University of Wollongong, National Institute for Applied Statistics Research Australia (NIASRA), School of Mathematics and Applied Statistics, Australia.

Published: July 2016

Unlabelled: : When designing a case-control study to investigate differences in microbial composition, it is fundamental to assess the sample sizes needed to detect an hypothesized difference with sufficient statistical power. Our application includes power calculation for (i) a recoded version of the two-sample generalized Wald test of the 'HMP' R-package for comparing community composition, and (ii) the Wilcoxon-Mann-Whitney test for comparing operational taxonomic unit-specific abundances between two samples (optional). The simulation-based power calculations make use of the Dirichlet-Multinomial model to describe and generate abundances. The web interface allows for easy specification of sample and effect sizes. As an illustration of our application, we compared the statistical power of the two tests, with and without stratification of samples. We observed that statistical power increases considerably when stratification is employed, meaning that less samples are needed to detect the same effect size with the same power.

Availability And Implementation: The web interface is written in R code using Shiny (RStudio Inc., 2016) and it is available at https://fedematt.shinyapps.io/shinyMB The R code for the recoded generalized Wald test can be found at https://github.com/mafed/msWaldHMP CONTACT: Federico.Mattiello@UGent.be.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btw099DOI Listing

Publication Analysis

Top Keywords

statistical power
12
power calculation
8
sample sizes
8
needed detect
8
generalized wald
8
wald test
8
web interface
8
power
6
web application
4
application sample
4

Similar Publications

Background: Alzheimer's Disease (AD) incidence is almost double in female than male, suggesting sex-specific AD risk genes remain unknown.

Method: We designed a statistical physics approach that exploits freely available but massive evolutionary and phylogenetic coupling data on sequence variation and speciation. These couplings lead to quantifiable values for the selection pressure exerted on the genes within a population.

View Article and Find Full Text PDF

Background: Alcohol Use Disorder (AUD) affects over 15 million individuals in the United States, contributing to oxidative stress, neuroinflammation, and elevating the risk of neurodegeneration. Despite this, the connection between AUD and aging conditions, particularly Alzheimer's disease (AD), remains unclear. AD, with a heritability of 60-80%, is genetically linked, necessitating an exploration of the molecular implications of AUD and genetic susceptibility to AD.

View Article and Find Full Text PDF

Basic Science and Pathogenesis.

Alzheimers Dement

December 2024

Department of Population and Quantitative Health Sciences, Institute for Computational Biology, Case Western Reserve University, Cleveland, OH, USA.

Background: Many independent studies have found rare variants associated with AD. Current gene-based tests for rare-variants generally consider the impact of low-frequency coding variants as an independent effect from the common regulatory variants that surround them. In this work, we propose to increase the statistical power of kernel-based rare-variant association tests by accounting for the surrounding cis-regulatory variants' effects on gene expression.

View Article and Find Full Text PDF

Basic Science and Pathogenesis.

Alzheimers Dement

December 2024

Department of Population and Quantitative Health Sciences, Institute for Computational Biology, Case Western Reserve University, Cleveland, OH, USA.

Background: Recent advances in Alzheimer's Disease (AD) research have emphasized the importance of recruiting from diverse populations. Notably, African-descent individuals have an almost doubled risk of developing AD compared to European-descent individuals. Transcriptome-wide association studies (TWAS) have advanced the analysis of non-coding variants by integrating gene expression with GWAS data.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!