Beyond Normalization: Incorporating Scale Uncertainty in Microbiome and Gene Expression Analysis.

bioRxiv

College of Information Science and Technology, Pennsylvania State University, University Park, PA, USA.

Published: April 2024

Though statistical normalizations are often used in differential abundance or differential expression analysis to address sample-to-sample variation in sequencing depth, we offer a better alternative. These normalizations often make strong, implicit assumptions about the scale of biological systems (e.g., microbial load). Thus, analyses are susceptible to even slight errors in these assumptions, leading to elevated rates of false positives and false negatives. We introduce scale models as a generalization of normalizations so researchers can model potential errors in assumptions about scale. By incorporating scale models into the popular ALDEx2 software, we enhance the reproducibility of analyses while often drastically decreasing false positive and false negative rates. We design scale models that are guaranteed to reduce false positives compared to equivalent normalizations. At least in the context of ALDEx2, we recommend using scale models over normalizations in all practical situations.
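To illustrate how a scale model can generalize a normalization, here is a toy sketch in Python. It is not the ALDEx2 implementation. It assumes a centered log-ratio (CLR) style normalization, which implicitly fixes each sample's log scale at the negative mean of its log proportions, and a hypothetical scale model that adds Gaussian noise with standard deviation `gamma` to that implied log scale. The function names, the per-sample Gaussian form, the `gamma` parameter, the Dirichlet prior of 0.5, and the count table are all assumptions made for illustration.

```python
import numpy as np

def clr_log_abundance(log_props):
    """CLR-style normalization: subtract each sample's mean log proportion.

    log_props has shape (n_taxa, n_samples).
    """
    return log_props - log_props.mean(axis=0, keepdims=True)

def scale_model_log_abundance(log_props, gamma, rng):
    """Hypothetical scale model (an assumption, not ALDEx2's code).

    Uses the log scale implied by the CLR as the mean and adds
    N(0, gamma^2) noise per sample to represent uncertainty in scale.
    """
    implied_log_scale = -log_props.mean(axis=0, keepdims=True)  # shape (1, n_samples)
    noise = rng.normal(0.0, gamma, size=implied_log_scale.shape)
    return log_props + implied_log_scale + noise

rng = np.random.default_rng(0)

# Made-up count table: 3 taxa (rows) x 2 samples (columns).
counts = np.array([[10, 20], [30, 40], [60, 40]])

# One Monte Carlo draw of proportions per sample from a Dirichlet posterior.
props = np.column_stack(
    [rng.dirichlet(counts[:, j] + 0.5) for j in range(counts.shape[1])]
)
log_props = np.log(props)

clr = clr_log_abundance(log_props)
no_uncertainty = scale_model_log_abundance(log_props, gamma=0.0, rng=rng)
with_uncertainty = scale_model_log_abundance(log_props, gamma=0.5, rng=rng)
```

With `gamma=0.0` the sketch reproduces the CLR values exactly, which is the sense in which the normalization is a special case of the scale model. With `gamma > 0`, every taxon in a sample is shifted by the same random offset, so repeating the draw many times would propagate uncertainty about scale into any downstream comparison.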


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11014594
DOI: http://dx.doi.org/10.1101/2024.04.01.587602
