Though statistical normalizations are often used in differential abundance or differential expression analysis to address sample-to-sample variation in sequencing depth, we offer a better alternative. These normalizations often make strong, implicit assumptions about the scale of biological systems (e.g., microbial load). Thus, analyses are susceptible to even slight errors in these assumptions, leading to elevated rates of false positives and false negatives. We introduce scale models as a generalization of normalizations so researchers can model potential errors in assumptions about scale. By incorporating scale models into the popular ALDEx2 software, we enhance the reproducibility of analyses while often drastically decreasing false positive and false negative rates. We design scale models that are guaranteed to reduce false positives compared to equivalent normalizations. At least in the context of ALDEx2, we recommend using scale models over normalizations in all practical situations.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11014594 | PMC |
http://dx.doi.org/10.1101/2024.04.01.587602 | DOI Listing |
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!