Background: In shotgun metagenomics, microbial communities are studied through direct sequencing of DNA without any prior cultivation. By comparing gene abundances estimated from the generated sequencing reads, functional differences between the communities can be identified. However, gene abundance data is affected by high levels of systematic variability, which can greatly reduce the statistical power and introduce false positives. Normalization, which is the process where systematic variability is identified and removed, is therefore a vital part of the data analysis. A wide range of normalization methods for high-dimensional count data has been proposed but their performance on the analysis of shotgun metagenomic data has not been evaluated.
Results: Here, we present a systematic evaluation of nine normalization methods for gene abundance data. The methods were evaluated through resampling of three comprehensive datasets, creating a realistic setting that preserved the unique characteristics of metagenomic data. Performance was measured in terms of the methods ability to identify differentially abundant genes (DAGs), correctly calculate unbiased p-values and control the false discovery rate (FDR). Our results showed that the choice of normalization method has a large impact on the end results. When the DAGs were asymmetrically present between the experimental conditions, many normalization methods had a reduced true positive rate (TPR) and a high false positive rate (FPR). The methods trimmed mean of M-values (TMM) and relative log expression (RLE) had the overall highest performance and are therefore recommended for the analysis of gene abundance data. For larger sample sizes, CSS also showed satisfactory performance.
Conclusions: This study emphasizes the importance of selecting a suitable normalization methods in the analysis of data from shotgun metagenomics. Our results also demonstrate that improper methods may result in unacceptably high levels of false positives, which in turn may lead to incorrect or obfuscated biological interpretation.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5910605 | PMC |
http://dx.doi.org/10.1186/s12864-018-4637-6 | DOI Listing |
Sci Rep
December 2024
Department of Agronomy, Faculty of Agriculture and Environment, The Islamia University of Bahawalpur, Bahawalpur, 63100, Pakistan.
Climate change has caused many challenges to soil ecosystems, including soil salinity. Consequently, many strategies are advised to mitigate this issue. In this context, biochar is acknowledged as a useful addition that can alleviate the detrimental impacts of salt stress on plants.
View Article and Find Full Text PDFSci Rep
December 2024
School of Psychology, Inner Mongolia Normal University, Hohhot, China.
The purpose of this study was to evaluate the psychometric properties of the Chinese version of the Revised Indebtedness Scale (IS-R-C) in mainland China. A total of 1057 university students participated in this study using a two-wave whole-group sampling method. Sample 1, consisting of 537 participants, was used for item analysis and exploratory factor analysis (EFA) of the Revised Indebtedness Scale (IS-R).
View Article and Find Full Text PDFSci Rep
December 2024
School of Physical Education, Shanghai University of Sport, Shanghai, 200438, China.
Objective: This study aimed to examine the levels of physical activity (PA), sleep, and mental health (MH), specifically depression, anxiety, and stress, among Chinese university students. It also aimed to analyze the influencing factors of MH, providing a theoretical foundation for developing intervention programs to improve college students' mental health.
Methods: A stratified, clustered, and phased sampling method was employed.
Sci Rep
December 2024
Institute for Forest Resources and Environment of Guizhou, College of Forestry, Guizhou University, Guiyang, 550025, Guizhou, China.
This study aims to explore the low phosphorus (P) tolerance of saplings from different Gleditsia sinensis Lam. families. It also seeks to screen for Gleditsia sinensis families with strong low P tolerance and identify key indicators for evaluating their tolerance.
View Article and Find Full Text PDFNat Commun
December 2024
Laboratory of Cellular Biophysics, The Rockefeller University, New York, NY, USA.
Fibrolamellar Hepatocellular Carcinoma (FLC) is a rare liver cancer characterized by a fusion oncokinase of the genes DNAJB1 and PRKACA, the catalytic subunit of protein kinase A (PKA). A few FLC-like tumors have been reported showing other alterations involving PKA. To better understand FLC pathogenesis and the relationships among FLC, FLC-like, and other liver tumors, we performed a massive multi-omics analysis.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!