Comparative study of classifiers for human microbiome data.

Med Microecol

Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, 02115, USA.

Published: June 2020

Accumulated evidence has shown that commensal microorganisms play key roles in human physiology and diseases. Dysbiosis of the human-associated microbial communities, often referred to as the human microbiome, has been associated with many diseases. Applying supervised classification analysis to the human microbiome data can help us identify subsets of microorganisms that are highly discriminative and hence build prediction models that can accurately classify unlabeled samples. Here, we systematically compare two state-of-the-art ensemble classifiers: Random Forests (RF), eXtreme Gradient Boosting decision trees (XGBoost) and two traditional methods: The elastic net (ENET) and Support Vector Machine (SVM) in the classification analysis of 29 benchmark human microbiome datasets. We find that XGBoost outperforms all other methods only in a few benchmark datasets. Overall, the XGBoost, RF and ENET display comparable performance in the remaining benchmark datasets. The training time of XGBoost is much longer than others, partially due to the much larger number of hyperparameters in XGBoost. We also find that the most important features selected by the four classifiers partially overlap. Yet, the difference between their classification performance is almost independent of this overlap.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8345335PMC
http://dx.doi.org/10.1016/j.medmic.2020.100013DOI Listing

Publication Analysis

Top Keywords

human microbiome
16
microbiome data
8
classification analysis
8
benchmark datasets
8
human
5
xgboost
5
comparative study
4
study classifiers
4
classifiers human
4
microbiome
4

Similar Publications

is a dominant member of the human gut microbiome and produces short-chain fatty acids (SCFAs). These promote immune system function and inhibit inflammation, making this microbe important for human health. Lactate is a primary source of gut SCFAs but its utilization by has not been explored.

View Article and Find Full Text PDF

Modern treatment, a healthy diet, and physical activity routines lower the risk factors for metabolic syndrome; however, this condition is associated with all-cause and cardiovascular mortality worldwide. This investigation involved a randomized controlled trial, double-blind, parallel study. Fifty-eight participants with risk factors of metabolic syndrome according to the inclusion criteria were randomized into two groups and given probiotics (Lacticaseibacillus paracasei MSMC39-1 and Bifidobacterium animalis TA-1) (n = 31) or a placebo (n = 27).

View Article and Find Full Text PDF

Aging-induced Alternation in the Gut Microbiota Impairs Host Antibacterial Defense.

Adv Sci (Weinh)

January 2025

Department of Critical Care Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, 510280, China.

Older individuals experience increased susceptibility and mortality to bacterial infections, but the underlying etiology remains unclear. Herein, it is shown that aging-associated reduction of commensal Parabacteroides goldsteinii (P. goldsteinii) in both aged mice and humans critically contributes to worse outcomes of bacterial infection.

View Article and Find Full Text PDF

Differential Responses of Methylobacterium and Sphingomonas Species to Multispecies Interactions in the Phyllosphere.

Environ Microbiol

January 2025

Institute of Microbiology and Dahlem Centre of Plant Sciences, Department of Biology, Chemistry, Pharmacy, Freie Universität Berlin, Berlin, Germany.

The leaf surface, known as the phylloplane, presents an oligotrophic and heterogeneous environment due to its topography and uneven distribution of resources. Although it is a challenging environment, leaves support abundant bacterial communities that are spatially structured. However, the factors influencing these spatial distribution patterns are not well understood.

View Article and Find Full Text PDF

Tumor-infiltrating lymphocytes (TILs) are key components of the tumor microenvironment (TME) and serve as prognostic markers for breast cancer. Patients with high TIL infiltration generally experience better clinical outcomes and extended survival compared to those with low TIL infiltration. However, as the TME is highly complex and TIL subtypes perform distinct biological functions, TILs may only provide an approximate indication of tumor immune status, potentially leading to biased prognostic results.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!