Human microbiome research characterizes the microbial content of samples from human habitats to learn how interactions between bacteria and their host might affect human health. This work proposes a novel parametric statistical inference method, based on object-oriented data analysis (OODA), for analyzing Human Microbiome Project (HMP) data. OODA is an emerging area of statistical inference whose goal is to apply statistical methods to objects such as functions, images, and graphs or trees. The data objects in this work are taxonomic trees of bacteria built from analysis of 16S rRNA gene sequences (e.g., using the Ribosomal Database Project, RDP); there is one such tree for each biological sample analyzed. Our goal is to model and formally compare a set of trees. The contribution of our work is threefold. First, we introduce a weighted tree structure for analyzing RDP data. Second, using a probability measure to model a set of taxonomic trees, we introduce an approximate maximum likelihood estimation (MLE) procedure for the model parameters and derive likelihood ratio test (LRT) statistics for comparing the distributions of two metagenomic populations. Third, we analyze the Jumpstart HMP data with the proposed model, providing novel insights and directions for future analysis.
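
As a rough illustration of what a weighted taxonomic tree for one sample might look like, the sketch below aggregates per-read lineages into node weights. It is not the authors' construction: the input format (root-first tuples of taxon names), the function name build_weighted_tree, and the choice of read counts as weights are all assumptions made for this example.

```python
from collections import Counter

def build_weighted_tree(lineages):
    """Aggregate per-read lineages into a weighted tree.

    lineages: iterable of root-first tuples of taxon names, one per read
    (a hypothetical input format, not the RDP output format).
    Returns a Counter mapping each node, identified by its full path
    from the root, to the number of reads passing through it.
    """
    weights = Counter()
    for lineage in lineages:
        # Every prefix of the lineage is a node on the read's path.
        for depth in range(1, len(lineage) + 1):
            weights[tuple(lineage[:depth])] += 1
    return weights

# Toy sample with three reads (made-up data for illustration only).
sample_reads = [
    ("Bacteria", "Firmicutes", "Bacilli"),
    ("Bacteria", "Firmicutes", "Clostridia"),
    ("Bacteria", "Bacteroidetes"),
]
tree = build_weighted_tree(sample_reads)
```

Identifying nodes by their full path, rather than by taxon name alone, keeps taxa with the same name under different parents distinct. One such object would be built per biological sample, and a set of them is what the model would then compare.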

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3494672
PLOS: http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0048996
