Efficient and Accurate Inference of Mixed Microbial Population Trajectories from Longitudinal Count Data.

Cell Syst

Department of Computer Science, Columbia University, New York, NY 10027, USA; Department of Systems Biology, Columbia University, New York, NY 10027, USA; Data Science Institute, Columbia University, New York, NY 10027, USA. Electronic address:

Published: June 2020

The recently completed second phase of the Human Microbiome Project has highlighted the relationship between dynamic changes in the microbiome and disease, motivating new microbiome study designs based on longitudinal sampling. Yet, analysis of such data is hindered by presence of technical noise, high dimensionality, and data sparsity. Here, we introduce LUMINATE (longitudinal microbiome inference and zero detection), a fast and accurate method for inferring relative abundances from noisy read count data. We demonstrate that LUMINATE is orders of magnitude faster than current approaches, with better or similar accuracy. We further show that LUMINATE can accurately distinguish biological zeros, when a taxon is absent from the community, from technical zeros, when a taxon is below the detection threshold. We conclude by demonstrating the utility of LUMINATE on a real dataset, showing that LUMINATE smooths trajectories observed from noisy data. LUMINATE is freely available from https://github.com/tyjo/luminate.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.cels.2020.05.006DOI Listing

Publication Analysis

Top Keywords

count data
8
zeros taxon
8
luminate
6
data
5
efficient accurate
4
accurate inference
4
inference mixed
4
mixed microbial
4
microbial population
4
population trajectories
4

Similar Publications

Motivation: Understanding the associations between traits and microbial composition is a fundamental objective in microbiome research. Recently, researchers have turned to machine learning (ML) models to achieve this goal with promising results. However, the effectiveness of advanced ML models is often limited by the unique characteristics of microbiome data, which are typically high-dimensional, compositional, and imbalanced.

View Article and Find Full Text PDF

Background: Endodontic treatment aims in the preservation of extremely carious primary teeth. For root canal therapy to be successful, root canals must be properly prepared and effectively irrigated .Therefore, it is necessary to select the proper root canal disinfection method to preserve the primary tooth.

View Article and Find Full Text PDF

Background: The health benefits of physical activity, including walking, are well-established, but the relationship between daily step count and mortality in hypertensive populations remains underexplored. This study investigates the association between daily step count and both all-cause and cardiovascular mortality in hypertensive American adults.

Methods: We used data from the National Health and Nutrition Examination Survey 2005-2006, including 1,629 hypertensive participants with accelerometer-measured step counts.

View Article and Find Full Text PDF

Background: Abnormal results in commonly used primary care blood tests could be early markers of cancer in patients presenting with non-specific abdominal symptoms.

Methods: Using linked data from the UK Clinical Practice Research Datalink (CPRD) and national cancer registry we compared blood test use and abnormal results from the 24-months pre-diagnosis in 10,575 cancer patients (any site), and 52,875 matched-controls aged ≥30 presenting, with abdominal pain or bloating to primary care.

Results: Cancer patients had two-fold increased odds of having a blood test (odds ratio(OR):1.

View Article and Find Full Text PDF

In order to reduce the number of parameters in the Chinese herbal medicine recognition model while maintaining accuracy, this paper takes 20 classes of Chinese herbs as the research object and proposes a recognition network based on knowledge distillation and cross-attention - ShuffleCANet (ShuffleNet and Cross-Attention). Firstly, transfer learning was used for experiments on 20 classic networks, and DenseNet and RegNet were selected as dual teacher models. Then, considering the parameter count and recognition accuracy, ShuffleNet was determined as the student model, and a new cross-attention mechanism was proposed.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!