AI Article Synopsis

  • The study explores two methods for assigning taxonomic classifications to operational taxonomic units (OTUs) from environmental sequencing data: VSEARCH (pairwise alignment) and EPA-ng (phylogenetic placement), highlighting differences in their results.
  • It finds that discrepancies between the two methods mainly occur when OTUs have low similarity to reference databases, suggesting potential challenges in accurate taxonomic classification at the subtaxon level.
  • The research emphasizes using GAPPA alongside EPA-ng for better evaluation of taxonomic assignments, particularly in cases with multiple placement options, and integrates evolutionary and ecological insights of the studied colpodean OTUs.

Article Abstract

Taxonomic assignment of operational taxonomic units (OTUs) is an important bioinformatics step in analyzing environmental sequencing data. Pairwise alignment and phylogenetic-placement methods represent two alternative approaches to taxonomic assignments, but their results can differ. Here we used available colpodean ciliate OTUs from forest soils to compare the taxonomic assignments of VSEARCH (which performs pairwise alignments) and EPA-ng (which performs phylogenetic placements). We showed that when there are differences in taxonomic assignments between pairwise alignments and phylogenetic placements at the subtaxon level, there is a low pairwise similarity of the OTUs to the reference database. We then showcase how the output of EPA-ng can be further evaluated using GAPPA to assess the taxonomic assignments when there exist multiple equally likely placements of an OTU, by taking into account the sum over the likelihood weights of the OTU placements within a subtaxon, and the branch distances between equally likely placement locations. We also inferred the evolutionary and ecological characteristics of the colpodean OTUs using their placements within subtaxa. This study demonstrates how to fully analyze the output of EPA-ng, by using GAPPA in conjunction with knowledge of the taxonomic diversity of the clade of interest.

Download full-text PDF

Source
http://dx.doi.org/10.1111/jeu.12990DOI Listing

Publication Analysis

Top Keywords

taxonomic assignments
16
phylogenetic placements
12
taxonomic
8
taxonomic assignment
8
pairwise alignments
8
placements subtaxon
8
output epa-ng
8
placements
6
interpreting phylogenetic
4
placements taxonomic
4

Similar Publications

Background: Alzheimer's disease (AD) missing heritability remains extensive despite numerous genetic risk loci identified by genome-wide association or sequencing studies. This has been attributed, at least partially, to mechanisms not currently investigated by traditional single-marker/gene approaches. Polygenic Risk Scores (PRS) aggregate sparse genetic information across the genome to identify individual genetic risk profiles for disease prediction and patient risk stratification.

View Article and Find Full Text PDF

Assessing the Comorbidity Cycle Between Psoriasis and Addiction Based on ICD Coding in the Stockholm Psoriasis Cohort.

Acta Derm Venereol

January 2025

Technical University of Munich, TUM School of Medicine and Health, Department of Dermatology and Allergy, Munich, Germany; Division of Dermatology and Venereology, Department of Medicine Solna, Karolinska Institutet, Stockholm, Sweden.

The comorbidity cycle between psoriasis and addictions remains unclear. The study aimed to investigate the cumulative incidence of addictions in psoriasis patients and controls in the Stockholm Psoriasis Cohort (SPC). The SPC is an observational cohort study that enrolled psoriasis patients between 2001 and 2005 and matched controls using the Swedish Total Population Register.

View Article and Find Full Text PDF

Background: The International Classification of Functioning, Disability and Health (ICF) is the worldwide terminology system for measuring health and disability at both individual and population levels. However, the underlying challenges remain in achieving widespread adoption and implementation of ICF within healthcare contexts, including the unequal interval scale of the ICF Likert-type qualifier system ranging from 0 to 4, a lack of consensus on conceptualization and grading criteria of ICF items, and an excessive number of ICF items associated with each disease. The utilization of item response theory (IRT) for ICF studies demonstrated potential benefits in addressing these issues.

View Article and Find Full Text PDF

A dual-stage model for classifying Parkinson's disease severity, through a detailed analysis of Gait signals using force sensors and machine learning approaches, is proposed in this study. Parkinson's disease is the primary neurodegenerative disorder that results in a gradual reduction in motor function. Early detection and monitoring of the disease progression is highly challenging due to the gradual progression of symptoms and the inadequacy of conventional methods in identifying subtle changes in mobility.

View Article and Find Full Text PDF

Molecular structure prediction and homology detection offer promising paths to discovering protein function and evolutionary relationships. However, current approaches lack statistical reliability assurances, limiting their practical utility for selecting proteins for further experimental and in-silico characterization. To address this challenge, we introduce a statistically principled approach to protein search leveraging principles from conformal prediction, offering a framework that ensures statistical guarantees with user-specified risk and provides calibrated probabilities (rather than raw ML scores) for any protein search model.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!