AI Article Synopsis

  • Popular naive Bayes classifiers for amplicon sequences incorrectly assume that all species in a reference database are equally likely to appear, leading to decreased accuracy as this assumption is often untrue.
  • By using environment-specific data on taxonomic abundance, species-level classification accuracy significantly improves, with error rates dropping from 25% to 14%.
  • The tool q2-clawback offers a simple solution for enhancing classification in common sample types, making it a better alternative to traditional methods.

Article Abstract

Popular naive Bayes taxonomic classifiers for amplicon sequences assume that all species in the reference database are equally likely to be observed. We demonstrate that classification accuracy degrades linearly with the degree to which that assumption is violated, and in practice it is always violated. By incorporating environment-specific taxonomic abundance information, we demonstrate a significant increase in the species-level classification accuracy across common sample types. At the species level, overall average error rates decline from 25% to 14%, which is favourably comparable to the error rates that existing classifiers achieve at the genus level (16%). Our findings indicate that for most practical purposes, the assumption that reference species are equally likely to be observed is untenable. q2-clawback provides a straightforward alternative for samples from common environments.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6789115PMC
http://dx.doi.org/10.1038/s41467-019-12669-6DOI Listing

Publication Analysis

Top Keywords

classification accuracy
12
equally observed
8
error rates
8
species
4
species abundance
4
abundance improves
4
improves sequence
4
sequence taxonomy
4
taxonomy classification
4
accuracy popular
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!