DRISEE overestimates errors in metagenomic sequencing data.

A Murat Eren Hilary G Morrison Susan M Huse Mitchell L Sogin

Brief Bioinform

Published: September 2014

The extremely high error rates reported by Keegan et al. in 'A platform-independent method for detecting errors in metagenomic sequencing data: DRISEE' (PLoS Comput Biol 2012; 8: :e1002541) for many next-generation sequencing datasets prompted us to re-examine their results. Our analysis reveals that the presence of conserved artificial sequences, e.g. Illumina adapters, and other naturally occurring sequence motifs accounts for most of the reported errors. We conclude that DRISEE reports inflated levels of sequencing error, particularly for Illumina data. Tools offered for evaluating large datasets need scrupulous review before they are implemented.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4171678	PMC
http://dx.doi.org/10.1093/bib/bbt010	DOI Listing

Publication Analysis

Top Keywords

errors metagenomic

metagenomic sequencing

sequencing data

drisee overestimates

overestimates errors

sequencing

data extremely

extremely high

high error

error rates

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!