Estimation of sequencing error rates in short reads.

BMC Bioinformatics

Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA 02215, USA.

Published: July 2012

Background: Short-read data from next-generation sequencing technologies are now being generated across a range of research projects. The fidelity of this data can be affected by several factors and it is important to have simple and reliable approaches for monitoring it at the level of individual experiments.

Results: We developed a fast, scalable and accurate approach to estimating error rates in short reads, which has the added advantage of not requiring a reference genome. We build on the fundamental observation that there is a linear relationship between the copy number for a given read and the number of erroneous reads that differ from the read of interest by one or two bases. The slope of this relationship can be transformed to give an estimate of the error rate, both by read and by position. We present simulation studies as well as analyses of real data sets illustrating the precision and accuracy of this method, and we show that it is more accurate than alternatives that count the difference between the sample of interest and a reference genome. We show how this methodology led to the detection of mutations in the genome of the PhiX strain used for calibration of Illumina data. The proposed method is implemented in an R package, which can be downloaded from http://bcb.dfci.harvard.edu/∼vwang/shadowRegression.html.

Conclusions: The proposed method can be used to monitor the quality of sequencing pipelines at the level of individual experiments without the use of reference genomes. Furthermore, having an estimate of the error rates gives one the opportunity to improve analyses and inferences in many applications of next-generation sequencing data.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3495688PMC
http://dx.doi.org/10.1186/1471-2105-13-185DOI Listing

Publication Analysis

Top Keywords

error rates
12
rates short
8
short reads
8
next-generation sequencing
8
level individual
8
reference genome
8
estimate error
8
proposed method
8
data
5
estimation sequencing
4

Similar Publications

Pre-enrichment-free electrochemical detection of lead ions using functionalized tungsten oxide: Integration of surface functionalization and redox cycling mechanisms.

Talanta

March 2025

State Key Laboratory of Integrated Optoelectronics, College of Electronic Science and Engineering, Jilin University, Changchun, 130000, PR China. Electronic address:

Methods for electrochemical detection of heavy metal ions have garnered widespread attention due to their high sensitivity, ease of operation, low cost, and suitability for on-site detection. However, these methods typically require a pre-enrichment step to improve the detection limit and sensitivity, which increases operational complexity and introduces potential errors. In this study, tungsten oxide electrodes with various functional groups were prepared by electrodeposition and high-temperature annealing, utilizing the amphoteric properties of l-alanine.

View Article and Find Full Text PDF

Medical image-based computational fluid dynamics (CFD) is a valuable tool for studying cardiovascular hemodynamics and its role in vascular pathologies. However, patient-specific flow rate measurements are rare. As a remedy, individual flow rates are typically estimated using anatomical features.

View Article and Find Full Text PDF

Assessing the value of environmental DNA into conservation planning: A case study of freshwater bivalves in France.

J Environ Manage

March 2025

CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Campus de Vairão, Universidade do Porto, 4485-661, Vairão, Portugal; BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, 4485-661, Vairão, Portugal; Institute of Systematics, Evolution, Biodiversity (ISYEB), National Museum of Natural History (MNHN), CNRS, SU, EPHE, UA, CP 51, 57 rue Cuvier, 75005, Paris, France; SPYGEN, 17, rue du Lac Saint-André, Savoie Technolac, 73375, Le Bourget du Lac, Cedex, France.

Environmental DNA (eDNA) is emerging as a valuable tool for generating standardised biodiversity data. Whether used alone or alongside conventional surveys, it is expected that eDNA data can improve conservation planning, but this remains largely untested. Here, we assessed the value of eDNA data for making more robust conservation prioritisation decisions, by comparing errors made when identifying priority areas based on conventional data, eDNA or a combination of both.

View Article and Find Full Text PDF

Estimating the time since death, or the postmortem interval (PMI), is a significant component of forensic anthropological analysis when human remains are discovered. Despite decades of research, methods for providing an estimate of the PMI with known error rates are still absent from applied medicolegal forensic work, which prompts the necessity for a critical examination of the state of the field. This review details the history of how forensic anthropology emerged from the broader discipline of biological anthropology, with a specific focus on how forensic anthropologists came to be understood as suitable experts for estimating the PMI.

View Article and Find Full Text PDF

Objective: The primary aim of this investigation is to assess the effectiveness of implementing an innovative immobilization approach, spec ically the utilization of personalized open-face masks in combination with styrofoam fixation, for head and neck cancers receiving radiotherapy. The study seeks to evaluate the influence of this method on improving patients' precision in positioning and their overall comfort during the treatment process, in addition to exploring its potential capacity to mitigate the occurrence of anxiety and depression in this patient population.

Methods: A prospective, randomized controlled trial was undertaken to investigate the comparative efficacy of two immobilization approaches for the radiotherapy treatment of head and neck cancers.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!