SeqControl: process control for DNA sequencing.

Nat Methods

1] Informatics &Biocomputing Platform, Ontario Institute for Cancer Research, Toronto, Ontario, Canada. [2] Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada. [3] Department of Pharmacology and Toxicology, University of Toronto, Toronto, Ontario, Canada.

Published: October 2014

As high-throughput sequencing continues to increase in speed and throughput, routine clinical and industrial application draws closer. These 'production' settings will require enhanced quality monitoring and quality control to optimize output and reduce costs. We developed SeqControl, a framework for predicting sequencing quality and coverage using a set of 15 metrics describing overall coverage, coverage distribution, basewise coverage and basewise quality. Using whole-genome sequences of 27 prostate cancers and 26 normal references, we derived multivariate models that predict sequencing quality and depth. SeqControl robustly predicted how much sequencing was required to reach a given coverage depth (area under the curve (AUC) = 0.993), accurately classified clinically relevant formalin-fixed, paraffin-embedded samples, and made predictions from as little as one-eighth of a sequencing lane (AUC = 0.967). These techniques can be immediately incorporated into existing sequencing pipelines to monitor data quality in real time. SeqControl is available at http://labs.oicr.on.ca/Boutros-lab/software/SeqControl/.

Download full-text PDF

Source
http://dx.doi.org/10.1038/nmeth.3094DOI Listing

Publication Analysis

Top Keywords

sequencing quality
8
sequencing
7
quality
6
coverage
5
seqcontrol
4
seqcontrol process
4
process control
4
control dna
4
dna sequencing
4
sequencing high-throughput
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!