DNA damage is a pervasive cause of sequencing errors, directly confounding variant identification.

Science

New England Biolabs Inc., 240 County Road, Ipswich, MA 01938-2723, USA.

Published: February 2017

Mutations in somatic cells generate a heterogeneous genomic population and may result in serious medical conditions. Although cancer is typically associated with somatic variations, advances in DNA sequencing indicate that cell-specific variants affect a number of phenotypes and pathologies. Here, we show that mutagenic damage accounts for the majority of the erroneous identification of variants with low to moderate (1 to 5%) frequency. More important, we found signatures of damage in most sequencing data sets in widely used resources, including the 1000 Genomes Project and The Cancer Genome Atlas, establishing damage as a pervasive cause of sequencing errors. The extent of this damage directly confounds the determination of somatic variants in these data sets.

Download full-text PDF

Source
http://dx.doi.org/10.1126/science.aai8690DOI Listing

Publication Analysis

Top Keywords

damage pervasive
8
pervasive sequencing
8
sequencing errors
8
data sets
8
dna damage
4
sequencing
4
errors directly
4
directly confounding
4
confounding variant
4
variant identification
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!