Genetic variation and the de novo assembly of human genomes.

Nat Rev Genet

Department of Genome Sciences, University of Washington, Foege Building S-413A, Box 355065, 3720 15th Ave NE, Seattle, Washington 98195, USA.

Published: November 2015

The discovery of genetic variation and the assembly of genome sequences are both inextricably linked to advances in DNA-sequencing technology. Short-read massively parallel sequencing has revolutionized our ability to discover genetic variation but is insufficient to generate high-quality genome assemblies or resolve most structural variation. Full resolution of variation is only guaranteed by complete de novo assembly of a genome. Here, we review approaches to genome assembly, the nature of gaps or missing sequences, and biases in the assembly process. We describe the challenges of generating a complete de novo genome assembly using current technologies and the impact that being able to perfectly sequence the genome would have on understanding human disease and evolution. Finally, we summarize recent technological advances that improve both contiguity and accuracy and emphasize the importance of complete de novo assembly as opposed to read mapping as the primary means to understanding the full range of human genetic variation.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4745987PMC
http://dx.doi.org/10.1038/nrg3933DOI Listing

Publication Analysis

Top Keywords

genetic variation
16
novo assembly
12
complete novo
12
assembly genome
8
genome assembly
8
assembly
7
genome
6
variation
5
genetic
4
novo
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!