AI Article Synopsis

  • A deep convolutional neural network, named DeepVariant, has been developed to accurately identify genetic variants from complex sequencing data.
  • The model learns to recognize patterns in read pileups, significantly outperforming existing tools in variant calling.
  • DeepVariant is versatile, effectively working across different genome builds and species, and is applicable to various sequencing technologies, simplifying the variant identification process.

Article Abstract

Despite rapid advances in sequencing technologies, accurately calling genetic variants present in an individual genome from billions of short, errorful sequence reads remains challenging. Here we show that a deep convolutional neural network can call genetic variation in aligned next-generation sequencing read data by learning statistical relationships between images of read pileups around putative variant and true genotype calls. The approach, called DeepVariant, outperforms existing state-of-the-art tools. The learned model generalizes across genome builds and mammalian species, allowing nonhuman sequencing projects to benefit from the wealth of human ground-truth data. We further show that DeepVariant can learn to call variants in a variety of sequencing technologies and experimental designs, including deep whole genomes from 10X Genomics and Ion Ampliseq exomes, highlighting the benefits of using more automated and generalizable techniques for variant calling.

Download full-text PDF

Source
http://dx.doi.org/10.1038/nbt.4235DOI Listing

Publication Analysis

Top Keywords

sequencing technologies
8
universal snp
4
snp small-indel
4
small-indel variant
4
variant caller
4
caller deep
4
deep neural
4
neural networks
4
networks despite
4
despite rapid
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!