A Bayesian Model for SNP Discovery Based on Next-Generation Sequencing Data.

IEEE Int Workshop Genomic Signal Process Stat

Department of Bioinformatics and Computational Biology, The University of Texas, MD Anderson Cancer Center Houston, TX.

Published: December 2012

A single-nucleotide polymorphism (SNP) is a single base change in the DNA sequence and is the most common polymorphism. Since some SNPs have a major influence on disease susceptibility, detecting SNPs plays an important role in biomedical research. To take fully advantage of the next-generation sequencing (NGS) technology and detect SNP more effectively, we propose a Bayesian approach that computes a posterior probability of hidden nucleotide variations at each covered genomic position. The position with higher posterior probability of hidden nucleotide variation has a higher chance to be a SNP. We apply the proposed method to detect SNPs in two cell lines: the prostate cancer cell line PC3 and the embryonic stem cell line H1. A comparison between our results with dbSNP database shows a high ratio of overlap (>95%). The positions that are called only under our model but not in dbSNP may serve as candidates for new SNPs.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4697941PMC
http://dx.doi.org/10.1109/GENSIPS.2012.6507722DOI Listing

Publication Analysis

Top Keywords

next-generation sequencing
8
posterior probability
8
probability hidden
8
hidden nucleotide
8
bayesian model
4
snp
4
model snp
4
snp discovery
4
discovery based
4
based next-generation
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!