Until recently, single nucleotide polymorphism (SNP) discovery in nonmodel organisms faced many challenges, often depending upon a targeted-gene approach and Sanger sequencing of many individuals. The advent of next-generation sequencing technologies has dramatically improved discovery, but validating and testing SNPs for use in population studies remain labour intensive. Here, we detail a SNP discovery and validation pipeline that incorporates 454 pyrosequencing, high-resolution melt analysis (HRMA) and 5' nuclease genotyping. We generated 4.59×10(8) bp of redundant sequence from transcriptomes of two individual chum salmon, a highly valued species across the Pacific Rim. Nearly 26000 putative SNPs were identified--some as heterozygotes and some as homozygous for different nucleotides in the two individuals. For validation, we selected 202 templates containing single putative SNPs and conducted HRMA on 10 individuals from each of 19 populations from across the species range. Finally, 5' nuclease genotyping validated 37 SNPs that conformed to Hardy-Weinberg equilibrium expectations. Putative SNPs expressed as heterozygotes in an ascertainment individual had more than twice the validation rate of those homozygous for different alleles in the two fish, suggesting that many of the latter may have been paralogous sequence variants. Overall, this validation rate of 37/202 suggests that we have found more than 4500 templates containing SNPs for use in this population set. We anticipate using this pipeline to significantly expand the number of SNPs available for the studies of population structure and mixture analyses as well as for the studies of adaptive genetic variation in nonmodel organisms.

Download full-text PDF

Source
http://dx.doi.org/10.1111/j.1755-0998.2010.02936.xDOI Listing

Publication Analysis

Top Keywords

putative snps
12
high-resolution melt
8
melt analysis
8
single nucleotide
8
nucleotide polymorphism
8
snp discovery
8
nonmodel organisms
8
snps population
8
nuclease genotyping
8
validation rate
8

Similar Publications

Antibiograms have been used during outbreak investigations for decades as a surrogate for genetic relatedness of Methicillin-resistant (MRSA). In this study, we evaluate the accuracy of antibiograms in detecting transmission, using genomic epidemiology as the reference standard. We analysed epidemiological and genomic data from 1,465 patients and 1,465 MRSA isolates collected at a single clinical microbiology laboratory in the United Kingdom over a one-year period.

View Article and Find Full Text PDF

Forensic Assessment of Kinship, Genomic Ancestry, and Natural History of an Iconic Tiger of Harlem-New York City.

J Hered

January 2025

The State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences; Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies; Institute of Ecology, Peking University, Beijing 100871, China.

In the fall of 2003, a two-year-old tiger named Ming, weighing some four hundred pounds, was discovered living in an apartment in Harlem, New York. Ming's rescue by NYPD was witnessed, recalled, and venerated by scores of neighbors. The tiger's history and ancestry stimulated considerable media interest, investigative sleuthing, and forensic genomic analyses.

View Article and Find Full Text PDF

Low-density lipoprotein cholesterol (LDL-C) is a well-established risk factor for cardiovascular disease, and it plays a causal role in the development of atherosclerosis. Genome-wide association studies (GWASs) have successfully identified hundreds of genetic variants associated with LDL-C. Most of these risk loci fall in non-coding regions of the genome, and it is unclear how these non-coding variants affect circulating lipid levels.

View Article and Find Full Text PDF

Genomic analysis of three medieval parchments from German monasteries.

Sci Rep

January 2025

Breeding Informatics Group, Department of Animal Sciences, Georg-August University, 37075, Göttingen, Germany.

In the last two decades there has been growing interest in the analysis of ancient DNA obtained from the parchment used in historic documents. The genetic insight that this data provides makes collections of historic documents an invaluable source for studying the development and spread of historical livestock populations. Additionally, the biological data may provide new information for the historical analysis that could be used to determine the provenance as well as the authenticity of these documents.

View Article and Find Full Text PDF

Congenital heart disease (CHD) is a prevalent condition characterized by defective heart development, causing premature death and stillbirths among infants. Genome-wide association studies (GWASs) have provided insights into the role of genetic variants in CHD pathogenesis through the identification of a comprehensive set of single-nucleotide polymorphisms (SNPs). Notably, 90-95% of these variants reside in the noncoding genome, complicating the understanding of their underlying mechanisms.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!