Until recently, single nucleotide polymorphism (SNP) discovery in nonmodel organisms faced many challenges, often depending upon a targeted-gene approach and Sanger sequencing of many individuals. The advent of next-generation sequencing technologies has dramatically improved discovery, but validating and testing SNPs for use in population studies remain labour intensive. Here, we detail a SNP discovery and validation pipeline that incorporates 454 pyrosequencing, high-resolution melt analysis (HRMA) and 5' nuclease genotyping. We generated 4.59×10(8) bp of redundant sequence from transcriptomes of two individual chum salmon, a highly valued species across the Pacific Rim. Nearly 26000 putative SNPs were identified--some as heterozygotes and some as homozygous for different nucleotides in the two individuals. For validation, we selected 202 templates containing single putative SNPs and conducted HRMA on 10 individuals from each of 19 populations from across the species range. Finally, 5' nuclease genotyping validated 37 SNPs that conformed to Hardy-Weinberg equilibrium expectations. Putative SNPs expressed as heterozygotes in an ascertainment individual had more than twice the validation rate of those homozygous for different alleles in the two fish, suggesting that many of the latter may have been paralogous sequence variants. Overall, this validation rate of 37/202 suggests that we have found more than 4500 templates containing SNPs for use in this population set. We anticipate using this pipeline to significantly expand the number of SNPs available for the studies of population structure and mixture analyses as well as for the studies of adaptive genetic variation in nonmodel organisms.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1111/j.1755-0998.2010.02936.x | DOI Listing |
Infect Prev Pract
March 2025
Department of Medicine, University of Cambridge, Box 157 Addenbrooke's Hospital, Hills Road, Cambridge, CB2 0QQ, UK.
Antibiograms have been used during outbreak investigations for decades as a surrogate for genetic relatedness of Methicillin-resistant (MRSA). In this study, we evaluate the accuracy of antibiograms in detecting transmission, using genomic epidemiology as the reference standard. We analysed epidemiological and genomic data from 1,465 patients and 1,465 MRSA isolates collected at a single clinical microbiology laboratory in the United Kingdom over a one-year period.
View Article and Find Full Text PDFJ Hered
January 2025
The State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences; Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies; Institute of Ecology, Peking University, Beijing 100871, China.
In the fall of 2003, a two-year-old tiger named Ming, weighing some four hundred pounds, was discovered living in an apartment in Harlem, New York. Ming's rescue by NYPD was witnessed, recalled, and venerated by scores of neighbors. The tiger's history and ancestry stimulated considerable media interest, investigative sleuthing, and forensic genomic analyses.
View Article and Find Full Text PDFGenes (Basel)
January 2025
Division of Genetics and Cardiovascular Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, MA 02115, USA.
Low-density lipoprotein cholesterol (LDL-C) is a well-established risk factor for cardiovascular disease, and it plays a causal role in the development of atherosclerosis. Genome-wide association studies (GWASs) have successfully identified hundreds of genetic variants associated with LDL-C. Most of these risk loci fall in non-coding regions of the genome, and it is unclear how these non-coding variants affect circulating lipid levels.
View Article and Find Full Text PDFSci Rep
January 2025
Breeding Informatics Group, Department of Animal Sciences, Georg-August University, 37075, Göttingen, Germany.
In the last two decades there has been growing interest in the analysis of ancient DNA obtained from the parchment used in historic documents. The genetic insight that this data provides makes collections of historic documents an invaluable source for studying the development and spread of historical livestock populations. Additionally, the biological data may provide new information for the historical analysis that could be used to determine the provenance as well as the authenticity of these documents.
View Article and Find Full Text PDFBMC Genomics
January 2025
International Institute of Molecular and Cell Biology in Warsaw, Laboratory of Zebrafish Developmental Genomics, Księcia Trojdena 4, Warsaw, 02-109, Poland.
Congenital heart disease (CHD) is a prevalent condition characterized by defective heart development, causing premature death and stillbirths among infants. Genome-wide association studies (GWASs) have provided insights into the role of genetic variants in CHD pathogenesis through the identification of a comprehensive set of single-nucleotide polymorphisms (SNPs). Notably, 90-95% of these variants reside in the noncoding genome, complicating the understanding of their underlying mechanisms.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!