Background: With diminishing costs of next generation sequencing (NGS), whole genome analysis becomes a standard tool for identifying genetic causes of inherited diseases. Commercial NGS service providers in general not only provide raw genomic reads, but further deliver SNP calls to their clients. However, the question for the user arises whether to use the SNP data as is, or process the raw sequencing data further through more sophisticated SNP calling pipelines with more advanced algorithms.
Results: Here we report a detailed comparison of SNPs called using the popular GATK multiple-sample calling protocol to SNPs delivered as part of a 40x whole genome sequencing project by Illumina Inc of 171 human genomes of Arab descent (108 unrelated Qatari genomes, 19 trios, and 2 families with rare diseases) and compare them to variants provided by the Illumina CASAVA pipeline. GATK multi-sample calling identifies more variants than the CASAVA pipeline. The additional variants from GATK are robust for Mendelian consistencies but weak in terms of statistical parameters such as TsTv ratio. However, these additional variants do not make a difference in detecting the causative variants in the studied phenotype.
Conclusion: Both pipelines, GATK multi-sample calling and Illumina CASAVA single sample calling, have highly similar performance in SNP calling at the level of putatively causative variants.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4216909 | PMC |
http://dx.doi.org/10.1186/1756-0500-7-747 | DOI Listing |
Front Immunol
December 2024
Biofrontiers Institute, University of Colorado Boulder, Boulder, CO, United States.
Background: Understanding genetic underpinnings of immune-mediated inflammatory diseases is crucial to improve treatments. Single-cell RNA sequencing (scRNA-seq) identifies cell states expanded in disease, but often overlooks genetic causality due to cost and small genotyping cohorts. Conversely, large genome-wide association studies (GWAS) are commonly accessible.
View Article and Find Full Text PDFGenome
December 2024
Indian Institute of Agricultural Biotechnology, Ranchi, Jharkhand, India;
The domestication of Emu (Dromaius novaehollandiae) began in the 1970s, but their productive characteristics have not undergone significant genetic enhancement. This study investigated the inbreeding and genetic diversity of 50 emus from various farms in Japan using Double digest restriction-site associated DNA sequencing (ddRAD-seq) markers. Single nucleotide polymorphism (SNP) calling revealed 1,71,975 high-quality SNPs while runs of homozygosity (ROH) analysis identified 1,843 homozygous segments, with an average of 36.
View Article and Find Full Text PDFGenes (Basel)
October 2024
Department of Plant and Soil Science, Texas Tech University, Lubbock, TX 79409, USA.
The identification of informative molecular markers is useful for linkage mapping and can benefit genome-wide association studies by providing fine-scale information about sequence variations. However, high-throughput genotyping approaches are not cost-effective for labs that require frequent use, such as breeding programs that need to perform genotyping on large populations with hundreds of individuals. The number of single nucleotide polymorphism markers generated by those approaches can be far more than needed for most breeding programs; instead, breeders focus on the use of at most hundreds of polymorphic molecular markers for analysis.
View Article and Find Full Text PDFBMC Plant Biol
November 2024
Department of Genetics, "Luiz de Queiroz" College of Agriculture, São Paulo University, (ESALQ/USP), Piracicaba, Brazil.
Forensic Sci Int Genet
February 2025
Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, United States; Center for Computational Biology, University of California, Berkeley, Berkeley, CA, United States. Electronic address:
Advancements in sequencing and laboratory technologies have enabled forensic genetic analysis on increasingly low quality and degraded DNA samples. However, existing computational methods applied to genotyping and imputation for generating DNA profiles from degraded DNA have not been tested for forensic applications. Here we simulated sequencing data of varying qualities-coverage, fragment lengths, and deamination patterns-from forty individuals of diverse genetic ancestries.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!