As reliable, efficient genome sequencing becomes ubiquitous, the need for similarly reliable and efficient variant calling becomes increasingly important. The Genome Analysis Toolkit (GATK), maintained by the Broad Institute, is currently the widely accepted standard for variant calling software. However, alternative solutions may provide faster variant calling without sacrificing accuracy. One such alternative is Sentieon DNASeq, a toolkit analogous to GATK but built on a highly optimized backend. We conducted an independent evaluation of the DNASeq single-sample variant calling pipeline in comparison to that of GATK. Our results support the near-identical accuracy of the two software packages, showcase optimal scalability and great speed from Sentieon, and describe computational performance considerations for the deployment of DNASeq.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6710408 | PMC |
http://dx.doi.org/10.3389/fgene.2019.00736 | DOI Listing |
JCI Insight
December 2024
Department of Ophthalmology and Roger and Karalis Johnson Retina Center, University of Washington, Seattle, United States of America.
Background: Current clinical sequencing methods cannot effectively detect DNA methylation and allele-specific variation to provide parent-of-origin information from the proband alone. Parent-of-origin effects can lead to differential disease and the inability to assign this in de novo cases limits prognostication in the majority of affected individuals with retinoblastoma, a hereditary cancer with suspected parent-of-origin effects.
Methods: To directly assign parent-of-origin in retinoblastoma patients, genomic DNA was extracted from blood samples for sequencing using a programmable, targeted single-molecule long-read DNA genomic and epigenomic approach.
Bioinformatics
December 2024
Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, United States.
Motivation: The Variant Call Format (VCF) is widely used in genome sequencing but scales poorly. For instance, we estimate a 150,000 genome VCF would occupy 900 TiB, making it costly and complicated to produce, analyze, and store. The issue stems from VCF's requirement to densely represent both reference-genotypes and allele-indexed arrays.
View Article and Find Full Text PDFParasit Vectors
December 2024
Department of Biology, College of Arts and Sciences, Baylor University, Waco, TX, USA.
Background: The high burden of malaria in Africa is largely due to the presence of competent and adapted Anopheles vector species. With invasive Anopheles stephensi implicated in malaria outbreaks in Africa, understanding the genomic basis of vector-parasite compatibility is essential for assessing the risk of future outbreaks due to this mosquito. Vector compatibility with P.
View Article and Find Full Text PDFSTAR Protoc
December 2024
Life Sciences Department, Barcelona Supercomputing Center (BSC), Barcelona, Spain; Institució Catalana per la Recerca i Estudis Avançats (ICREA), Barcelona, Spain. Electronic address:
The interoperability of variant identification pipelines is fundamental for achieving consistent clinical care across oncology research centers and hospitals. Here, we present a protocol for using ONCOLINER, a platform for the assessment, improvement, and harmonization of somatic variant discovery of multiple pipelines. We describe steps for acquiring benchmarking datasets and executing the user variant calling pipeline.
View Article and Find Full Text PDFNAR Genom Bioinform
December 2024
M.G. DeGroote Institute for Infectious Disease Research, McMaster University, 1280 Main Street West, Hamilton, Ontario, L8S 4K1, Canada.
The incorporation of sequencing technologies in frontline and public health healthcare settings was vital in developing virus surveillance programs during the Coronavirus Disease 2019 (COVID-19) pandemic caused by transmission of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). However, increased data acquisition poses challenges for both rapid and accurate analyses. To overcome these hurdles, we developed the SARS-CoV-2 Illumina GeNome Assembly Line (SIGNAL) for quick bulk analyses of Illumina short-read sequencing data.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!