Publications by Gunjan Baid

Publications by authors named "Gunjan Baid"

Page 1 of 1

A draft human pangenome reference.

Wen-Wei Liao Mobin Asri Jana Ebler Daniel Doerr Marina Haukness Gunjan Baid

Nature

May 2023

Here the Human Pangenome Reference Consortium presents a first draft of the human pangenome reference. The pangenome contains 47 phased, diploid assemblies from a cohort of genetically diverse individuals. These assemblies cover more than 99% of the expected sequence in each genome and are more than 99% accurate at the structural and base pair levels.

View Article and Find Full Text PDF

DeepConsensus improves the accuracy of sequences with a gap-aware sequence transformer.

Gunjan Baid Daniel E Cook Kishwar Shafin Taedong Yun Felipe Llinares-López

Nat Biotechnol

February 2023

Article Synopsis

Scientists developed a new way called DeepConsensus to help correct DNA sequences more accurately than an older method called pbccs.
DeepConsensus uses advanced technology to lower errors in the DNA reads by 42%, which means it helps make the sequencing more reliable.
This new approach not only improves the quality of the DNA readings but also enhances how genes are understood and reduces mistakes in identifying genetic variations.

View Article and Find Full Text PDF

PrecisionFDA Truth Challenge V2: Calling variants from short and long reads in difficult-to-map regions.

Nathan D Olson Justin Wagner Jennifer McDaniel Sarah H Stephens Samuel T Westreich Gunjan Baid

Cell Genom

May 2022

The precisionFDA Truth Challenge V2 aimed to assess the state of the art of variant calling in challenging genomic regions. Starting with FASTQs, 20 challenge participants applied their variant-calling pipelines and submitted 64 variant call sets for one or more sequencing technologies (Illumina, PacBio HiFi, and Oxford Nanopore Technologies). Submissions were evaluated following best practices for benchmarking small variants with updated Genome in a Bottle benchmark sets and genome stratifications.

View Article and Find Full Text PDF

Accelerated identification of disease-causing variants with ultra-rapid nanopore genome sequencing.

Sneha D Goenka John E Gorzynski Kishwar Shafin Dianna G Fisk Trevor Pesout Gunjan Baid

Nat Biotechnol

July 2022

Whole-genome sequencing (WGS) can identify variants that cause genetic disease, but the time required for sequencing and analysis has been a barrier to its use in acutely ill patients. In the present study, we develop an approach for ultra-rapid nanopore WGS that combines an optimized sample preparation protocol, distributing sequencing over 48 flow cells, near real-time base calling and alignment, accelerated variant calling and fast variant filtration for efficient manual review. Application to two example clinical cases identified a candidate variant in <8 h from sample preparation to variant identification.

View Article and Find Full Text PDF

Ultrarapid Nanopore Genome Sequencing in a Critical Care Setting.

John E Gorzynski Sneha D Goenka Kishwar Shafin Tanner D Jensen Dianna G Fisk Gunjan Baid

N Engl J Med

February 2022

View Article and Find Full Text PDF

Haplotype-aware variant calling with PEPPER-Margin-DeepVariant enables high accuracy in nanopore long-reads.

Kishwar Shafin Trevor Pesout Pi-Chuan Chang Maria Nattestad Alexey Kolesnikov Gunjan Baid

Nat Methods

November 2021

Long-read sequencing has the potential to transform variant detection by reaching currently difficult-to-map regions and routinely linking together adjacent variations to enable read-based phasing. Third-generation nanopore sequence data have demonstrated a long read length, but current interpretation methods for their novel pore-based signal have unique error profiles, making accurate analysis challenging. Here, we introduce a haplotype-aware variant calling pipeline, PEPPER-Margin-DeepVariant, that produces state-of-the-art variant calling results with nanopore data.

View Article and Find Full Text PDF