Publications by Alexander W Zaranek | LitMetric

Publications by authors named "Alexander W Zaranek"


An unbiased index to quantify participant's phenotypic contribution to an open-access cohort.

Yingleong Chan, Michael Tung, Alexander S Garruss, Sarah W Zaranek, Ying Kai Chan, Alexander W Zaranek

Sci Rep

April 2017

The Personal Genome Project (PGP) is an effort to enroll many participants to create an open-access repository of genome, health and trait data for research. However, PGP participants are not enrolled for studying any specific traits and participants choose the phenotypes to disclose. To measure the extent and willingness and to encourage and guide participants to contribute phenotypes, we developed an algorithm to score and rank the phenotypes and participants of the PGP.


Principles and Recommendations for Standardizing the Use of the Next-Generation Sequencing Variant File in Clinical Settings.

Ira M Lubin, Nazneen Aziz, Lawrence J Babb, Dennis Ballinger, Himani Bisht, Alexander W Zaranek

J Mol Diagn

May 2017

A national workgroup convened by the Centers for Disease Control and Prevention identified principles and made recommendations for standardizing the description of sequence data contained within the variant file generated during the course of clinical next-generation sequence analysis for diagnosing human heritable conditions. The specifications for variant files were initially developed to be flexible with regard to content representation to support a variety of research applications. This flexibility permits variation with regard to how sequence findings are described and this depends, in part, on the conventions used.


The whole genome sequences and experimentally phased haplotypes of over 100 personal genomes.

Qing Mao, Serban Ciotlos, Rebecca Yu Zhang, Madeleine P Ball, Robert Chin, Alexander Wait Zaranek

Gigascience

October 2016

Background: Since the completion of the Human Genome Project in 2003, it is estimated that more than 200,000 individual whole human genomes have been sequenced, a stunning accomplishment in such a short period of time. However, most of these were sequenced without experimental haplotype data and are therefore missing an important aspect of genome biology.


Extensive sequencing of seven human genomes to characterize benchmark reference materials.

Justin M Zook, David Catoe, Jennifer McDaniel, Lindsay Vang, Noah Spies, Alexander W Zaranek

Sci Data

June 2016

Article Synopsis

The Genome in a Bottle Consortium, led by NIST, is focusing on creating accurate reference materials and data to improve human genome sequencing and comparison methods. They have compiled a diverse set of sequencing data from seven human genomes, including the pilot genome NA12878, which is now a NIST reference material. The project utilizes data from various sequencing technologies and aims to enhance our understanding of the human genome, as well as improve genomic analysis tools and techniques.


A public resource facilitating clinical use of genomes.

Madeleine P Ball, Joseph V Thakuria, Alexander Wait Zaranek, Tom Clegg, Abraham M Rosenbaum

Proc Natl Acad Sci U S A

July 2012

Rapid advances in DNA sequencing promise to enable new diagnostics and individualized therapies. Achieving personalized medicine, however, will require extensive research on highly reidentifiable, integrated datasets of genomic and health information. To assist with this, participants in the Personal Genome Project choose to forgo privacy via our institutional review board-approved "open consent" process.


Accurate whole-genome sequencing and haplotyping from 10 to 20 human cells.

Brock A Peters, Bahram G Kermani, Andrew B Sparks, Oleg Alferov, Peter Hong, Alexander Wait Zaranek

Nature

July 2012

Recent advances in whole-genome sequencing have brought the vision of personal genomics and genomic medicine closer to reality. However, current methods lack clinical accuracy and the ability to describe the context (haplotypes) in which genome variants co-occur in a cost-effective manner. Here we describe a low-cost DNA sequencing and haplotyping process, long fragment read (LFR) technology, which is similar to sequencing long single DNA molecules without cloning or separation of metaphase chromosomes.


Back to the future: from genome to metabolome.

Joseph V Thakuria, Alexander W Zaranek, George M Church, Gerard T Berry

Hum Mutat

May 2012

In the traditional medical genetics setting, metabolic disorders, identified either clinically or through biochemical screening, undergo subsequent single gene testing to molecularly confirm diagnosis, provide further insight on natural disease history, and inform on disease management, treatment, familial testing, and reproductive options. For decades now, this process has been responsible for saving many lives worldwide. Only recently, though, has it become possible to move in the opposite direction by starting with an individual's whole genome or exome, and, guided by this data, study more minor perturbations in the absolute values and substrate ratios of clinically important biochemical analytes.


Phased whole-genome genetic risk in a family quartet using a major allele reference sequence.

Frederick E Dewey, Rong Chen, Sergio P Cordero, Kelly E Ormond, Colleen Caleshu, Alexander W Zaranek

PLoS Genet

September 2011

Whole-genome sequencing harbors unprecedented potential for characterization of individual and family genetic variation. Here, we develop a novel synthetic human reference sequence that is ethnically concordant and use it for the analysis of genomes from a nuclear family with history of familial thrombophilia. We demonstrate that the use of the major allele reference sequence results in improved genotype accuracy for disease-associated variant loci.


A survey of genomic traces reveals a common sequencing error, RNA editing, and DNA editing.

Alexander Wait Zaranek, Erez Y Levanon, Tomer Zecharia, Tom Clegg, George M Church

PLoS Genet

May 2010

While it is widely held that an organism's genomic information should remain constant, several protein families are known to modify it. Members of the AID/APOBEC protein family can deaminate DNA. Similarly, members of the ADAR family can deaminate RNA.


Clinical assessment incorporating a personal genome.

Euan A Ashley, Atul J Butte, Matthew T Wheeler, Rong Chen, Teri E Klein, Alexander Wait Zaranek

Lancet

May 2010

Background: The cost of genomic information has fallen steeply, but the clinical translation of genetic risk estimates remains unclear. We aimed to undertake an integrated analysis of a complete human genome in a clinical context.

Methods: We assessed a patient with a family history of vascular disease and early sudden death.


Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays.

Radoje Drmanac, Andrew B Sparks, Matthew J Callow, Aaron L Halpern, Norman L Burns, Alexander Wait Zaranek

Science

January 2010

Genome sequencing of large numbers of individuals promises to advance the understanding, treatment, and prevention of human diseases, among other applications. We describe a genome sequencing platform that achieves efficient imaging and low reagent consumption with combinatorial probe anchor ligation chemistry to independently assay each base from patterned nanoarrays of self-assembling DNA nanoballs. We sequenced three human genomes with this platform, generating an average of 45- to 87-fold coverage per genome and identifying 3.


A highly annotated whole-genome sequence of a Korean individual.

Jong-Il Kim, Young Seok Ju, Hansoo Park, Sheehyun Kim, Seonwook Lee, Alexander Wait Zaranek

Nature

August 2009

Recent advances in sequencing technologies have initiated an era of personal genome sequences. To date, human genome sequences have been reported for individuals with ancestry in three distinct geographical regions: a Yoruba African, two individuals of northwest European origin, and a person from China. Here we provide a highly annotated, whole-genome sequence for a Korean individual, known as AK1.


Swift: primary data analysis for the Illumina Solexa sequencing platform.

Nava Whiteford, Tom Skelly, Christina Curtis, Matt E Ritchie, Andrea Löhr, Alexander Wait Zaranek

Bioinformatics

September 2009

Motivation: Primary data analysis methods are of critical importance in second generation DNA sequencing. Improved methods have the potential to increase yield and reduce the error rates. Openly documented analysis tools enable the user to understand the primary data, which is important for the optimization and validity of their scientific work.


Multiplex padlock targeted sequencing reveals human hypermutable CpG variations.

Jin Billy Li, Yuan Gao, John Aach, Kun Zhang, Gregory V Kryukov, Alexander Wait Zaranek

Genome Res

September 2009

Utilizing the full power of next-generation sequencing often requires the ability to perform large-scale multiplex enrichment of many specific genomic loci in multiple samples. Several technologies have been recently developed but await substantial improvements. We report the 10,000-fold improvement of a previously developed padlock-based approach, and apply the assay to identifying genetic variations in hypermutable CpG regions across human chromosome 21.


Free Factories: Unified Infrastructure for Data Intensive Web Services.

Alexander Wait Zaranek, Tom Clegg, Ward Vandewege, George M Church

Proc USENIX Annu Tech Conf

May 2008

We introduce the Free Factory, a platform for deploying data-intensive web services using small clusters of commodity hardware and free software. Independently administered virtual machines called Freegols give application developers the flexibility of a general purpose web server, along with access to distributed batch processing, cache and storage services. Each cluster exploits idle RAM and disk space for cache, and reserves disks in each node for high bandwidth storage.
