The variant call format provides efficient and robust storage of GWAS summary statistics.

Genome Biol

Ronald M. Loeb Center for Alzheimer's Disease, Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, 10029-5674, USA.

Published: January 2021

GWAS summary statistics are fundamental for a variety of research applications, yet no common storage format has been widely adopted. Existing tabular formats store information about genetic variants and associations ambiguously or incompletely, lack essential metadata, and are typically not indexed, which yields poor query performance and increases the possibility of errors in data interpretation and post-GWAS analyses. To address these issues, we adapted the variant call format to store GWAS summary statistics (GWAS-VCF) and developed open-source tools to use this format in downstream analyses. We provide open access to over 10,000 complete GWAS summary datasets converted to this format (https://gwas.mrcieu.ac.uk).
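
To make the idea of storing association statistics in a VCF record more concrete, here is a minimal Python sketch of how one such line might be read. It is not the authors' tooling. The FORMAT keys used here (ES for effect size, SE for standard error, LP for -log10 p-value) are assumptions about the GWAS-VCF layout, and the variant, its values, and the helper name `parse_record` are invented for illustration.

```python
# Column order of a VCF data line that has a single sample (here: trait) column.
VCF_COLUMNS = ["chrom", "pos", "id", "ref", "alt",
               "qual", "filter", "info", "format", "sample"]

# Invented record for illustration only; not taken from any real dataset.
# Assumed FORMAT keys: ES = effect size, SE = standard error,
# LP = -log10(p-value).
EXAMPLE_LINE = "\t".join(
    ["1", "1000", "rs0000001", "A", "G", ".", "PASS", ".",
     "ES:SE:LP", "0.05:0.01:2.0"]
)


def parse_record(line):
    """Parse one tab-separated VCF data line into a dictionary (hypothetical helper)."""
    fields = dict(zip(VCF_COLUMNS, line.rstrip("\n").split("\t")))

    # FORMAT lists the keys; the sample column lists the matching values.
    keys = fields["format"].split(":")
    values = fields["sample"].split(":")

    # "." is the VCF convention for a missing value.
    stats = {k: (None if v == "." else float(v)) for k, v in zip(keys, values)}

    # Convert -log10(p) back to a p-value when it is present.
    lp = stats.get("LP")
    return {
        "chrom": fields["chrom"],
        "pos": int(fields["pos"]),
        "rsid": fields["id"],
        "ref": fields["ref"],
        "alt": fields["alt"],
        "stats": stats,
        "pvalue": None if lp is None else 10 ** -lp,
    }


record = parse_record(EXAMPLE_LINE)
print(record["chrom"], record["pos"], record["stats"], record["pvalue"])
```

Because the reference and alternate alleles occupy fixed columns, the effect allele is unambiguous under this layout, which is one of the problems with ad hoc tabular formats that the abstract describes. In practice a compressed, indexed file would be queried by region with an existing VCF library rather than parsed by hand.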


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7805039
DOI: http://dx.doi.org/10.1186/s13059-020-02248-0

