We evaluate two-phase designs to follow-up findings from genome-wide association study (GWAS) when the cost of regional sequencing in the entire cohort is prohibitive. We develop novel expectation-maximization-based inference under a semiparametric maximum likelihood formulation tailored for post-GWAS inference. A GWAS-SNP (where SNP is single nucleotide polymorphism) serves as a surrogate covariate in inferring association between a sequence variant and a normally distributed quantitative trait (QT). We assess test validity and quantify efficiency and power of joint QT-SNP-dependent sampling and analysis under alternative sample allocations by simulations. Joint allocation balanced on SNP genotype and extreme-QT strata yields significant power improvements compared to marginal QT- or SNP-based allocations. We illustrate the proposed method and evaluate the sensitivity of sample allocation to sampling variation using data from a sequencing study of systolic blood pressure.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5814750PMC
http://dx.doi.org/10.1002/gepi.22099DOI Listing

Publication Analysis

Top Keywords

two-phase designs
8
regional sequencing
8
designs joint
4
joint quantitative-trait-dependent
4
quantitative-trait-dependent genotype-dependent
4
genotype-dependent sampling
4
sampling post-gwas
4
post-gwas regional
4
sequencing evaluate
4
evaluate two-phase
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!