SNP selection and multidimensional scaling to quantify population structure.

Genet Epidemiol

Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA.

Published: September 2009

In the new era of large-scale collaborative Genome Wide Association Studies (GWAS), population stratification has become a critical issue that must be addressed. In order to build upon the methods developed to control the confounding effect of a structured population, it is extremely important to visualize and quantify that effect. In this work, we develop methodology for single nucleotide polymorphism (SNP) selection and subsequent population stratification visualization based on deviation from Hardy-Weinberg equilibrium in conjunction with non-metric multidimensional scaling (MDS); a distance-based multivariate technique. Through simulation, it is shown that SNP selection based on Hardy-Weinberg disequilibrium (HWD) is robust against confounding linkage disequilibrium patterns that have been problematic in past studies and methods as well as producing a differentiated SNP set. Non-metric MDS is shown to be a multivariate visualization tool preferable to principal components in conjunction with HWD SNP selection through theoretical and empirical study from HapMap samples. The proposed selection tool offers a simple and effective way to select appropriate substructure-informative markers for use in exploring the effect that population stratification may have in association studies.

Download full-text PDF

Source
http://dx.doi.org/10.1002/gepi.20401DOI Listing

Publication Analysis

Top Keywords

snp selection
16
population stratification
12
multidimensional scaling
8
association studies
8
snp
5
population
5
selection multidimensional
4
scaling quantify
4
quantify population
4
population structure
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!