Estimating the predictive ability of genetic risk models in simulated data based on published results from genome-wide association studies.

Front Genet

Department of Epidemiology, Erasmus University Medical Center Rotterdam, Netherlands ; Department of Epidemiology, Rollins School of Public Health, Emory University Atlanta, GA, USA.

Published: July 2014

Background: There is increasing interest in investigating genetic risk models in empirical studies, but such studies are premature when the expected predictive ability of the risk model is low. We assessed how accurately the predictive ability of genetic risk models can be estimated in simulated data that are created based on the odds ratios (ORs) and frequencies of single-nucleotide polymorphisms (SNPs) obtained from genome-wide association studies (GWASs).

Methods: We aimed to replicate published prediction studies that reported the area under the receiver operating characteristic curve (AUC) as a measure of predictive ability. We searched GWAS articles for all SNPs included in these models and extracted ORs and risk allele frequencies to construct genotypes and disease status for a hypothetical population. Using these hypothetical data, we reconstructed the published genetic risk models and compared their AUC values to those reported in the original articles.

Results: The accuracy of the AUC values varied with the method used for the construction of the risk models. When logistic regression analysis was used to construct the genetic risk model, AUC values estimated by the simulation method were similar to the published values with a median absolute difference of 0.02 [range: 0.00, 0.04]. This difference was 0.03 [range: 0.01, 0.06] and 0.05 [range: 0.01, 0.08] for unweighted and weighted risk scores.

Conclusions: The predictive ability of genetic risk models can be estimated using simulated data based on results from GWASs. Simulation methods can be useful to estimate the predictive ability in the absence of empirical data and to decide whether empirical investigation of genetic risk models is warranted.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4056181PMC
http://dx.doi.org/10.3389/fgene.2014.00179DOI Listing

Publication Analysis

Top Keywords

genetic risk
28
risk models
28
predictive ability
24
ability genetic
12
simulated data
12
auc values
12
risk
11
models
8
data based
8
genome-wide association
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!