Ranking bias in association studies.

Hum Hered

Office of Biostatistics Research, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Md. 20892, USA.

Published: May 2009

Background: It is widely appreciated that genomewide association studies often yield overestimates of the association of a marker with disease when attention focuses upon the marker showing the strongest relationship. For example, in a case-control setting the largest (in absolute value) estimated odds ratio has been found to typically overstate the association as measured in a second, independent set of data. The most common reason given for this observation is that the choice of the most extreme test statistic is often conditional upon first observing a significant p value associated with the marker. A second, less appreciated reason is described here. Under common circumstances it is the multiple testing of many markers and subsequent focus upon those with most extreme test statistics (i.e. highly ranked results) that leads to bias in the estimated effect sizes.

Conclusions: This bias, termed ranking bias, is separate from that arising from conditioning on a significant p value and may often be a more important factor in generating bias. An analytic description of this bias, simulations demonstrating its extent, and identification of some factors leading to its exacerbation are presented.
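The ranking effect described above can be illustrated with a small simulation. This is a minimal sketch under stated assumptions, not the authors' simulation design: every marker is given the same true log odds ratio, estimates are drawn as normal with a common standard error, and no significance threshold is applied. The function name `simulate_ranking_bias` and all parameter values are hypothetical and chosen only for illustration.

```python
import random
import statistics


def simulate_ranking_bias(n_markers=1000, true_effect=0.1, se=0.1,
                          n_reps=200, seed=0):
    """Compare the top-ranked discovery estimate with its replication.

    Assumptions (illustrative, not taken from the paper): all markers
    share the same true log odds ratio `true_effect`, and each estimate
    is normally distributed around it with standard error `se`.
    Returns (mean discovery estimate, mean replication estimate) for the
    marker ranked first by absolute estimated effect.
    """
    rng = random.Random(seed)
    discovery_top = []
    replication_top = []
    for _ in range(n_reps):
        # Discovery study: one noisy estimate per marker.
        estimates = [rng.gauss(true_effect, se) for _ in range(n_markers)]
        # Rank by absolute estimated effect and keep only the top marker.
        # No p-value threshold is used, so any bias comes from ranking alone.
        top = max(range(n_markers), key=lambda i: abs(estimates[i]))
        sign = 1.0 if estimates[top] >= 0 else -1.0
        discovery_top.append(sign * estimates[top])
        # Replication study: an independent estimate for the same marker,
        # oriented in the direction seen in the discovery study.
        replication_top.append(sign * rng.gauss(true_effect, se))
    return statistics.mean(discovery_top), statistics.mean(replication_top)


if __name__ == "__main__":
    disc, rep = simulate_ranking_bias()
    print(f"mean top-ranked discovery estimate: {disc:.3f}")
    print(f"mean replication estimate:          {rep:.3f}")
```

Under these assumptions the top-ranked discovery estimate tends to exceed the true effect, while the independent replication estimate stays close to it, even though no conditioning on significance takes place. Increasing `n_markers` or `se` in this sketch widens the gap.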


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2880722
DOI: http://dx.doi.org/10.1159/000194979

