Robustness of case-control studies of genetic factors to population stratification: magnitude of bias and type I error.

Cancer Epidemiol Biomarkers Prev

Institut National d'Etudes Démographiques, 133 Boulevard Davout, 75980 Paris Cedex 20, France.

Published: October 2004

Case-control studies of genetic factors are prone to a special form of confounding called population stratification, whenever the existence of one or more subpopulations may lead to a false association, be it positive or negative. We quantify both the bias (in terms of confounding risk ratio) and the probability of false association (type I error) in the most unfavorable situation in which only one high-risk subpopulation is hidden within the studied population, considering different scenarios of population structuring and varying sample sizes. In accord with previous work, we find that the bias is likely to be small in most cases. In addition, we show that the same applies to the associated type I error whenever the subpopulation is small in proportion. For instance, when the hidden subpopulation makes up 5% of the entire population, with an allelic frequency of 0.25 (versus 0.10) and a disease rate that is double, then the estimated bias is 1.07 and the type I error associated with a sample of 500 cases and 500 controls is 8% (instead of 5%). We also show that the type I error is substantially greater for a rare allele (frequency of 0.1) than for a common allele (frequency of 0.5) and analyze the pattern of increase of vulnerability to stratification bias with sample size. Based on our findings, we may therefore conclude that with moderate sample sizes the type I error associated with population stratification remains very limited in most realistic scenarios.

Download full-text PDF

Source

Publication Analysis

Top Keywords

type error
24
population stratification
12
case-control studies
8
studies genetic
8
genetic factors
8
false association
8
sample sizes
8
error associated
8
allele frequency
8
population
6

Similar Publications

Transformers for Neuroimage Segmentation: Scoping Review.

J Med Internet Res

January 2025

Department of Computer Science and Software Engineering, United Arab Emirates University, Al Ain, United Arab Emirates.

Background: Neuroimaging segmentation is increasingly important for diagnosing and planning treatments for neurological diseases. Manual segmentation is time-consuming, apart from being prone to human error and variability. Transformers are a promising deep learning approach for automated medical image segmentation.

View Article and Find Full Text PDF

Pathway analysis plays a critical role in bioinformatics, enabling researchers to identify biological pathways associated with various conditions by analyzing gene expression data. However, the rise of large, multi-center datasets has highlighted limitations in traditional methods like Over-Representation Analysis (ORA) and Functional Class Scoring (FCS), which struggle with low signal-to-noise ratios (SNR) and large sample sizes. To tackle these challenges, we use a deep learning-based classification method, Gene PointNet, and a novel $P$-value computation approach leveraging the confusion matrix to address pathway analysis tasks.

View Article and Find Full Text PDF

Background: The optimal control of type 2 diabetes (T2D) is defined by the innate mastery of self-management behaviours. This study is designed to condense the lived experiences of people with T2D in relation to factors 'exterior' to themselves into a universal self-management inventory (Assessment of Self-Management Questionnaire in Diabetes Mellitus-External Reality; ASQ-DM-EX).

Methods: We collected responses to an online and physical survey from people living with T2D through a quantitative cross-sectional study.

View Article and Find Full Text PDF

This study explored the role of dentate status and dental caries on diabetes-related complications among patients with type 2 diabetes mellitus (T2DM). A hospital-based cross-sectional design was applied to collect data on diabetic patients attending integrated services for non-communicable diseases and oral health at a public hospital in Thailand. Diabetic complication outcomes included diabetic eye and foot complications and chronic kidney disease (CKD).

View Article and Find Full Text PDF

Background: We evaluated UK nurses' preferences for pediatric hexavalent vaccine attributes.

Research Design And Methods: In a discrete-choice experiment study, 150 nurses chose between 2 hypothetical pediatric hexavalent vaccines with varying attribute levels (device type, plastic in packaging, time on the market, and time the vaccine can stay safely at room temperature) in a series of choice questions. Using random-parameters logit-model estimates, conditional relative attribute importance (CRAI) and odds ratios (ORs) were calculated.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!