Motivation: Allowance for increasingly large samples is a key to identify the association of genetic variants with Alzheimer's disease (AD) in genome-wide association studies (GWAS). Accordingly, we aimed to develop a method that incorporates patients with mild cognitive impairment and unknown cognitive status in GWAS using a machine learning-based AD prediction model.

Results: Simulation analyses showed that weighting imputed phenotypes method increased the statistical power compared to ordinary logistic regression using only AD cases and controls. Applied to real-world data, the penalized logistic method had the highest AUC (0.96) for AD prediction and weighting imputed phenotypes method performed well in terms of power. We identified an association (P<5.0×10-8) of AD with several variants in the APOE region and rs143625563 in LMX1A. Our method, which allows the inclusion of individuals with mild cognitive impairment, improves the statistical power of GWAS for AD. We discovered a novel association with LMX1A.

Availability And Implementation: Simulation codes can be accessed at https://github.com/Junkkkk/wGEE_GWAS.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10539075PMC
http://dx.doi.org/10.1093/bioinformatics/btad534DOI Listing

Publication Analysis

Top Keywords

machine learning-based
8
statistical power
8
association studies
8
weighting imputed
8
imputed phenotypes
8
phenotypes method
8
learning-based quantification
4
quantification disease
4
disease uncertainty
4
uncertainty increases
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!