Machine Learning-Based Alzheimer's Disease Stage Diagnosis Utilizing Blood Gene Expression and Clinical Data: A Comparative Investigation.

Diagnostics (Basel)

Department of Computer Science and Engineering, Faculty of Engineering and Technology, Technology Campus (Peenya Campus), Ramaiah University of Applied Sciences, Bengaluru 560058, India.

Published: January 2025

This study presents a comparative analysis of the multistage diagnosis of Alzheimer's disease (AD), including mild cognitive impairment (MCI), utilizing two distinct types of biomarkers: blood gene expression and clinical biomarker samples. Both of these samples, obtained from participants in the Alzheimer's Disease Neuroimaging Initiative (ADNI), were independently analyzed utilizing machine learning (ML)-based multiclassifiers. This study applied novel machine learning-based data augmentation techniques to gene expression profile data that are high-dimensional, low-sample-size (HDLSS) and inherently highly imbalanced. The investigation obtained the highest multiclassification performance to date in the multistage diagnosis of Alzheimer's disease utilizing the blood gene expression profiles of Alzheimer's Disease Neuroimaging Initiative (ADNI) participants. Based on the performance results obtained, and other factors such as early prediction capabilities, this study compares the efficacies of the two types of biomarkers for multistage diagnosis. This study presents the sole investigation in which multiclassification-based AD stage diagnosis was conducted utilizing blood gene expression data. We obtained the best multiclassification result in both modalities of the ADNI data in terms of F1-score and were able to identify new genetic biomarkers. The combination of the XGBoost and SFBS (Sequential Floating Backward Selection) methods was used to select the features. We were able to select the 95 most effective gene probe sets out of 49,386. For the clinical study data, eight of the most effective biomarkers were selected using SFBS. A deep learning (DL) classifier was used to identify the stages-cognitive normal (CN), mild cognitive impairment (MCI), and Alzheimer's disease (AD)/dementia. DL, support vector machine (SVM), gradient boosting (GB), and random forest (RF) classifiers were used for the AD stage detection from gene expression profile data. Because of the high data imbalance in genomic data, borderline oversampling/data augmentation was applied in the model training and original samples for validation. Utilizing clinical data, the highest ROC AUC scores attained were 0.989, 0.927, and 0.907 for the identification of the CN, MCI, and dementia stages, respectively. The highest F1 scores achieved were 0.971, 0.939, and 0.886. Employing gene expression data, we obtained ROC AUC scores of 0.763, 0.761, and 0.706 for the CN, MCI, and dementia stages, respectively, and F1 scores of 0.71, 0.77, and 0.53 for CN, MCI, and dementia, respectively. This represents the best outcome to date for AD stage diagnosis from ADNI blood gene expression profile data utilizing multiclassification techniques. The results indicated that our multiclassification model effectively manages the imbalanced data of a high-dimension, low-sample-size (HDLSS) nature to identify samples of the minority class. MAPK14, PLG, FZD2, FXYD6, and TEP1 are among the novel genes identified as being associated with AD risk.

Download full-text PDF

Source
http://dx.doi.org/10.3390/diagnostics15020211DOI Listing

Publication Analysis

Top Keywords

gene expression
32
alzheimer's disease
24
blood gene
20
data
13
stage diagnosis
12
utilizing blood
12
multistage diagnosis
12
expression profile
12
profile data
12
mci dementia
12

Similar Publications

Barley (Hordeum vulgare L.) is an important cereal crop used in animal feed, beer brewing, and food production. Waterlogging stress is one of the prominent abiotic stresses that has a significant impact on the yield and quality of barley.

View Article and Find Full Text PDF

Exploring the dual roles of sec-dependent effectors from Candidatus Liberibacter asiaticus in immunity of citrus plants.

Plant Cell Rep

January 2025

MOE Key Laboratory of Laser Life Science and Institute of Laser Life Science, Guangdong Provincial Key Laboratory of Laser Life Science, Guangzhou Key Laboratory of Spectral Analysis and Functional Probes, College of Biophotonics, School of Optoelectronic Science and Engineering, South China Normal University, Guangzhou, 510631, China.

The three SDEs of CLas were expressed in citrus leaves by AuNPs-PEI mediated transient expression system, and promoted the proliferation of CLas and inhibited citrus immunity. Huanglongbing (HLB) is the most severe bacterial disease of citrus caused by Candidatus Liberibacter asiaticus (CLas). CLas suppress host immune responses and promote infection by sec-dependent effectors (SDEs), thus insight into HLB pathogenesis is urgently needed to develop effective management strategies.

View Article and Find Full Text PDF

This study aims to investigate the expression of seven cancer testis antigens (MAGE-A1, MAGE-A4, MAGE-A10, MAGE-A11, PRAME, NY-ESO-1 and KK-LC-1) in pan squamous cell carcinoma and their prognostic value, thus assessing the potential of these CTAs as immunotherapeutic targets. The protein expression of these CTAs was evaluated by immunohistochemistry in 60 lung squamous cell carcinoma (LUSC), 62 esophageal squamous cell carcinoma (ESCA) and 62 head and neck squamous cell carcinoma (HNSC). The relationship between CTAs expression and progression-free survival (PFS) was assessed.

View Article and Find Full Text PDF

An involvement of a new zinc finger protein PbrZFP719 into pear self-incompatibility reaction.

Plant Cell Rep

January 2025

State Key Laboratory of Crop Genetics and Germplasm Enhancement, Saya Institute of Nanjing Agricultural University, Nanjing Agricultural University, Nanjing, 211800, China.

This study indicated that the CCHC-type zinc finger protein PbrZFP719 involves into self-incompatibility by affecting the levels of reactive oxygen species and cellulose content at the tips of pollen tubes. S-RNase-based self-incompatibility (SI) facilitates cross-pollination and prevents self-pollination, which in turn increases the costs associated with artificial pollination in fruit crops. Self S-RNase exerts its inhibitory effects on pollen tube growth by altering cell structures and components, including reactive oxygen species (ROS) level and cellulose content.

View Article and Find Full Text PDF

Tissue nanotransfection-based endothelial PLCγ2-targeted epigenetic gene editing in vivo rescues perfusion and diabetic ischemic wound healing.

Mol Ther

January 2025

Department of Surgery, McGowan Institute for Regenerative Medicine, University of Pittsburgh, Pittsburgh, PA 15219, United States; Department of Surgery, Indiana Center for Regenerative Medicine and Engineering, Indiana University School of Medicine, Indianapolis, IN 46202, United States. Electronic address:

Diabetic wounds are complicated by underlying peripheral vasculopathy. Reliance on vascular endothelial growth factor (VEGF) therapy to improve perfusion makes logical sense, yet clinical study outcomes on rescuing diabetic wound vascularization have yielded disappointing results. Our previous work has identified that low endothelial phospholipase Cγ2 (PLCγ2) expression hinders the therapeutic effect of VEGF on the diabetic ischemic limb.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!