A machine learning approach to unmask novel gene signatures and prediction of Alzheimer's disease within different brain regions.

Genomics

School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney 2033, Australia. Electronic address:

Published: July 2021

Alzheimer's disease (AD) is a progressive neurodegenerative disorder whose aetiology is currently unknown. Although numerous studies have attempted to identify the genetic risk factor(s) of AD, the interpretability and/or the prediction accuracies achieved by these studies remained unsatisfactory, reducing their clinical significance. Here, we employ the ensemble of random-forest and regularized regression model (LASSO) to the AD-associated microarray datasets from four brain regions - Prefrontal cortex, Middle temporal gyrus, Hippocampus, and Entorhinal cortex- to discover novel genetic biomarkers through a machine learning-based feature-selection classification scheme. The proposed scheme unraveled the most optimum and biologically significant classifiers within each brain region, which achieved by far the highest prediction accuracy of AD in 5-fold cross-validation (99% average). Interestingly, along with the novel and prominent biomarkers including CORO1C, SLC25A46, RAE1, ANKIB1, CRLF3, PDYN, numerous non-coding RNA genes were also observed as discriminator, of which AK057435 and BC037880 are uncharacterized long non-coding RNA genes.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ygeno.2021.04.028DOI Listing

Publication Analysis

Top Keywords

alzheimer's disease
8
brain regions
8
non-coding rna
8
rna genes
8
machine learning
4
learning approach
4
approach unmask
4
unmask novel
4
novel gene
4
gene signatures
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!