Background: It is estimated that ∼60% of people with Alzheimer's disease (AD) are undetected or undiagnosed, with higher rates of underdiagnosis in low-to middle-income areas with limited medical resources. To promote health equity, we have developed a web-based tool that utilizes easy-to-collect clinical data to enhance AD detection rate in primary care settings.
Methods: This study was leveraged on the data collected from participants of the Australian Imaging, Biomarker & Lifestyle (AIBL) study and the Religious Orders Study and Memory and Aging Project (ROSMAP).
Background: Integrating scores from multiple cognitive tests into a single cognitive composite has been shown to improve sensitivity to detect AD-related cognitive impairment. However, existing composites have little sensitivity to amyloid-β status (Aβ +/-) in preclinical AD.
Objective: Evaluate whether a data-driven approach for deriving cognitive composites can improve the sensitivity to detect Aβ status among cognitively unimpaired (CU) individuals compared to existing cognitive composites.
Several (inter)national longitudinal dementia observational datasets encompassing demographic information, neuroimaging, biomarkers, neuropsychological evaluations, and muti-omics data, have ushered in a new era of potential for integrating machine learning (ML) into dementia research and clinical practice. ML, with its proficiency in handling multi-modal and high-dimensional data, has emerged as an innovative technique to facilitate early diagnosis, differential diagnosis, and to predict onset and progression of mild cognitive impairment and dementia. In this review, we evaluate current and potential applications of ML, including its history in dementia research, how it compares to traditional statistics, the types of datasets it uses and the general workflow.
View Article and Find Full Text PDFMotivation: Biological background knowledge plays an important role in the manual quality assurance (QA) of biological database records. One such QA task is the detection of inconsistencies in literature-based Gene Ontology Annotation (GOA). This manual verification ensures the accuracy of the GO annotations based on a comprehensive review of the literature used as evidence, Gene Ontology (GO) terms, and annotated genes in GOA records.
View Article and Find Full Text PDFBackground: The associations between mood disorders (anxiety and depression) and mild cognitive impairment (MCI) or Alzheimer's dementia (AD) remain unclear.
Methods: Data from the Australian Imaging, Biomarker & Lifestyle (AIBL) study were subjected to logistic regression to determine both cross-sectional and longitudinal associations between anxiety/depression and MCI/AD. Effect modification by selected covariates was analysed using the likelihood ratio test.
Currently, the genetic variants strongly associated with risk for Multiple Sclerosis (MS) are located in the Major Histocompatibility Complex. This includes DRB1*15:01 and DRB1*15:03 alleles at the HLA-DRB1 locus, the latter restricted to African populations; the DQB1*06:02 allele at the HLA-DQB1 locus which is in high linkage disequilibrium (LD) with DRB1*15:01; and protective allele A*02:01 at the HLA-A locus. HLA allele identification is facilitated by co-inherited ('tag') single nucleotide polymorphisms (SNPs); however, SNP validation is not typically done outside of the discovery population.
View Article and Find Full Text PDFYoung breast and bowel cancers (e.g., those diagnosed before age 40 or 50 years) have far greater morbidity and mortality in terms of years of life lost, and are increasing in incidence, but have been less studied.
View Article and Find Full Text PDFCancer Epidemiol Biomarkers Prev
February 2024
Background: Cirrus is an automated risk predictor for breast cancer that comprises texture-based mammographic features and is mostly independent of mammographic density. We investigated genetic and environmental variance of variation in Cirrus.
Methods: We measured Cirrus for 3,195 breast cancer-free participants, including 527 pairs of monozygotic (MZ) twins, 271 pairs of dizygotic (DZ) twins, and 1,599 siblings of twins.
The accumulation of amyloid-β (Aβ) plaques in the brain is considered a hallmark of Alzheimer's disease (AD). Mathematical modeling, capable of predicting the motion and accumulation of Aβ, has obtained increasing interest as a potential alternative to aid the diagnosis of AD and predict disease prognosis. These mathematical models have provided insights into the pathogenesis and progression of AD that are difficult to obtain through experimental studies alone.
View Article and Find Full Text PDFThe recent biological redefinition of Alzheimer's Disease (AD) has spurred the development of statistical models that relate changes in biomarkers with neurodegeneration and worsening condition linked to AD. The ability to measure such changes may facilitate earlier diagnoses for affected individuals and help in monitoring the evolution of their condition. Amongst such statistical tools, disease progression models (DPMs) are quantitative, data-driven methods that specifically attempt to describe the temporal dynamics of biomarkers relevant to AD.
View Article and Find Full Text PDFNucleotide and protein sequences stored in public databases are the cornerstone of many bioinformatics analyses. The records containing these sequences are prone to a wide range of errors, including incorrect functional annotation, sequence contamination and taxonomic misclassification. One source of information that can help to detect errors are the strong interdependency between records.
View Article and Find Full Text PDFFree energy perturbation (FEP) calculations can predict relative binding affinities of an antigen and its point mutants to the same human leukocyte antigen (HLA) with high accuracy (e.g., within 1.
View Article and Find Full Text PDFMotivation: Literature-based gene ontology annotations (GOA) are biological database records that use controlled vocabulary to uniformly represent gene function information that is described in the primary literature. Assurance of the quality of GOA is crucial for supporting biological research. However, a range of different kinds of inconsistencies in between literature as evidence and annotated GO terms can be identified; these have not been systematically studied at record level.
View Article and Find Full Text PDFWhile cochlear implants have helped hundreds of thousands of individuals, it remains difficult to predict the extent to which an individual's hearing will benefit from implantation. Several publications indicate that machine learning may improve predictive accuracy of cochlear implant outcomes compared to classical statistical methods. However, existing studies are limited in terms of model validation and evaluating factors like sample size on predictive performance.
View Article and Find Full Text PDFWhile the majority of cochlear implant recipients benefit from the device, it remains difficult to estimate the degree of benefit for a specific patient prior to implantation. Using data from 2,735 cochlear-implant recipients from across three clinics, the largest retrospective study of cochlear-implant outcomes to date, we investigate the association between 21 preoperative factors and speech recognition approximately one year after implantation and explore the consistency of their effects across the three constituent datasets. We provide evidence of 17 statistically significant associations, in either univariate or multivariate analysis, including confirmation of associations for several predictive factors, which have only been examined in prior smaller studies.
View Article and Find Full Text PDFMotivation: The high accuracy of recent haplotype phasing tools is enabling the integration of haplotype (or phase) information more widely in genetic investigations. One such possibility is phase-aware expression quantitative trait loci (eQTL) analysis, where haplotype-based analysis has the potential to detect associations that may otherwise be missed by standard SNP-based approaches.
Results: We present eQTLHap, a novel method to investigate associations between gene expression and genetic variants, considering their haplotypic and genotypic effect.
Haplotype phasing is a critical step for many genetic applications but incorrect estimates of phase can negatively impact downstream analyses. One proposed strategy to improve phasing accuracy is to combine multiple independent phasing estimates to overcome the limitations of any individual estimate. However, such a strategy is yet to be thoroughly explored.
View Article and Find Full Text PDFHuman Leucocyte Antigen (HLA) testing is useful in the clinical work-up of coeliac disease (CD) with high negative but low positive predictive value. We construct a genomic risk score (GRS) using HLA risk genotypes to improve CD prediction and guide exclusion criteria. Imputed HLA genotypes for five European CD case-control GWAS (n > 15,000) were used to construct and validate an interpretable HLA-based risk model (HDQ), which shows statistically significant improvements in predictive performance upon all previous HLA-based risk models.
View Article and Find Full Text PDFBackground: Coronavirus disease 2019 has highlighted deficiencies in the testing capacity of many developed countries during the early stages of pandemics. Here we describe a strategy using pan-family viral assays to improve early accessibility of large-scale nucleic acid testing.
Methods: Coronaviruses and severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) were used as a case study for assessing utility of pan-family viral assays during the early stages of a novel pandemic.
Background: Knowledge of phase, the specific allele sequence on each copy of homologous chromosomes, is increasingly recognized as critical for detecting certain classes of disease-associated mutations. One approach for detecting such mutations is through phased haplotype association analysis. While the accuracy of methods for phasing genotype data has been widely explored, there has been little attention given to phasing accuracy at haplotype block scale.
View Article and Find Full Text PDFBackground: We applied machine learning to find a novel breast cancer predictor based on information in a mammogram.
Methods: Using image-processing techniques, we automatically processed 46 158 analog mammograms for 1345 cases and 4235 controls from a cohort and case-control study of Australian women, and a cohort study of Japanese American women, extracting 20 textural features not based on pixel brightness threshold. We used Bayesian lasso regression to create individual- and mammogram-specific measures of breast cancer risk, Cirrus.
It is increasingly recognized that Alzheimer's disease (AD) exists before dementia is present and that shifts in amyloid beta occur long before clinical symptoms can be detected. Early detection of these molecular changes is a key aspect for the success of interventions aimed at slowing down rates of cognitive decline. Recent evidence indicates that of the two established methods for measuring amyloid, a decrease in cerebrospinal fluid (CSF) amyloid β (Aβ) may be an earlier indicator of Alzheimer's disease risk than measures of amyloid obtained from Positron Emission Tomography (PET).
View Article and Find Full Text PDFAs the cost of DNA sequencing continues to fall, an increasing amount of information on human genetic variation is being produced that could help progress precision medicine. However, information about such mutations is typically first made available in the scientific literature, and is then later manually curated into more standardized genomic databases. This curation process is expensive, time-consuming and many variants do not end up being fully curated, if at all.
View Article and Find Full Text PDF