Disease risk prediction is a rising challenge in the medical domain. Researchers have widely used machine learning algorithms to solve this challenge. The k-nearest neighbour (KNN) algorithm is the most frequently used among the wide range of machine learning algorithms. This paper presents a study on different KNN variants (Classic one, Adaptive, Locally adaptive, k-means clustering, Fuzzy, Mutual, Ensemble, Hassanat and Generalised mean distance) and their performance comparison for disease prediction. This study analysed these variants in-depth through implementations and experimentations using eight machine learning benchmark datasets obtained from Kaggle, UCI Machine learning repository and OpenML. The datasets were related to different disease contexts. We considered the performance measures of accuracy, precision and recall for comparative analysis. The average accuracy values of these variants ranged from 64.22% to 83.62%. The Hassanaat KNN showed the highest average accuracy (83.62%), followed by the ensemble approach KNN (82.34%). A relative performance index is also proposed based on each performance measure to assess each variant and compare the results. This study identified Hassanat KNN as the best performing variant based on the accuracy-based version of this index, followed by the ensemble approach KNN. This study also provided a relative comparison among KNN variants based on precision and recall measures. Finally, this paper summarises which KNN variant is the most promising candidate to follow under the consideration of three performance measures (accuracy, precision and recall) for disease prediction. Healthcare researchers and stakeholders could use the findings of this study to select the appropriate KNN variant for predictive disease risk analytics.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9012855PMC
http://dx.doi.org/10.1038/s41598-022-10358-xDOI Listing

Publication Analysis

Top Keywords

machine learning
16
disease prediction
12
precision recall
12
knn
10
k-nearest neighbour
8
neighbour knn
8
knn algorithm
8
disease risk
8
learning algorithms
8
knn variants
8

Similar Publications

Background: In pancreatic surgery Postoperative pancreatic fistula (POPF) represents the most dreaded complication, for which pancreatic texture is acknowledged as one of the strongest predictors. No consensual objective reference has been defined to evaluate the pancreas composition. The presented study aimed to mine histology data of the pancreatic tissue composition with AI assist and correlate it with clinic-pathological parameters derived from the RECOPANC study.

View Article and Find Full Text PDF

Voice Quality as Digital Biomarker in Bipolar Disorder: A Systematic Review.

J Voice

January 2025

Department of Surgery, UMONS Research Institute for Health Sciences and Technology, University of Mons (UMons), Mons, Belgium; Division of Laryngology and Bronchoesophagology, Department of Otolaryngology Head Neck Surgery, EpiCURA Hospital, Baudour, Belgium; Department of Otolaryngology-Head and Neck Surgery, Foch Hospital, School of Medicine, UFR Simone Veil, Université Versailles Saint-Quentin-en-Yvelines (Paris Saclay University), Paris, France; Department of Otolaryngology, Elsan Hospital, Paris, France. Electronic address:

Background: Voice analysis has emerged as a potential biomarker for mood state detection and monitoring in bipolar disorder (BD). The systematic review aimed to summarize the evidence for voice analysis applications in BD, examining (1) the predictive validity of voice quality outcomes for mood state detection, and (2) the correlation between voice parameters and clinical symptom scales.

Methods: A PubMed, Scopus, and Cochrane Library search was carried out by two investigators for publications investigating voice quality in BD according to Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statements.

View Article and Find Full Text PDF

A neurocomputational account of multi-line electronic gambling machines.

Trends Cogn Sci

January 2025

Department of Psychology, Biological Psychology, University of Cologne, Cologne, Germany. Electronic address:

Multi-line electronic gambling machines (EGMs) are strongly associated with problem gambling. Dopamine (DA) plays a central role in substance-use disorders, which share clinical and behavioral features with disordered gambling. The structural design features of multi-line EGMs likely lead to the elicitation of various dopaminergic effects within their nested anticipation-outcome structure.

View Article and Find Full Text PDF

Many atopic dermatitis (AD) patients have suboptimal responses to Dupilumab therapy. This study identified key genes linked to this resistance using multi-omics approaches to benefit more patients. We selected a prospective cohort of 54 CE treated with Dupilumab from the GEO database.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!