AI Article Synopsis

  • Identifying patients with rare diseases like Hunter syndrome is difficult due to common symptoms.
  • A Naïve Bayes classification algorithm was developed using patient data from electronic medical records, successfully identifying individuals at high risk for Hunter syndrome among a large dataset.
  • The model demonstrated effectiveness in aiding physicians with diagnosis and improving patient management, showcasing its utility compared to other Bayesian networks.

Article Abstract

Identifying patients with rare diseases associated with common symptoms is challenging. Hunter syndrome, or Mucopolysaccharidosis type II is a progressive rare disease caused by a deficiency in the activity of the lysosomal enzyme, iduronate 2-sulphatase. It is inherited in an X-linked manner resulting in males being significantly affected. Expression in females varies with the majority being unaffected although symptoms may emerge over time. We developed a Naïve Bayes classification (NBC) algorithm utilizing the clinical diagnosis and symptoms of patients contained within their de-identified and unstructured electronic medical records (EMR) extracted by the Canadian Primary Care Sentinel Surveillance Network (CPCSSN). To do so, we created a training dataset using published results in the scientific literature and from all MPS II symptoms and applied the training dataset and its independent features to compute the conditional posterior probabilities of having MPS II disease as a categorical dependent variable for 506497 male patients. The classifier identified 125 patients with the highest likelihood for having the disease and 18 features were selected to be necessary for forecasting. Next, a Recursive Backward Feature Elimination algorithm was employed, for optimal input features of the NBC model, using a k-fold Cross-Validation with 3 replicates. The accuracy of the final model was estimated by the Validation Set Approach technique and the bootstrap resampling. We also investigated that whether the NBC is as accurate as three other Bayesian networks. The Naïve Bayes Classifier appears to be an efficient algorithm in assisting physicians with the diagnosis of Hunter syndrome allowing optimal patient management.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6300265PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0209018PLOS

Publication Analysis

Top Keywords

naïve bayes
12
mucopolysaccharidosis type
8
bayes classifier
8
rare disease
8
electronic medical
8
medical records
8
canadian primary
8
primary care
8
care sentinel
8
sentinel surveillance
8

Similar Publications

Background: Machine learning models can reduce the burden on doctors by converting medical records into International Classification of Diseases (ICD) codes in real time, thereby enhancing the efficiency of diagnosis and treatment. However, it faces challenges such as small datasets, diverse writing styles, unstructured records, and the need for semimanual preprocessing. Existing approaches, such as naive Bayes, Word2Vec, and convolutional neural networks, have limitations in handling missing values and understanding the context of medical texts, leading to a high error rate.

View Article and Find Full Text PDF

Empirical Bayes Linked Matrix Decomposition.

Mach Learn

October 2024

Division of Biostatistics and Health Data Science, School of Public Health, University of Minnesota, Minneapolis, 55455, MN, USA.

Data for several applications in diverse fields can be represented as multiple matrices that are linked across rows or columns. This is particularly common in molecular biomedical research, in which multiple molecular "omics" technologies may capture different feature sets (e.g.

View Article and Find Full Text PDF

Predicting risk of future dementia is essential for primary prevention strategies, particularly in the era of novel immunotherapies. However, few studies have developed population-level prediction models using existing routine healthcare data. In this longitudinal retrospective cohort study, we predicted incident dementia using primary and secondary care health records at 5, 10 and 13 years in 144 113 Scottish older adults who were dementia-free prior to 1st April 2009.

View Article and Find Full Text PDF

SIRT6, a member of the sirtuin protein family, is recognized as a tumor suppressor. This study investigates the evolutionary history of the SIRT gene family and examines the selective pressures shaping their functional divergence. Insights into the evolution of these genes may enhance our understanding of their roles in disease pathology.

View Article and Find Full Text PDF

Objective: This study was to explore the factors associated with prolonged hospital length of stay (LOS) in patients with intracranial aneurysms (IAs) undergoing endovascular interventional embolization and construct prediction model machine learning algorithms.

Methods: Employing a retrospective cohort study design, this study collected patients with ruptured IA who received endovascular treatment at Jingzhou First People's Hospital during the inclusion period from September 2022 to December 2023. The entire dataset was randomly split into training and testing dataset with a 7:3 ratio.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!