AI Article Synopsis

  • ECG signals are essential for classifying cardiac arrhythmias using machine learning, but datasets often have missing values which complicate classification.
  • Multiple methods for estimating these missing values, including Zero, Mean, PCA-based, and RPCA-based methods, are compared in the paper.
  • The proposed MKDF-WKNN classification algorithm outperforms existing methods for imbalanced datasets, with RPCA effectively managing missing data in arrhythmia datasets.

Article Abstract

Electrocardiogram (ECG) signal is critical to the classification of cardiac arrhythmia using some machine learning methods. In practice, the ECG datasets are usually with multiple missing values due to faults or distortion. Unfortunately, many established algorithms for classification require a fully complete matrix as input. Thus it is necessary to impute the missing data to increase the effectiveness of classification for datasets with a few missing values. In this paper, we compare the main methods for estimating the missing values in electrocardiogram data, e.g., the "Zero method", "Mean method", "PCA-based method", and "RPCA-based method" and then propose a novel KNN-based classification algorithm, i.e., a modified kernel Difference-Weighted KNN classifier (MKDF-WKNN), which is fit for the classification of imbalance datasets. The experimental results on the UCI database indicate that the "RPCA-based method" can successfully handle missing values in arrhythmia dataset no matter how many values in it are missing and our proposed classification algorithm, MKDF-WKNN, is superior to other state-of-the-art algorithms like KNN, DS-WKNN, DF-WKNN, and KDF-WKNN for uneven datasets which impacts the accuracy of classification.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7327608PMC
http://dx.doi.org/10.1155/2020/7141725DOI Listing

Publication Analysis

Top Keywords

missing values
16
classification
8
modified kernel
8
kernel difference-weighted
8
difference-weighted knn
8
"rpca-based method"
8
classification algorithm
8
missing
7
values
5
method"
5

Similar Publications

Background: Machine learning models can reduce the burden on doctors by converting medical records into International Classification of Diseases (ICD) codes in real time, thereby enhancing the efficiency of diagnosis and treatment. However, it faces challenges such as small datasets, diverse writing styles, unstructured records, and the need for semimanual preprocessing. Existing approaches, such as naive Bayes, Word2Vec, and convolutional neural networks, have limitations in handling missing values and understanding the context of medical texts, leading to a high error rate.

View Article and Find Full Text PDF

Background: The longitudinal study was conducted over the initial 2 years of the COVID-19 pandemic, spanning from June 2020 to December 2022, in healthcare workers (HCWs) of the Thomayer University Hospital. A total of 3892 blood samples were collected and analyzed for total nucleocapsid (N) antibodies. The aim of the study was to evaluate the dynamics of N antibodies, their relationship to the PCR test, spike (S) antibodies, interferon-gamma, and prediction of reinfection with SARS-CoV-2.

View Article and Find Full Text PDF

Background The management of neutropenic fever patients remains challenging. Patients' individual baseline body temperature may provide diagnostic and prognostic value. Methods This study is a retrospective analysis of 92 adults admitted for neutropenic fever to model the length of stay (LOS) and the ability to find a definitive diagnosis using the deviation of patients' temperature on admission from their outpatient baseline, acuity on admission, neutropenia level and persistence, fever persistence, and patients' age.

View Article and Find Full Text PDF

Uncertain choices with asymmetric information: how clear evidence and ambiguity interact?

Front Psychol

December 2024

Control and Intelligent Processing Centre of Excellence, School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran.

Real-world decisions often involve partial ambiguity, where the complete picture of potential risks is unclear. In such situations, individuals must make choices by balancing the value of available information against the uncertainty of unknown risks. Our study investigates this challenge by examining how people navigate the trade-off between the favorability of limited evidence and the degree of ambiguity when making decisions under partial ambiguity.

View Article and Find Full Text PDF

Background: Large language models (LLMs) such as ChatGPT-4 (CG4) are proving to be valuable tools in the medical field, not only in facilitating administrative tasks, but in augmenting medical decision-making. LLMs have previously been tested for diagnostic accuracy with expert-generated questions and standardized test data. Among those studies, CG4 consistently outperformed alternative LLMs, including ChatGPT-3.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!