Missing Value Estimation Methods Research for Arrhythmia Classification Using the Modified Kernel Difference-Weighted KNN Algorithms.

Fei Yang, Jiazhi Du, Jiying Lang, Weigang Lu, Lei Liu, Changlong Jin, Qinma Kang

Biomed Res Int

School of Mechanical, Electrical and Information Engineering, Shandong University, Weihai, China.

Published: April 2021

ECG signals are essential for classifying cardiac arrhythmias using machine learning, but datasets often have missing values, which complicate classification.
Multiple methods for estimating these missing values, including Zero, Mean, PCA-based, and RPCA-based methods, are compared in the paper.
The proposed MKDF-WKNN classification algorithm outperforms existing methods for imbalanced datasets, with RPCA effectively managing missing data in arrhythmia datasets.

Electrocardiogram (ECG) signals are critical to the classification of cardiac arrhythmia with machine learning methods. In practice, ECG datasets usually contain multiple missing values due to faults or distortion. Unfortunately, many established classification algorithms require a complete matrix as input. It is therefore necessary to impute the missing data to make classification more effective on datasets with a few missing values. In this paper, we compare the main methods for estimating missing values in electrocardiogram data, namely the "Zero method", "Mean method", "PCA-based method", and "RPCA-based method", and then propose a novel KNN-based classification algorithm, a modified kernel Difference-Weighted KNN classifier (MKDF-WKNN), which is suited to the classification of imbalanced datasets. The experimental results on the UCI database indicate that the "RPCA-based method" can successfully handle missing values in the arrhythmia dataset regardless of how many values are missing, and that our proposed classification algorithm, MKDF-WKNN, is superior to other state-of-the-art algorithms such as KNN, DS-WKNN, DF-WKNN, and KDF-WKNN on uneven datasets, where class imbalance affects classification accuracy.
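
To make the imputation baselines named in the abstract concrete, here is a minimal NumPy sketch of zero filling, mean filling, and a simple iterative low-rank (PCA-style) fill. It is an illustration under stated assumptions, not the authors' implementation: the function names `impute_zero`, `impute_mean`, and `impute_pca`, the `rank` and `n_iter` parameters, and the convention that missing entries are encoded as NaN are all hypothetical choices, and the RPCA-based method is not shown.

```python
import numpy as np


def impute_zero(X):
    """Replace every missing entry (NaN) with 0."""
    return np.where(np.isnan(X), 0.0, X)


def impute_mean(X):
    """Replace every missing entry with the mean of the observed values
    in the same column. Assumes each column has at least one observed value."""
    col_means = np.nanmean(X, axis=0)
    return np.where(np.isnan(X), col_means, X)


def impute_pca(X, rank=1, n_iter=50):
    """Iterative low-rank (PCA-style) imputation, an assumed variant:
    start from the mean fill, then repeatedly reconstruct the matrix from
    its top `rank` principal components and copy the reconstruction into
    the missing cells only. Observed entries are never changed."""
    mask = np.isnan(X)
    filled = impute_mean(X)
    for _ in range(n_iter):
        mu = filled.mean(axis=0)
        U, s, Vt = np.linalg.svd(filled - mu, full_matrices=False)
        low_rank = (U[:, :rank] * s[:rank]) @ Vt[:rank] + mu
        filled = np.where(mask, low_rank, X)
    return filled
```

Any of these functions returns a complete matrix that can then be passed to a classifier requiring fully observed input, such as a KNN variant.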

PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7327608
DOI: http://dx.doi.org/10.1155/2020/7141725

Similar Publications

Autonomous International Classification of Diseases Coding Using Pretrained Language Models and Advanced Prompt Learning Techniques: Evaluation of an Automated Analysis System Using Medical Text.

JMIR Med Inform

January 2025

Medical Big Data Research Center, Chinese PLA General Hospital, Beijing, China.

Yan Zhuang, Junyan Zhang, Xiuxing Li, Chao Liu, Yue Yu

Background: Machine learning models can reduce the burden on doctors by converting medical records into International Classification of Diseases (ICD) codes in real time, thereby enhancing the efficiency of diagnosis and treatment. However, it faces challenges such as small datasets, diverse writing styles, unstructured records, and the need for semimanual preprocessing. Existing approaches, such as naive Bayes, Word2Vec, and convolutional neural networks, have limitations in handling missing values and understanding the context of medical texts, leading to a high error rate.


Nucleocapsid Antibodies as an Optimal Serological Marker of SARS-CoV-2 Infection: A Longitudinal Study at the Thomayer University Hospital.

J Clin Lab Anal

January 2025

Department of Clinical Biochemistry, Thomayer University Hospital, Prague, Czech Republic.

Markéta Ibrahimová, Vladislava Jamriková, Kateřina Pavelková, Klára Bořecká

Background: The longitudinal study was conducted over the initial 2 years of the COVID-19 pandemic, spanning from June 2020 to December 2022, in healthcare workers (HCWs) of the Thomayer University Hospital. A total of 3892 blood samples were collected and analyzed for total nucleocapsid (N) antibodies. The aim of the study was to evaluate the dynamics of N antibodies, their relationship to the PCR test, spike (S) antibodies, interferon-gamma, and prediction of reinfection with SARS-CoV-2.


Personalized Cutoffs for the Diagnosis of Neutropenic Fever Based on Patients' Baseline Body Temperature: A Retrospective Pilot Study.

Cureus

December 2024

Public Health and Preventive Medicine, State University of New York Upstate Medical University, Syracuse, USA.

Ivayla I Geneva, Anthony J Corsi, Madison Searles, Christina D Lupone

Background: The management of neutropenic fever patients remains challenging. Patients' individual baseline body temperature may provide diagnostic and prognostic value. Methods: This study is a retrospective analysis of 92 adults admitted for neutropenic fever to model the length of stay (LOS) and the ability to find a definitive diagnosis using the deviation of patients' temperature on admission from their outpatient baseline, acuity on admission, neutropenia level and persistence, fever persistence, and patients' age.


Uncertain choices with asymmetric information: how clear evidence and ambiguity interact?

Front Psychol

December 2024

Control and Intelligent Processing Centre of Excellence, School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran.

Amir Hossein Tehrani-Safa, Atiye Sarabi-Jamab, Abdol-Hossein Vahabie, Babak Nadjar Araabi

Real-world decisions often involve partial ambiguity, where the complete picture of potential risks is unclear. In such situations, individuals must make choices by balancing the value of available information against the uncertainty of unknown risks. Our study investigates this challenge by examining how people navigate the trade-off between the favorability of limited evidence and the degree of ambiguity when making decisions under partial ambiguity.


ChatGPT4's diagnostic accuracy in inpatient neurology: A retrospective cohort study.

Heliyon

December 2024

Department of Emergency Medicine, Arrowhead Regional Medical Center, 400 N. Pepper Ave, Colton, CA, 92324, USA.

Sebastian Cano-Besquet, Tyler Rice-Canetto, Hadi Abou-El-Hassan, Simon Alarcon, Jason Zimmerman

Background: Large language models (LLMs) such as ChatGPT-4 (CG4) are proving to be valuable tools in the medical field, not only in facilitating administrative tasks, but in augmenting medical decision-making. LLMs have previously been tested for diagnostic accuracy with expert-generated questions and standardized test data. Among those studies, CG4 consistently outperformed alternative LLMs, including ChatGPT-3.
