A quantum-based oversampling method for classification of highly imbalanced and overlapped data.

Exp Biol Med (Maywood)

School of Computing Sciences and Computer Engineering, University of Southern Mississippi, Hattiesburg, MS 39406, USA.

Published: December 2023

Data imbalance is a challenging problem in classification tasks, and when combined with class overlapping, it further deteriorates classification performance. However, existing studies have rarely addressed both issues simultaneously. In this article, we propose a novel quantum-based oversampling method (QOSM) to effectively tackle data imbalance and class overlapping, thereby improving classification performance. QOSM utilizes the quantum potential theory to calculate the potential energy of each sample and selects the sample with the lowest potential as the center of each cover generated by a constructive covering algorithm. This approach optimizes cover center selection and better captures the distribution of the original samples, particularly in the overlapping regions. In addition, oversampling is performed on the samples of the minority class covers to mitigate the imbalance ratio (IR). We evaluated QOSM using three traditional classifiers (support vector machines [SVM], k-nearest neighbor [KNN], and naive Bayes [NB] classifier) on 10 publicly available KEEL data sets characterized by high IRs and varying degrees of overlap. Experimental results demonstrate that QOSM significantly improves classification accuracy compared to approaches that do not address class imbalance and overlapping. Moreover, QOSM consistently outperforms existing oversampling methods tested. With its compatibility with different classifiers, QOSM exhibits promising potential to improve the classification performance of highly imbalanced and overlapped data.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10854475PMC
http://dx.doi.org/10.1177/15353702231220665DOI Listing

Publication Analysis

Top Keywords

classification performance
12
quantum-based oversampling
8
oversampling method
8
highly imbalanced
8
imbalanced overlapped
8
overlapped data
8
data imbalance
8
class overlapping
8
classification
6
qosm
6

Similar Publications

Current approaches for classifying biosensor data in diagnostics rely on fixed decision thresholds based on receiver operating characteristic (ROC) curves, which can be limited in accuracy for complex and variable signals. To address these limitations, we developed a framework that facilitates the application of machine learning (ML) to diagnostic data for the binary classification of clinical samples, when using real-time electrochemical measurements. The framework was applied to a real-time multimeric aptamer assay (RT-MAp) that captures single-frequency (12.

View Article and Find Full Text PDF

Parkinson Disease (PD) is a complex neurological disorder attributed by loss of neurons generating dopamine in the SN per compacta. Electroencephalogram (EEG) plays an important role in diagnosing PD as it offers a non-invasive continuous assessment of the disease progression and reflects these complex patterns. This study focuses on the non-linear analysis of resting state EEG signals in PD, with a gender-specific, brain region-specific, and EEG band-specific approach, utilizing recurrence plots (RPs) and machine learning (ML) algorithms for classification.

View Article and Find Full Text PDF

Fast, Present and Future of the Concept of Spondyloarthritis.

Curr Rheumatol Rep

January 2025

Rheumatologisches Versorgungszentrum Steglitz, Ruhr Universität Bochum, Schloßstr.110, 12163, Berlin, Germany.

Purpose Of Review: Axial spondyloarthritis (axSpA) is a rather prevalent chronic inflammatory rheumatic disease that affects already relatively young patients. It has been known better since the end of the nineteenth century but quite a lot has been learned since the early 60ies when the first classification (diagnostic) criteria for ankylosing spondylitis (AS) were agreed on. I have been part of many developments in the last 30 years, and I'm happy to have been able to contribute to the scientific progress in terms of diagnosis, imaging, pathophysiology and therapy.

View Article and Find Full Text PDF

Purpose: To explore the anatomical features of left iliac vein (LIV) in non-thrombotic venous leg ulcers (VLUs) and to identify the impact of these anatomical features on VLUs based on computed tomography venography (CTV).

Methods: This is a retrospective, single-center study of a database (2021-2023) of 431 patients with non-thrombotic chronic venous insufficiency. According to CEAP clinical (C) classifications, cases of C6 and C2 were included for analysis as case and control groups.

View Article and Find Full Text PDF

Background: Paeonia lactiflora Pall., a member of Paeoniaceae family, is a medicinal herb widely used in traditional Chinese medicine. Chloroplasts are multifunctional organelles containing distinct genetic material.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!