Data imbalance is a challenging problem in classification tasks, and when combined with class overlapping, it further deteriorates classification performance. However, existing studies have rarely addressed both issues simultaneously. In this article, we propose a novel quantum-based oversampling method (QOSM) to effectively tackle data imbalance and class overlapping, thereby improving classification performance. QOSM utilizes the quantum potential theory to calculate the potential energy of each sample and selects the sample with the lowest potential as the center of each cover generated by a constructive covering algorithm. This approach optimizes cover center selection and better captures the distribution of the original samples, particularly in the overlapping regions. In addition, oversampling is performed on the samples of the minority class covers to mitigate the imbalance ratio (IR). We evaluated QOSM using three traditional classifiers (support vector machines [SVM], k-nearest neighbor [KNN], and naive Bayes [NB] classifier) on 10 publicly available KEEL data sets characterized by high IRs and varying degrees of overlap. Experimental results demonstrate that QOSM significantly improves classification accuracy compared to approaches that do not address class imbalance and overlapping. Moreover, QOSM consistently outperforms existing oversampling methods tested. With its compatibility with different classifiers, QOSM exhibits promising potential to improve the classification performance of highly imbalanced and overlapped data.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10854475 | PMC |
http://dx.doi.org/10.1177/15353702231220665 | DOI Listing |
ACS Sens
January 2025
Department of Engineering Physics, McMaster University, 1280 Main Street West, L8S 4L8 Hamilton, Ontario, Canada.
Current approaches for classifying biosensor data in diagnostics rely on fixed decision thresholds based on receiver operating characteristic (ROC) curves, which can be limited in accuracy for complex and variable signals. To address these limitations, we developed a framework that facilitates the application of machine learning (ML) to diagnostic data for the binary classification of clinical samples, when using real-time electrochemical measurements. The framework was applied to a real-time multimeric aptamer assay (RT-MAp) that captures single-frequency (12.
View Article and Find Full Text PDFPhys Eng Sci Med
January 2025
Amrita School of Artificial Intelligence, Amrita Vishwa Vidyapeetham, Bangalore, India.
Parkinson Disease (PD) is a complex neurological disorder attributed by loss of neurons generating dopamine in the SN per compacta. Electroencephalogram (EEG) plays an important role in diagnosing PD as it offers a non-invasive continuous assessment of the disease progression and reflects these complex patterns. This study focuses on the non-linear analysis of resting state EEG signals in PD, with a gender-specific, brain region-specific, and EEG band-specific approach, utilizing recurrence plots (RPs) and machine learning (ML) algorithms for classification.
View Article and Find Full Text PDFCurr Rheumatol Rep
January 2025
Rheumatologisches Versorgungszentrum Steglitz, Ruhr Universität Bochum, Schloßstr.110, 12163, Berlin, Germany.
Purpose Of Review: Axial spondyloarthritis (axSpA) is a rather prevalent chronic inflammatory rheumatic disease that affects already relatively young patients. It has been known better since the end of the nineteenth century but quite a lot has been learned since the early 60ies when the first classification (diagnostic) criteria for ankylosing spondylitis (AS) were agreed on. I have been part of many developments in the last 30 years, and I'm happy to have been able to contribute to the scientific progress in terms of diagnosis, imaging, pathophysiology and therapy.
View Article and Find Full Text PDFPurpose: To explore the anatomical features of left iliac vein (LIV) in non-thrombotic venous leg ulcers (VLUs) and to identify the impact of these anatomical features on VLUs based on computed tomography venography (CTV).
Methods: This is a retrospective, single-center study of a database (2021-2023) of 431 patients with non-thrombotic chronic venous insufficiency. According to CEAP clinical (C) classifications, cases of C6 and C2 were included for analysis as case and control groups.
Mol Biol Rep
January 2025
School of Chinese Materia Medica, Beijing University of Chinese Medicine, Beijing, 102488, People's Republic of China.
Background: Paeonia lactiflora Pall., a member of Paeoniaceae family, is a medicinal herb widely used in traditional Chinese medicine. Chloroplasts are multifunctional organelles containing distinct genetic material.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!