Classification algorithms with unbalanced datasets tend to produce high predictive accuracy over the majority class, but poor predictive accuracy over the minority class. This problem is very common in biomedical data mining. This paper introduces a Support Vector Machine (SVM)-based optimized feature selection method, to select the most relevant features and maintain an accurate and well-balanced sensitivity-specificity result between unbalanced groups. A new metric called the balance index (B) is defined to implement this optimization. The balance index measures the difference between the misclassified data within each class. The proposed optimized feature selection is applied to the classification of patients' weaning trials from mechanical ventilation: patients with successful trials who were able to maintain spontaneous breathing after 48 h and patients who failed to maintain spontaneous breathing and were reconnected to mechanical ventilation after 30 min. Patients are characterized through cardiac and respiratory signals, applying joint symbolic dynamic (JSD) analysis to cardiac interbeat and breath durations. First, the most suitable parameters (C+,C-,σ) are selected to define the appropriate SVM. Then, the feature selection process is carried out with this SVM, to maintain B lower than 40%. The best result is obtained using 6 features with an accuracy of 80%, a B of 18.64%, a sensitivity of 74.36% and a specificity of 82.42%.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiomed.2013.01.014DOI Listing

Publication Analysis

Top Keywords

feature selection
16
predictive accuracy
8
optimized feature
8
mechanical ventilation
8
maintain spontaneous
8
spontaneous breathing
8
svm-based feature
4
selection
4
selection optimize
4
optimize sensitivity-specificity
4

Similar Publications

Saturated fat in an evolutionary context.

Lipids Health Dis

January 2025

Institute of Health, Oslo New University College, Ullevålsveien 76, Oslo, 0454, Norway.

Evolutionary perspectives have yielded profound insights in health and medical sciences. A fundamental recognition is that modern diet and lifestyle practices are mismatched with the human physiological constitution, shaped over eons in response to environmental selective pressures. This Darwinian angle can help illuminate and resolve issues in nutrition, including the contentious issue of fat consumption.

View Article and Find Full Text PDF

Background: With the rising diagnostic rate of gallbladder polypoid lesions (GPLs), differentiating benign cholesterol polyps from gallbladder adenomas with a higher preoperative malignancy risk is crucial. This study aimed to establish a preoperative prediction model capable of accurately distinguishing between gallbladder adenomas and cholesterol polyps using machine learning algorithms.

Materials And Methods: We retrospectively analysed the patients' clinical baseline data, serological indicators, and ultrasound imaging data.

View Article and Find Full Text PDF

Background: In infected hosts, immune responses trigger a systemic energy reallocation away from energy storage and growth, to fuel a costly defense program. The exact energy costs of immune defense are however unknown in general. Life history theory predicts that such costs underpin trade-offs between host disease resistance and other fitness related traits, yet this has been seldom assessed.

View Article and Find Full Text PDF

Purpose: Build machine learning (ML) models able to predict pathological complete response (pCR) after neoadjuvant chemotherapy (NAC) in breast cancer (BC) patients based on conventional and radiomic signatures extracted from baseline [F]FDG PET/CT.

Material And Methods: Primary tumor and the most significant lymph node metastasis were manually segmented in baseline [F]FDG PET/CT of 52 newly diagnosed BC patients. Clinical parameters, NAC and conventional semiquantitative PET parameters were collected.

View Article and Find Full Text PDF

Prediction of isocitrate dehydrogenase (IDH) mutation status and epilepsy occurrence are important to glioma patients. Although machine learning models have been constructed for both issues, the correlation between them has not been explored. Our study aimed to exploit this correlation to improve the performance of both of the IDH mutation status identification and epilepsy diagnosis models in patients with glioma II-IV.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!