Appropriate medical data categorization for data mining classification techniques.

Med Inform Internet Med

Department of Internal Medicine, Chang Gung Memorial Hospital, Kaohsiung, Taiwan.

Published: March 2002

Some data mining (DM) methods and software tools require normalized data, others rely on categorized data, and some can accommodate multiple data scales. Each DM technique rests on its own underlying theory, so different methods are expected to give different results on the same data. The purpose of this study is to find the data format appropriate for each DM classification technique, so that the techniques can be applied more widely and trustworthy results obtained efficiently. Given the nature of medical data, categorical variables are sometimes useful for making decisions and can make it easier to extract knowledge. In this study, three mathematical data categorization methods (Fusinter, minimum description length principle [MDLPC], and Chi-merge) were applied to accommodate five data mining classification techniques (statistical discriminant analysis, supervised classification with neural networks, decision trees, genetic supervised clustering, and Bayesian classification [probabilistic neural networks; PNN]) using a heart disease database with four types of data (continuous, binary, nominal, and ordinal). Compared with original or normalized data, data categorized by the MDLPC method performed better in most of the DM classification techniques used in this study. Categorical data suits most DM classification techniques (e.g. classifying disease and non-disease groups) and is relatively easy to use for extracting medical knowledge.
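
To make the idea of supervised categorization concrete, the following is a minimal sketch of entropy-based discretization with a minimum description length stopping rule, in the style commonly attributed to Fayyad and Irani. It is not the implementation used in the study, which the abstract does not describe. The function names (`entropy`, `mdlp_cut_points`, `categorize`) and the toy values are assumptions made for illustration only.

```python
import math
from bisect import bisect_right
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def _split(xs, ys, cuts):
    """Recursively add accepted cut points for the sorted values xs with labels ys."""
    n = len(ys)
    if n < 2:
        return
    ent_s = entropy(ys)
    best = None  # (weighted entropy after the split, split index)
    for i in range(1, n):
        if xs[i] == xs[i - 1]:
            continue  # a cut can only fall between two distinct values
        w = (i / n) * entropy(ys[:i]) + ((n - i) / n) * entropy(ys[i:])
        if best is None or w < best[0]:
            best = (w, i)
    if best is None:
        return
    w, i = best
    left, right = ys[:i], ys[i:]
    gain = ent_s - w
    k, k1, k2 = len(set(ys)), len(set(left)), len(set(right))
    delta = math.log2(3 ** k - 2) - (
        k * ent_s - k1 * entropy(left) - k2 * entropy(right)
    )
    # MDL stopping rule: keep the cut only if the information gain pays for it.
    if gain > (math.log2(n - 1) + delta) / n:
        cuts.append((xs[i - 1] + xs[i]) / 2)
        _split(xs[:i], left, cuts)
        _split(xs[i:], right, cuts)


def mdlp_cut_points(values, labels):
    """Return the sorted cut points chosen for one continuous variable."""
    pairs = sorted(zip(values, labels))
    cuts = []
    _split([v for v, _ in pairs], [y for _, y in pairs], cuts)
    return sorted(cuts)


def categorize(value, cuts):
    """Map a continuous value to the index of the interval it falls in."""
    return bisect_right(cuts, value)


# Toy, made-up data: values 1..10 belong to class 0, values 11..20 to class 1.
values = list(range(1, 21))
labels = [0] * 10 + [1] * 10
cuts = mdlp_cut_points(values, labels)
categories = [categorize(v, cuts) for v in values]
```

On this toy input the single accepted cut falls midway between 10 and 11, so each value is mapped to one of two categories. When no candidate cut passes the stopping rule, the variable is left as a single interval.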

Source
http://dx.doi.org/10.1080/14639230210153749
