Objective: This paper is concerned with soft computing techniques for categorizing laryngeal disorders based on information extracted from an image of patient's vocal folds, a voice signal, and questionnaire data.

Methods: Multiple feature sets are exploited to characterize images and voice signals. To characterize colour, texture, and geometry of biological structures seen in colour images of vocal folds, eight feature sets are used. Twelve feature sets are used to obtain a comprehensive characterization of a voice signal (the sustained phonation of the vowel sound /a/). Answers to 14 questions constitute the questionnaire feature set. A committee of support vector machines is designed for categorizing the image, voice, and query data represented by the multiple feature sets into the healthy, nodular and diffuse classes. Five alternatives to aggregate separate SVMs into a committee are explored. Feature selection and classifier design are combined into the same learning process based on genetic search.

Results: Data of all the three modalities were available from 240 patients. Among those, 151 patients belong to the nodular class, 64 to the diffuse class and 25 to the healthy class. When using a single feature set to characterize each modality, the test set data classification accuracy of 75.0%, 72.1%, and 85.0% was obtained for the image, voice and questionnaire data, respectively. The use of multiple feature sets allowed to increase the accuracy to 89.5% and 87.7% for the image and voice data, respectively. The test set data classification accuracy of over 98.0% was obtained from a committee exploiting multiple feature sets from all the three modalities. The highest classification accuracy was achieved when using the SVM-based aggregation with hyper parameters of the SVM determined by genetic search. Bearing in mind the difficulty of the task, the obtained classification accuracy is rather encouraging.

Conclusions: Combination of both multiple feature sets characterizing a single modality and the three modalities allowed to substantially improve the classification accuracy if compared to the highest accuracy obtained from a single feature set and a single modality. In spite of the unbalanced data sets used, the error rates obtained for the three classes were rather similar.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.artmed.2010.02.002DOI Listing

Publication Analysis

Top Keywords

feature sets
28
multiple feature
20
classification accuracy
20
image voice
16
feature set
12
three modalities
12
feature
11
data
8
questionnaire data
8
laryngeal disorders
8

Similar Publications

Frontal Fibrosing Alopecia (FFA) Part I - Diagnosis and Clinical Presentation.

J Am Acad Dermatol

January 2025

Dr. Phillip Frost Department of Dermatology and Cutaneous Surgery, University of Miami Miller School of Medicine, Miami, FL.

Frontal Fibrosing Alopecia (FFA) is a primary lymphocytic cicatricial alopecia predominantly affecting postmenopausal Caucasian women. It is characterized by a progressive frontotemporal hairline recession that presents as a scarring hairless band and is often accompanied by eyebrow and body hair loss. Although initially described in postmenopausal women, FFA has been observed in a broader demographic, including premenopausal women and occasionally men.

View Article and Find Full Text PDF

Simplifying clinical use of TCGA molecular subtypes through machine learning models.

Cancer Cell

January 2025

Computational Oncology Service, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA; Halvorsen Center for Computational Oncology, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA; Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA. Electronic address:

In this issue of Cancer Cell, Ellrott et al. present machine learning models to classify samples into The Cancer Genome Atlas molecular subtypes using compact sets of genomic features. These validated, ready-to-use models are publicly available, although some clinical hurdles need to be cleared before they are fully implemented.

View Article and Find Full Text PDF

Utilizing artificial intelligence and cellular population data for timely identification of bacteremia in hospitalized patients.

Int J Med Inform

January 2025

Department of Emergency Medicine, China Medical University Hospital, Taichung, Taiwan; School of Medicine, College of Medicine, China Medical University, Taichung, Taiwan. Electronic address:

Background: Bacteremia is a critical condition with high mortality that requires prompt detection to prevent progression to life-threatening sepsis. Traditional diagnostic approaches, such as blood cultures, are time-consuming. This limitation has encouraged the exploration of rapid prediction methodologies.

View Article and Find Full Text PDF

P2X receptors (P2XRs) are adenosine 5'-triphosphate (ATP)-gated ion channels comprising homomeric and heteromeric trimers of seven subtypes (P2X1-P2X7) that confer different rates of desensitization. The helical recoil model of P2XR desensitization proposes stability of the cytoplasmic cap sets the rate of desensitization, but timing of its formation is unclear for slow-desensitizing P2XRs. We report cryo-electron microscopy structures of full-length wild-type human P2X4 receptor in apo closed, antagonist-bound inhibited, and ATP-bound desensitized states.

View Article and Find Full Text PDF

Linking and GenBank to the National Clinical Cohort Collaborative.

Learn Health Syst

January 2025

Department of Biomedical Informatics University of Arkansas for Medical Sciences, College of Medicine Little Rock Arkansas USA.

Objective: This project demonstrates the feasibility of connecting medical imaging data and features, SARS-CoV-2 genome variants, with clinical data in the National Clinical Cohort Collaborative (N3C) repository to accelerate integrative research on detection, diagnosis, and treatment of COVID-19-related morbidities. The N3C curated a rich collection of aggregated and de-identified electronic health records (EHR) data of over 18 million patients, including 7.5 million COVID-positive patients, seen at hospitals across the United States.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!