Diagnosis of Early Glottic Cancer Using Laryngeal Image and Voice Based on Ensemble Learning of Convolutional Neural Network Classifiers.

Ickhwan Kwon Soo-Geun Wang Sung-Chan Shin Yong-Il Cheon Byung-Joo Lee Jin-Choon Lee Dong-Won Lim Cheolwoo Jo Youngseuk Cho Bum-Joo Shin

J Voice

Department of Applied IT and Engineering, Pusan National University, Miryang, Gyeongsangnam-do, South Korea. Electronic address:

Published: September 2022

Objectives: The purpose of study is to improve the classification accuracy by comparing the results obtained by applying decision tree ensemble learning, which is one of the methods to increase the classification accuracy for a relatively small dataset, with the results obtained by the convolutional neural network (CNN) algorithm for the diagnosis of glottal cancer.

Methods: Pusan National University Hospital (PNUH) dataset were used to establish classifiers and Pusan National University Yangsan Hospital (PNUYH) dataset were used to verify the classifier's performance in the generated model. For the diagnosis of glottic cancer, deep learning-based CNN models were established and classified using laryngeal image and voice data. Classification accuracy was obtained by performing decision tree ensemble learning using probability through CNN classification algorithm. In this process, the classification and regression tree (CART) method was used. Then, we compared the classification accuracy of decision tree ensemble learning with CNN individual classifiers by fusing the laryngeal image with the voice decision tree classifier.

Results: We obtained classification accuracy of 81.03 % and 99.18 % in the established laryngeal image and voice classification models using PNUH training dataset, respectively. However, the classification accuracy of CNN classifiers decreased to 73.88 % in voice and 68.92 % in laryngeal image when using an external dataset of PNUYH. To solve this problem, decision tree ensemble learning of laryngeal image and voice was used, and the classification accuracy was improved by integrating data of laryngeal image and voice of the same person. The classification accuracy was 87.88 % and 89.06 % for the individualized laryngeal image and voice decision tree model respectively, and the fusion of the laryngeal image and voice decision tree results represented a classification accuracy of 95.31 %.

Conclusion: The results of our study suggest that decision tree ensemble learning aimed at training multiple classifiers is useful to obtain an increased classification accuracy despite a small dataset. Although a large data amount is essential for AI analysis, when an integrated approach is taken by combining various input data high diagnostic classification accuracy can be expected.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.jvoice.2022.07.007	DOI Listing

Publication Analysis

Top Keywords

classification accuracy

laryngeal image

image voice

decision tree

ensemble learning

tree ensemble

classification

voice decision

accuracy

laryngeal

Similar Publications

Cognitive load detection through EEG lead wise feature optimization and ensemble classification.

Sci Rep

January 2025

Department of ECE, Kallam Haranadhareddy Institute of Technology, Guntur, Andhra Pradesh, India.

Jammisetty Yedukondalu Kalyani Sunkara Vankayalapati Radhika Sivakrishna Kondaveeti Murali Anumothu

Cognitive load stimulates neural activity, essential for understanding the brain's response to stress-inducing stimuli or mental strain. This study examines the feasibility of evaluating cognitive load by extracting, selection, and classifying features from electroencephalogram (EEG) signals. We employed robust local mean decomposition (R-LMD) to decompose EEG data from each channel, recorded over a four-second period, into five modes.

View Article and Find Full Text PDF

Similar Publications

Exploring Chronic Pain in Hemodialysis Patients: An Observational Study Based on the New IASP Classification for ICD-11.

Pain Ther

January 2025

Department of Medicine, Nephrology Division, University of Verona, Verona, Italy.

Vittorio Schweiger Martina Cacciapuoti Marta Nizzero Salvatore Simari Gianmarco Lombardi

Introduction: Pain is one of the most frequently reported symptoms in hemodialyzed (HD) patients, with prevalence rates between 33% and 82%. Risk factors for chronic pain in HD patients are older age, long-lasting dialysis history, several concomitant diseases, malnutrition, and others. However, chronic pain assessment in HD patients is rarely performed by specialists in pain medicine, with relevant consequences in terms of diagnostic and treatment accuracy.

View Article and Find Full Text PDF

Similar Publications

Electrical impedance-based tissue classification for bladder tumor differentiation.

Sci Rep

January 2025

Institute for System Dynamics, University of Stuttgart, Waldburgstr. 19, 70563, Stuttgart, Germany.

Carina Veil Franziska Krauß Bastian Amend Falko Fend Oliver Sawodny

Including sensor information in medical interventions aims to support surgeons to decide on subsequent action steps by characterizing tissue intraoperatively. With bladder cancer, an important issue is tumor recurrence because of failure to remove the entire tumor. Impedance measurements can help to classify bladder tissue and give the surgeons an indication on how much tissue to remove.

View Article and Find Full Text PDF

Similar Publications

Unifying topological structure and self-attention mechanism for node classification in directed networks.

Sci Rep

January 2025

College of Computer and Information Engineering, Nanjing Tech University, Nanjing, 211800, China.

Yue Peng Jiwen Xia Dafeng Liu Miao Liu Long Xiao

Graph data is essential for modeling complex relationships among entities. Graph Neural Networks (GNNs) have demonstrated effectiveness in processing low-order undirected graph data; however, in complex directed graphs, relationships between nodes extend beyond first-order connections and encompass higher-order relationships. Additionally, the asymmetry introduced by edge directionality further complicates node interactions, presenting greater challenges for extracting node information.

View Article and Find Full Text PDF

Similar Publications

Leveraging U-Net and selective feature extraction for land cover classification using remote sensing imagery.

Sci Rep

January 2025

Computer Vision Center, Universitat Autònoma de Barcelona, Barcelona, 08193, Spain.

Leo Thomas Ramos Angel D Sappa

In this study, we explore an enhancement to the U-Net architecture by integrating SK-ResNeXt as the encoder for Land Cover Classification (LCC) tasks using Multispectral Imaging (MSI). SK-ResNeXt introduces cardinality and adaptive kernel sizes, allowing U-Net to better capture multi-scale features and adjust more effectively to variations in spatial resolution, thereby enhancing the model's ability to segment complex land cover types. We evaluate this approach using the Five-Billion-Pixels dataset, composed of 150 large-scale RGB-NIR images and over 5 billion labeled pixels across 24 categories.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!