Background: While videostroboscopy is recognized as the most popular approach for investigating vocal fold function, evaluating the numerical values, such as the membranous glottal gap area, remains too time consuming for clinical applications.
Methods: We used a total of 2507 videostroboscopy images from 137 patients and developed five U-Net-based deep-learning image segmentation models for automatic masking of the membranous glottal gap area. To further validate the models, we used another 410 images from 41 different patients.
Results: During development, all five models exhibited acceptable and similar metrics. While the VGG19 U-Net had a long inference time of 1654 ms, the other four models had more practical inference times, ranging from 16 to 138 ms. During further validation, Efficient U-Net demonstrated the highest intersection over union of 0.8455, the highest Dice coefficient of 0.9163, and the lowest Hausdorff distance of 1.5626. The normalized membranous glottal gap area index was also calculated and validated. Efficient U-Net and VGG19 U-Net exhibited the lowest mean squared errors (3.5476 and 3.3842) and the lowest mean absolute errors (1.8835 and 1.8396).
Conclusions: Automatic segmentation of the membranous glottal gap area can be achieved through U-net-based architecture. Considering the segmentation quality and speed, Efficient U-Net is a reasonable choice for this task, while the other four models remain valuable competitors. The models' masked area enables possible calculation of the normalized membranous glottal gap area and analysis of the glottal area waveform, revealing promising clinical applications for this model.
Level Of Evidence: NA Laryngoscope, 134:2835-2843, 2024.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1002/lary.31266 | DOI Listing |
J Voice
September 2024
Department of Head and Neck Surgery, David Geffen School of Medicine at University of California, Los Angeles, Los Angeles, California; Greater Los Angeles Veterans Affairs Healthcare System, Los Angeles, California. Electronic address:
Objective: Sex differences in response to trauma and physiologic stressors have been identified in numerous organ systems but have not yet been defined in the larynx. The objective of this study was to develop an endoscopic vocal fold injury model in rabbits and to compare structural and functional outcomes between male and female subjects.
Study Design: Basic science study.
Laryngoscope
June 2024
Department of Otorhinolaryngology, Head and Neck Surgery, Changhua Christian Hospital, Changhua, Taiwan.
Background: While videostroboscopy is recognized as the most popular approach for investigating vocal fold function, evaluating the numerical values, such as the membranous glottal gap area, remains too time consuming for clinical applications.
Methods: We used a total of 2507 videostroboscopy images from 137 patients and developed five U-Net-based deep-learning image segmentation models for automatic masking of the membranous glottal gap area. To further validate the models, we used another 410 images from 41 different patients.
J Mech Behav Biomed Mater
November 2023
Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada. Electronic address:
Incomplete glottal closure is a laryngeal configuration wherein the glottis is not fully obstructed prior to phonation. It has been linked to inefficient voice production and voice disorders. Various incomplete glottal closure patterns can arise and the mechanisms driving them are not well understood.
View Article and Find Full Text PDFLaryngoscope
January 2024
Department of Head and Neck Surgery, University of California, Los Angeles, California, USA.
ArXiv
July 2023
Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada.
Incomplete glottal closure is a laryngeal configuration wherein the glottis is not fully obstructed prior to phonation. In this work, we introduce an Euler-Bernoulli composite beam vocal fold (VF) model that produces qualitatively similar incomplete glottal closure patterns as those observed in experimental and high-fidelity numerical studies, thus offering insights in to the potential underlying physical mechanisms. Refined physiological insights are pursued by incorporating the beam model into a VF posturing model that embeds the five intrinsic laryngeal muscles.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!