Automatic Segmentation of Membranous Glottal Gap Area with U-Net-Based Architecture.

Laryngoscope

Department of Otorhinolaryngology, Head and Neck Surgery, Changhua Christian Hospital, Changhua, Taiwan.

Published: June 2024

Background: While videostroboscopy is recognized as the most popular approach for investigating vocal fold function, evaluating the numerical values, such as the membranous glottal gap area, remains too time consuming for clinical applications.

Methods: We used a total of 2507 videostroboscopy images from 137 patients and developed five U-Net-based deep-learning image segmentation models for automatic masking of the membranous glottal gap area. To further validate the models, we used another 410 images from 41 different patients.

Results: During development, all five models exhibited acceptable and similar metrics. While the VGG19 U-Net had a long inference time of 1654 ms, the other four models had more practical inference times, ranging from 16 to 138 ms. During further validation, Efficient U-Net demonstrated the highest intersection over union of 0.8455, the highest Dice coefficient of 0.9163, and the lowest Hausdorff distance of 1.5626. The normalized membranous glottal gap area index was also calculated and validated. Efficient U-Net and VGG19 U-Net exhibited the lowest mean squared errors (3.5476 and 3.3842) and the lowest mean absolute errors (1.8835 and 1.8396).

Conclusions: Automatic segmentation of the membranous glottal gap area can be achieved through U-net-based architecture. Considering the segmentation quality and speed, Efficient U-Net is a reasonable choice for this task, while the other four models remain valuable competitors. The models' masked area enables possible calculation of the normalized membranous glottal gap area and analysis of the glottal area waveform, revealing promising clinical applications for this model.

Level Of Evidence: NA Laryngoscope, 134:2835-2843, 2024.

Download full-text PDF

Source
http://dx.doi.org/10.1002/lary.31266DOI Listing

Publication Analysis

Top Keywords

membranous glottal
24
glottal gap
24
gap area
24
efficient u-net
12
automatic segmentation
8
segmentation membranous
8
area
8
u-net-based architecture
8
vgg19 u-net
8
normalized membranous
8

Similar Publications

Vocal Fold Injury Produces Similar Biomechanical Outcomes in Male and Female Rabbits.

J Voice

September 2024

Department of Head and Neck Surgery, David Geffen School of Medicine at University of California, Los Angeles, Los Angeles, California; Greater Los Angeles Veterans Affairs Healthcare System, Los Angeles, California. Electronic address:

Objective: Sex differences in response to trauma and physiologic stressors have been identified in numerous organ systems but have not yet been defined in the larynx. The objective of this study was to develop an endoscopic vocal fold injury model in rabbits and to compare structural and functional outcomes between male and female subjects.

Study Design: Basic science study.

View Article and Find Full Text PDF

Background: While videostroboscopy is recognized as the most popular approach for investigating vocal fold function, evaluating the numerical values, such as the membranous glottal gap area, remains too time consuming for clinical applications.

Methods: We used a total of 2507 videostroboscopy images from 137 patients and developed five U-Net-based deep-learning image segmentation models for automatic masking of the membranous glottal gap area. To further validate the models, we used another 410 images from 41 different patients.

View Article and Find Full Text PDF

An Euler-Bernoulli-type beam model of the vocal folds for describing curved and incomplete glottal closure patterns.

J Mech Behav Biomed Mater

November 2023

Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada. Electronic address:

Incomplete glottal closure is a laryngeal configuration wherein the glottis is not fully obstructed prior to phonation. It has been linked to inefficient voice production and voice disorders. Various incomplete glottal closure patterns can arise and the mechanisms driving them are not well understood.

View Article and Find Full Text PDF
Article Synopsis
  • The study aimed to explore and compare the vibratory patterns of canine vocal folds to those of human vocal folds, despite their anatomical differences.
  • Researchers used a canine hemilarynx to capture vocal fold vibrations with high-speed video and performed histological examinations to analyze their structure.
  • Findings revealed that canine vocal folds exhibited similar oscillation patterns as humans, indicating that certain structural features might not be essential for producing human-like vocal sounds.
View Article and Find Full Text PDF

Incomplete glottal closure is a laryngeal configuration wherein the glottis is not fully obstructed prior to phonation. In this work, we introduce an Euler-Bernoulli composite beam vocal fold (VF) model that produces qualitatively similar incomplete glottal closure patterns as those observed in experimental and high-fidelity numerical studies, thus offering insights in to the potential underlying physical mechanisms. Refined physiological insights are pursued by incorporating the beam model into a VF posturing model that embeds the five intrinsic laryngeal muscles.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!