[Accuracy of multi-task network based on vision Transformer in the three-dimensional upper airway analysis].

Zhonghua Kou Qiang Yi Xue Za Zhi

State Key Laboratory of Oral & Maxillofacial Reconstruction and Regeneration, Key Laboratory of Oral Biomedicine Ministry of Education, Hubei Key Laboratory of Stomatology, School & Hospital of Stomatology, Wuhan University, Wuhan 430079, China.

Published: September 2024

To explore the accuracy of a multi-task model based on vision Transformer for analyzing the three-dimensional (3D) upper airway and its subregions, and to evaluate its clinical applicability. According to the inclusion and exclusion criteria, cone-beam CT (CBCT) data of 10 patients [4 males and 6 females, (20.8±2.7) years] who had their first visit to the Department of Orthodontics in the Hospital of Stomatology, Wuhan University from January 2012 to January 2020 were retrospectively selected. The 3D slicer software was used to segment the upper airway and pharyngeal airway and measure their volumes as the gold standard. The Dolphin 3D software was used to segment the pharyngeal airway and its subregions and measure their volumes as the gold standard. A multi-task model based on vision Transformer developed by the research team for automatic segmentation and volume measurement of the upper airway and its subregions. All the measurements were conducted by the same attending physician. The Bland-Altman analysis and intraclass correlation coefficient () were used to evaluate the consistency between the multi-task network and the gold standard in the upper airway segmentation and volume measurements, and the paired test was used to compare the differences between the multi-tasking model and the gold standard. The mean volume deviation of the upper airway segmented by multi-task model and 3D Slicer was -979.6 mm, and the was 0.97. The mean volume deviation of the pharyngeal airway, nasopharynx, velopharynx, glossopharynx and hypopharynx segmented by multi-task network and Dolphin 3D were 2 069.5, -950.1, -823.6, -813.9 and 4 003.4 mm, respectively. In addition, in pharyngeal airway, nasopharynx, velopharynx, glossopharynx and hypopharynx were 0.97, 0.94, 0.96, 0.96 and 0.69, respectively. The multi-task model based on vision Transformer produced different errors in the segmentation of 3D upper airway and its subregions. The segmentation of the nasopharynx, velopharynx and glossopharynx was in good agreement with the gold standard, while the segmentation of hypopharynx was poor, suggesting that the robustness and generalization of this model should be further enhanced.

Download full-text PDF

Source
http://dx.doi.org/10.3760/cma.j.cn112144-20240514-00205DOI Listing

Publication Analysis

Top Keywords

upper airway
28
gold standard
20
based vision
16
vision transformer
16
multi-task model
16
airway subregions
16
pharyngeal airway
16
multi-task network
12
model based
12
nasopharynx velopharynx
12

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!