Purpose: This study aimed to compare the cost-effectiveness of AI-based approaches with manual approaches in ultrasound image quality control (QC).

Methods: Eligible ultrasonographers and pregnant volunteers were prospectively recruited from the Hunan Maternal and Child Health Hospital in May 2023. The ultrasonographers were randomly and evenly assigned to either the AI or Manual QC groups with baseline scores determined in June-July. From August to October, these groups received real-time AI or post-scan manual QC with post-interventional scores recorded monthly. We applied the repeated measures analysis of variance to analyze the between-subject and within-subject effectiveness and time trends in effectiveness (QC score improvement) assessment. An extra 50 pregnant volunteers underwent real-time manual QC, with their screening images utilized for post-scan AI and manual QC. The time cost of real-time AI QC was zero since it only required trainees' involvement. We used Friedman's M and Quade tests to compare multiple independent medians in cost assessment.

Results: This study recruited 14 ultrasonographers, equally divided into the AI and Manual QC groups. No significant difference existed between the groups concerning age, service year in perinatal diagnosis, male proportion, and QC frequency. The simple effect of the group revealed that the AI QC method outperformed the Manual QC method at least once (F = 13.113, P = 0.004, η = 0.522). The simple effect of the month in the AI QC groups indicated an improvement in the mean QC scores (F = 9.827, P = 0.003, η = 0.747) while that of manual QC groups suggested no improvement (F = 0.144, P = 0.931, η = 0.041). Baseline scores were equal in June-July (F = 0.031, P = 0.864, η = 0.003). However, the AI QC group surpassed the Manual QC group in August (F = 14.579, P = 0.002, η = 0.549), September (F = 28.590, P < 0.001, η = 0.704), and October (F = 35.411, P < 0.001, η = 0.747). Within the Manual QC group, no significant differences were found in scores between June-July and August, September, or October (all P values of 1.000, nominal significance level of 0.0083). In contrast, the AI QC group showed significantly higher scores in August, September, and October compared to June-July (all P values of 0.001, nominal significance level of 0.0083). The time costs of real-time AI QC, post-scan AI QC, post-scan manual QC, and real-time manual QC were 0 s, 13.76 s (interquartile range, IQR: 4.79-46.79 s), 1239.50 s (IQR: 1141.00-1311.25 s), and 1541.00 s (IQR: 1453.50-1635.25 s), with significant differences in both overall and multiple comparisons.

Conclusions: The AI QC method, more cost-effective than the manual method, shows great potential for application in image QC scenarios. The AI QC can enhance operators' skills in perinatal ultrasound screening, while the manual method can only maintain the existing level.

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12909-024-06477-wDOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11654377PMC

Publication Analysis

Top Keywords

manual groups
12
manual
9
image quality
8
quality control
8
pregnant volunteers
8
baseline scores
8
post-scan manual
8
groups
6
cost-effectiveness analysis
4
analysis ai-based
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!