Background: Manual objective assessment of skill and errors in minimally invasive surgery have been validated with correlation to surgical expertise and patient outcomes. However, assessment and error annotation can be subjective and are time-consuming processes, often precluding their use. Recent years have seen the development of artificial intelligence models to work towards automating the process to allow reduction of errors and truly objective assessment. This study aimed to validate surgical skill rating and error annotations in suturing gestures to inform the development and evaluation of AI models.

Methods: SAR-RARP50 open data set was blindly, independently annotated at the gesture level in Robotic-Assisted Radical Prostatectomy (RARP) suturing. Manual objective assessment tools and error annotation methodology, Objective Clinical Human Reliability Analysis (OCHRA), were used as ground truth to train and test vision-based deep learning methods to estimate skill and errors. Analysis included descriptive statistics plus tool validity and reliability.

Results: Fifty-four RARP videos (266 min) were analysed. Strong/excellent inter-rater reliability (range r = 0.70-0.89, p < 0.001) and very strong correlation (r = 0.92, p < 0.001) between objective assessment tools was demonstrated. Skill estimation of OSATS and M-GEARS had a Spearman's Correlation Coefficient 0.37 and 0.36, respectively, with normalised mean absolute error representing a prediction error of 17.92% (inverted "accuracy" 82.08%) and 20.6% (inverted "accuracy" 79.4%) respectively. The best performing models in error prediction achieved mean absolute precision of 37.14%, area under the curve 65.10% and Macro-F1 58.97%.

Conclusions: This is the first study to employ detailed error detection methodology and deep learning models within real robotic surgical video. This benchmark evaluation of AI models sets a foundation and promising approach for future advancements in automated technical skill assessment.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11614916PMC
http://dx.doi.org/10.1007/s00464-024-11341-5DOI Listing

Publication Analysis

Top Keywords

objective assessment
12
deep learning
8
manual objective
8
skill errors
8
error annotation
8
learning prediction
4
error
4
prediction error
4
skill
4
error skill
4

Similar Publications

Objective: Neurocognitive (NC) impairment in people with HIV (PWH) is associated with erythrocyte indices, which may serve as indicators of iron metabolism, inflammation, and related factors. Erythropoiesis requires iron, regulated by a multifaceted system of peptide hormones, including hepcidin. This study postulated that hepcidin might modify the relationship between erythrocyte indices and NC performance in PWH.

View Article and Find Full Text PDF

Objective: To examine the impact of in utero exposure to dolutegravir (DTG)- or efavirenz (EFV)-based antiretroviral treatment (ART) on child neurodevelopmental (ND) outcomes.

Design: Prospective cohort design, enrolling 3 cohorts of 2-year-olds: children HIV-negative born to mothers with HIV (CHEU) receiving either DTG-based or EFV-based 3-drug ART during pregnancy, and children born to mothers without HIV (CHUU).

Methods: Primary child ND outcomes were assessed using the Bayley Scales of Infant and Toddler Development, Third Edition (BSID-III) and compared between cohorts using generalized estimating equation models adjusted for confounders.

View Article and Find Full Text PDF

Incidence of fall-from-height injuries and predictive factors for severity.

J Osteopath Med

January 2025

McAllen Department of Trauma, South Texas Health System, McAllen, TX, USA.

Context: The injuries caused by falls-from-height (FFH) are a significant public health concern. FFH is one of the most common causes of polytrauma. The injuries persist to be significant adverse events and a challenge regarding injury severity assessment to identify patients at high risk upon admission.

View Article and Find Full Text PDF

Context: Point-of-care ultrasound (POCUS) has diverse applications across various clinical specialties, serving as an adjunct to clinical findings and as a tool for increasing the quality of patient care. Owing to its multifunctionality, a growing number of medical schools are increasingly incorporating POCUS training into their curriculum, some offering hands-on training during the first 2 years of didactics and others utilizing a longitudinal exposure model integrated into all 4 years of medical school education. Midwestern University Arizona College of Osteopathic Medicine (MWU-AZCOM) adopted a 4-year longitudinal approach to include POCUS education in 2017.

View Article and Find Full Text PDF

Objectives: To assess the usefulness of sentinel lymph node biopsy (SLNB) in patients with early-stage oral squamous cell carcinoma (OSCC).

Materials And Methods: Seventy-five patients (mean age 62 years) diagnosed with cT1-2 N0 underwent SLNB with Tc, lymphoscintigraphy/SPECT-CT, and gamma probe detection with intraoperative histological examination of the resected sentinel lymph nodes (SLNs). Elective neck dissection was performed during the same surgical procedure of primary tumor resection when malignant deposits were detected microscopically.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!