Objective: To compare binary metrics and Global Evaluative Assessment of Robotic Skills (GEARS) evaluations of training outcome assessments for reliability, sensitivity, and specificity.

Background: GEARS-Likert-scale skills assessment are a widely accepted tool for robotic surgical training outcome evaluations. Proficiency-based progression (PBP) training is another methodology but uses binary performance metrics for evaluations.

Methods: In a prospective, randomized, and blinded study, we compared conventional with PBP training for a robotic suturing, knot-tying anastomosis task. Thirty-six surgical residents from 16 Belgium residency programs were randomized. In the skills laboratory, the PBP group trained until they demonstrated a quantitatively defined proficiency benchmark. The conventional group were yoked to the same training time but without the proficiency requirement. The final trial was video recorded and assessed with binary metrics and GEARS by robotic surgeons blinded to individual, group, and residency program. Sensitivity and specificity of the two assessment methods were evaluated with area under the curve (AUC) and receiver operating characteristics (ROC) curves.

Results: The PBP group made 42% fewer objectively assessed performance errors than the conventional group ( < 0.001) and scored 15% better on the GEARS assessment ( = 0.033). The mean interrater reliability for binary metrics and GEARS was 0.87 and 0.38, respectively. Binary total error metrics AUC was 97% and for GEARS 85%. With a sensitivity threshold of 0.8, false positives rates were 3% and 25% for, respectively, the binary and GEARS assessments.

Conclusions: Binary metrics for scoring a robotic VUA task demonstrated better psychometric properties than the GEARS assessment.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10513364PMC
http://dx.doi.org/10.1097/AS9.0000000000000307DOI Listing

Publication Analysis

Top Keywords

binary metrics
16
reliability sensitivity
8
sensitivity specificity
8
robotic surgical
8
global evaluative
8
evaluative assessment
8
assessment robotic
8
robotic skills
8
binary
8
training outcome
8

Similar Publications

Multiclass Synthetic Accessibility Prediction.

J Chem Inf Model

January 2025

X-Chem Global HQ, 100 Beaver Street, Waltham, Massachusetts 02453, United States.

Evaluating synthetic accessibility of molecules is an integral component of the drug discovery process. While the application of machine learning models to predict whether small molecules are easy or hard to synthesize has gained attention recently, predetermined thresholds and data set imbalances present challenges for these binary classification approaches. In this study, we introduce a novel multiclass fold-ensembled classification approach to predict the minimum number of steps needed to synthesize a small molecule.

View Article and Find Full Text PDF

This study aimed to determine the chromatographic retention and dissociation/protonation constant (pK) values of lapatinib and tamoxifen, key drugs used in metastatic breast cancer treatment, at 37°C using both conventional and green high-performance liquid chromatography (HPLC) methods. Qualitative analysis was conducted on an XTerra C18 column (250 ×4.6 mm I.

View Article and Find Full Text PDF

<b>Introduction:</b> Deep neck infections (DNI) are potentially life-threatening conditions. The infections are commonly polymicrobial, and develop as a result of oral cavity infections, tonsilitis, laryngitis, trauma, or malignancy. Timely diagnostics and management are essential to prevent severe complications such as airway obstruction, sepsis or mediastinitis.

View Article and Find Full Text PDF

Background: Dysglycaemia metrics, defined as hyperglycaemia, increased glucose variability, hypoglycaemia and reduced time in the targeted blood glucose range (TIR), are linked to higher mortality. The relationship between dysglycaemia metrics and intensive care unit (ICU) mortality over time for patients with and without diabetes remains inconclusive, posing challenges for ICU medical staff in accurately identifying and distinguishing various risk factors and taking timely and appropriate responses.

Aim: To explore which dysglycaemia metrics within the first 7 days of ICU stay are associated with ICU mortality among patients with and without diabetes.

View Article and Find Full Text PDF

VirDetect-AI: a residual and convolutional neural network-based metagenomic tool for eukaryotic viral protein identification.

Brief Bioinform

November 2024

Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos 62210, México.

This study addresses the challenging task of identifying viruses within metagenomic data, which encompasses a broad array of biological samples, including animal reservoirs, environmental sources, and the human body. Traditional methods for virus identification often face limitations due to the diversity and rapid evolution of viral genomes. In response, recent efforts have focused on leveraging artificial intelligence (AI) techniques to enhance accuracy and efficiency in virus detection.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!