Utility of large language models for creating clinical assessment items.

George Lam Yusra Shammoon Anna Coulson Felicity Lalloo Arti Maini Anjali Amin Celia Brown Amir H Sam

Med Teach

Imperial College School of Medicine, Imperial College London, London, UK.

Published: August 2024

Purpose: To compare student performance, examiner perceptions and cost of GPT-assisted (generative pretrained transformer-assisted) clinical and professional skills assessment (CPSAs) items against items created using standard methods.

Methods: We conducted a prospective, controlled, double-blinded comparison of CPSA items developed using GPT-assistance with those created through standard methods. Two sets of six practical cases were developed for a formative assessment sat by final year medical students. One clinical case in each set was created with GPT-assistance. Students were assigned to one of the two sets.

Results: The results of 239 participants were analysed in the study. There was no statistically significant difference in item difficulty, or discriminative ability between GPT-assisted and standard items. One hundred percent ( = 15) of respondents to an examiner feedback questionnaire felt GPT-assisted cases were appropriately difficult and realistic. GPT-assistance resulted in significant labour cost savings, with a mean reduction of 57% (880 GBP) in labour cost per case when compared to standard case drafting methods.

Conclusions: GPT-assistance can create CPSA items of comparable quality with significantly less cost when compared to standard methods. Future studies could evaluate GPT's ability to create CPSA material in other areas of clinical practice, aiming to validate the generalisability of these findings.

Download full-text PDF	Source
http://dx.doi.org/10.1080/0142159X.2024.2382860	DOI Listing

Publication Analysis

Top Keywords

created standard

cpsa items

standard methods

labour cost

compared standard

create cpsa

items

standard

utility large

large language

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!