Very-short-answer questions: reliability, discrimination and acceptability.

Amir H Sam, Samantha M Field, Carlos F Collares, Cees P M van der Vleuten, Val J Wass, Colin Melville, Joanne Harris, Karim Meeran

Med Educ

Medical Education Research Unit, Imperial College London, London, UK.

Published: April 2018

The study compared very-short-answer questions (VSAQs) with single-best-answer questions (SBAQs) for assessing medical students, with respect to reliability, discrimination and student performance.
VSAQs demonstrated high reliability (alpha: 0.91) and a significantly lower standard error of measurement than SBAQs, and students scored significantly lower on VSAQs.
Although 80.4% of students found VSAQs more difficult, 69.2% considered them more authentic; the authors suggest that the VSAQ format has a higher degree of validity than the SBAQ format, which was associated with significant cueing.

Context: Single-best-answer questions (SBAQs) have been widely used to test knowledge because they are easy to mark and demonstrate high reliability. However, SBAQs have been criticised for being subject to cueing.

Objectives: We used a novel assessment tool that facilitates efficient marking of open-ended very-short-answer questions (VSAQs). We compared VSAQs with SBAQs with regard to reliability, discrimination and student performance, and evaluated the acceptability of VSAQs.

Methods: Medical students were randomised to sit a 60-question assessment administered in either VSAQ and then SBAQ format (Group 1, n = 155) or the reverse (Group 2, n = 144). The VSAQs were delivered on a tablet; responses were computer-marked and subsequently reviewed by two examiners. The standard error of measurement (SEM) across the ability spectrum was estimated using item response theory.

Results: The review of machine-marked questions took an average of 1 minute, 36 seconds per question for all students. The VSAQs had high reliability (alpha: 0.91), a significantly lower SEM than the SBAQs (p < 0.001) and higher mean item-total point biserial correlations (p < 0.001). The VSAQ scores were significantly lower than the SBAQ scores (p < 0.001). The difference in scores between VSAQs and SBAQs was attenuated in Group 2. Although 80.4% of students found the VSAQs more difficult, 69.2% found them more authentic.

Conclusions: The VSAQ format demonstrated high reliability and discrimination and items were perceived as more authentic. The SBAQ format was associated with significant cueing. The present results suggest the VSAQ format has a higher degree of validity.
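For readers unfamiliar with the classical statistics named in the Results, the following is a minimal sketch of how Cronbach's alpha and an item-total point-biserial correlation can be computed from a matrix of 0/1 item scores. It is not the study's analysis code: the `responses` data and the function names are invented for illustration, and the item response theory estimate of the SEM used in the study is not reproduced here.

```python
from statistics import mean, pstdev, pvariance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of per-student item-score lists."""
    k = len(scores[0])  # number of items
    item_vars = [pvariance([row[j] for row in scores]) for j in range(k)]
    total_var = pvariance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


def item_total_point_biserial(scores, j):
    """Pearson correlation between 0/1 item j and the total score.

    This is the uncorrected form: item j is included in the total.
    """
    item = [row[j] for row in scores]
    totals = [sum(row) for row in scores]
    m_item, m_total = mean(item), mean(totals)
    cov = mean([(x - m_item) * (y - m_total) for x, y in zip(item, totals)])
    return cov / (pstdev(item) * pstdev(totals))


# Hypothetical toy data: 5 students (rows) x 4 items (columns), 1 = correct.
responses = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]

alpha = cronbach_alpha(responses)
r_first_item = item_total_point_biserial(responses, 0)
print(f"alpha = {alpha:.3f}, item 1 point-biserial = {r_first_item:.3f}")
```

Higher alpha indicates more internally consistent scores, and a higher item-total correlation indicates that an item separates stronger from weaker candidates more clearly, which is the sense of "discrimination" used in the abstract.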

DOI: http://dx.doi.org/10.1111/medu.13504
