A suggestive approach for assessing item quality, usability and validity of Automatic Item Generation.

Adv Health Sci Educ Theory Pract

Life and Health Sciences Research Institute (ICVS), School of Medicine, University of Minho, Largo Do Paço, 4710-057, Braga, Portugal.

Published: December 2023

Automatic Item Generation (AIG) refers to the process of using cognitive models to generate test items with computer modules. It is a new but rapidly evolving research area in which cognitive and psychometric theory are combined into a digital framework. However, how the item quality, usability and validity of AIG compare with traditional item development methods remains unclear. This paper takes a top-down, strong-theory approach to evaluating AIG in medical education. Two studies were conducted. In Study I, participants with different levels of clinical knowledge and item writing experience developed medical test items both manually and through AIG, and the two item types were compared in terms of quality and usability (efficiency and learnability). In Study II, automatically generated items were included in a summative exam in the content area of surgery, and a psychometric analysis based on Item Response Theory inspected the validity and quality of the AIG items. Items generated by AIG showed quality and evidence of validity, and were adequate for testing students' knowledge. Neither the time spent developing the contents for item generation (cognitive models) nor the number of items generated varied with the participants' item writing experience or clinical knowledge. AIG produces numerous high-quality items in a fast, economical and easy-to-learn process, even for item writers who are inexperienced and have no clinical training. Medical schools may benefit from a substantial improvement in cost-efficiency in developing test items by using AIG. Applying AIG's models can significantly reduce item writing flaws, thus generating test items capable of accurately gauging students' knowledge.
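
To make the idea of generating items from a cognitive model concrete, the following is a minimal sketch of template-based generation. It is not the authors' software or method: the stem, the variable values, the answer key and the function name `generate_items` are all hypothetical and deliberately simplified, and the clinical content is illustrative only.

```python
from itertools import product

# Hypothetical item model: a stem with placeholders and the values each
# placeholder may take. None of this content is taken from the paper.
STEM = ("A {age}-year-old patient presents with {symptom}. "
        "What is the most likely diagnosis?")

VARIABLES = {
    "age": [25, 60],
    "symptom": ["right lower quadrant pain", "right upper quadrant pain"],
}

# Hypothetical, simplified answer key: the symptom alone decides the keyed
# answer. A real cognitive model would encode far richer constraints.
KEY = {
    "right lower quadrant pain": "Appendicitis",
    "right upper quadrant pain": "Cholecystitis",
}


def generate_items(stem, variables, key):
    """Fill the stem with every combination of variable values."""
    names = list(variables)
    items = []
    for values in product(*(variables[name] for name in names)):
        binding = dict(zip(names, values))
        answer = key[binding["symptom"]]
        # Distractors are the keyed answers of the other symptom values.
        distractors = sorted(set(key.values()) - {answer})
        items.append({
            "stem": stem.format(**binding),
            "answer": answer,
            "distractors": distractors,
        })
    return items


items = generate_items(STEM, VARIABLES, KEY)
for item in items:
    print(item["stem"], "|", item["answer"])
```

With two ages and two symptoms the sketch yields four items from a single model, which illustrates why the number of items grows quickly once the content of the model has been written.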


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10700404
DOI: http://dx.doi.org/10.1007/s10459-023-10225-y

Publication Analysis

Top Keywords

test items: 16
item: 12
quality usability: 12
item generation: 12
item writing: 12
item quality: 8
usability validity: 8
automatic item: 8
cognitive models: 8
items: 8

Similar Publications

The prevalence of hepatitis B virus infection remains high in the Democratic Republic of Congo (DRC), constituting a public health problem in view of the fatal complications it causes, notably cirrhosis and hepatocellular carcinoma. The aim of this study was to provide an overview of the situation of viral hepatitis B in the DRC and in particular its implications for public health. A systematic review was conducted according to Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) group guidelines.


Classroom Behavior Recognition Using Computer Vision: A Systematic Review.

Sensors (Basel)

January 2025

Faculty of Artificial Intelligence in Education, Central China Normal University, Wuhan 430079, China.

Behavioral computing based on visual cues has become increasingly important, as it can capture and annotate teachers' and students' classroom states on a large scale and in real time. However, there is a lack of consensus on the research status and future trends of computer vision-based classroom behavior recognition. The present study conducted a systematic literature review of 80 peer-reviewed journal articles following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines.


This study aims to describe and analyze the indications and clinical results of total TMJ replacement in participants with degenerative and/or inflammatory joint diseases, defining patient and intervention conditions. A systematic review was conducted according to the Cochrane Handbook for Systematic Reviews of Intervention and reported according to the PRISMA Items update. The search covered 1997 to July 2024 in PubMed, Embase, Scopus, and Web of Science.


Citizen science activities were performed using sheep as an animal model and involving 252 students aged between 9 and 11 years. The study focused on three pillars: hill/mountain landscape biodiversity, animal welfare and the social utility of research. Two types of tests, "attitude questionnaires" (AQs) and "maximum performance tests" (MPTs), were administered.


Design, Content and Ecological Validity and Reliability of the Physical Activity and Sport Habits Questionnaire for Children Aged 8-12 Years in the Province of Gipuzkoa (Spain).

Children (Basel)

January 2025

Research Group in Physical Activity, Physical Exercise and Sport (AKTIBOki) and Society, Sport and Physical Activity (GIKAFIT) Research Group, Department of Physical Education and Sports, Faculty of Education and Sport, University of the Basque Country (UPV/EHU), 01007 Vitoria-Gasteiz, Spain.

This study aimed to develop a questionnaire to describe and diagnose the physical activity and sport (PAS) habits of 8-12-year-old schoolchildren, assessing its content, ecological validity and reliability, from a multidimensional perspective aligned with Global Matrix 4.0 indicators. The questionnaire design phase involved seven individuals from the university sector and sport managers from the Gipuzkoa Provincial Council.

