Transformer-Based Deep Neural Language Modeling for Construct-Specific Automatic Item Generation.

Psychometrika

Department of Work and Organizational Psychology, Institute of Psychology - Wilhelm Wundt, Leipzig University, Neumarkt 9-19, Leipzig, 04109, Germany.

Published: June 2022

Algorithmic automatic item generation can be used to obtain large quantities of cognitive items in the domains of knowledge and aptitude testing. However, conventional item models used by template-based automatic item generation techniques are not ideal for the creation of items for non-cognitive constructs. Progress in this area has been made recently by employing long short-term memory recurrent neural networks to produce word sequences that syntactically resemble items typically found in personality questionnaires. To date, such items have been produced unconditionally, without the possibility of selectively targeting personality domains. In this article, we offer a brief synopsis of past developments in natural language processing and explain why the automatic generation of construct-specific items has become attainable only through recent technological progress. We propose that pre-trained causal transformer models can be fine-tuned to achieve this task using implicit parameterization in conjunction with conditional generation. We demonstrate this method in a tutorial-like fashion and finally compare aspects of validity in human- and machine-authored items using empirical data. Our study finds that approximately two-thirds of the automatically generated items show good psychometric properties (factor loadings above .40) and that one-third even have properties equivalent to established and highly curated human-authored items. Our work thus demonstrates the practical use of deep neural networks for non-cognitive automatic item generation.
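One plausible reading of "implicit parameterization in conjunction with conditional generation" is that construct labels are written directly into the fine-tuning text, so that a causal language model learns to continue a label prefix with a matching item. The sketch below illustrates only the text formatting under that assumption. The delimiters `#` and `@`, the function names, and the example labels and items are hypothetical and are not taken from the article; the fine-tuning and sampling steps themselves are omitted.

```python
from typing import List

# Hypothetical delimiters (assumptions, not confirmed by the source).
LABEL_SEP = "#"   # precedes every construct label
ITEM_SEP = "@"    # separates the label prefix from the item text


def encode_example(constructs: List[str], item: str) -> str:
    """Serialize construct labels and an item stem into one training string."""
    labels = "".join(f"{LABEL_SEP}{name}" for name in constructs)
    return f"{labels}{ITEM_SEP}{item}"


def build_prompt(constructs: List[str]) -> str:
    """Build the prefix a fine-tuned model would be asked to continue."""
    return encode_example(constructs, "")


def decode_item(generated: str) -> str:
    """Recover the item text from a full generated sequence."""
    return generated.split(ITEM_SEP, 1)[1].strip()


# Illustrative training line (invented item, not from any real inventory).
training_line = encode_example(["Pessimism"], "I usually expect the worst.")

# Illustrative prompt targeting two constructs at once.
prompt = build_prompt(["Anxiety", "Pessimism"])
```

Under this scheme, training lines such as `training_line` would make up the fine-tuning corpus, and a string such as `prompt` would be passed to the fine-tuned model at generation time, with `decode_item` stripping the label prefix from whatever the model returns.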


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9166894
DOI: http://dx.doi.org/10.1007/s11336-021-09823-9


