Transformer-Based Deep Neural Language Modeling for Construct-Specific Automatic Item Generation.

Björn E Hommel Franz-Josef M Wollang Veronika Kotova Hannes Zacher Stefan C Schmukle

Psychometrika

Department of Work and Organizational Psychology, Institute of Psychology - Wilhelm Wundt, Leipzig University, Neumarkt 9-19, Leipzig, 04109, Germany.

Published: June 2022

Algorithmic automatic item generation can be used to obtain large quantities of cognitive items in the domains of knowledge and aptitude testing. However, conventional item models used by template-based automatic item generation techniques are not ideal for the creation of items for non-cognitive constructs. Progress in this area has been made recently by employing long short-term memory recurrent neural networks to produce word sequences that syntactically resemble items typically found in personality questionnaires. To date, such items have been produced unconditionally, without the possibility of selectively targeting personality domains. In this article, we offer a brief synopsis on past developments in natural language processing and explain why the automatic generation of construct-specific items has become attainable only due to recent technological progress. We propose that pre-trained causal transformer models can be fine-tuned to achieve this task using implicit parameterization in conjunction with conditional generation. We demonstrate this method in a tutorial-like fashion and finally compare aspects of validity in human- and machine-authored items using empirical data. Our study finds that approximately two-thirds of the automatically generated items show good psychometric properties (factor loadings above .40) and that one-third even have properties equivalent to established and highly curated human-authored items. Our work thus demonstrates the practical use of deep neural networks for non-cognitive automatic item generation.
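As a rough illustration of conditional generation, a construct label can be prepended to each item so that a fine-tuned causal language model learns to continue a label-only prompt with a matching item. The sketch below covers only the string encoding, not the fine-tuning itself. The delimiter characters, function names, and example items are assumptions made for illustration and are not taken from the abstract.

```python
# Minimal sketch of label-conditioned training strings for a causal language model.
# Delimiters, helper names, and example items are hypothetical; the abstract
# does not specify the encoding actually used.
LABEL_START = "#"
LABEL_END = "@"
ITEM_END = "<|endoftext|>"


def encode_example(construct: str, item: str) -> str:
    """Concatenate a construct label and an item stem into one training string."""
    label = construct.strip().lower()
    return f"{LABEL_START}{label}{LABEL_END}{item.strip()}{ITEM_END}"


def build_prompt(construct: str) -> str:
    """Build a label-only prompt; a fine-tuned model would continue it with an item."""
    return f"{LABEL_START}{construct.strip().lower()}{LABEL_END}"


def decode_item(generated: str) -> tuple[str, str]:
    """Split a generated sequence back into (construct label, item stem)."""
    body = generated.split(ITEM_END, 1)[0]
    label, _, item = body.partition(LABEL_END)
    return label.lstrip(LABEL_START), item.strip()


# Illustrative (construct, item) pairs used only to show the format.
training_data = [
    ("Extraversion", "I am the life of the party."),
    ("Neuroticism", "I worry about things."),
]
corpus = [encode_example(c, i) for c, i in training_data]
prompt = build_prompt("Extraversion")
```

Because every training string begins with its label, the prompt returned by `build_prompt` is a prefix of the corresponding training examples, which is what lets the model condition its continuation on the requested construct.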

PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9166894
DOI: http://dx.doi.org/10.1007/s11336-021-09823-9

Similar Publications

Deep Learning-Derived Quantitative Scores for Chronic Rhinosinusitis Assessment: Correlation With Quality of Life Outcomes.

Am J Rhinol Allergy

January 2025

Department of Radiology, Hangzhou First People's Hospital, Hangzhou, P. R. China.

Zhefan Shen Ying Wei Kexin Liu Zhiqi Ma Zhiliang Zhang

Background: Computed tomography (CT) plays a crucial role in assessing chronic rhinosinusitis, but lacks objective quantifiable indicators.

Objective: This study aimed to use deep learning for automated sinus segmentation to generate distinct quantitative scores and explore their correlations with disease-specific quality of life.

Methods: From July 2021 to August 2022, 445 CT datasets were collected from two medical centers.

Verification of the Reliability of an Automated Urine Test Strip Colorimetric Program Using Colorimetric Analysis: Survey Study.

JMIR Form Res

January 2025

Hamamatsu University School of Medicine, Hamamatsu City, Chuo-ku, Japan.

Keigo Inagaki Daisuke Tsuriya Takuya Hashimoto Katsumasa Nakamura

Background: Urine test strips are one method for noninvasive and simple urinary microalbumin testing. However, when urine test strips are assessed visually, accurate assessment may be difficult because of environmental influences (such as lighting color and intensity) and the physical and psychological state of the assessor. These factors make an objective assessment harder to reach.

Use of assistive technology to assess distal motor function in subjects with neuromuscular disease.

PLOS Digit Health

January 2025

Centre Référent Maladies Rares Neuromusculaires, Service de Médecine Physique et de Réadaptation Pédiatrique des Hospices Civils de Lyon - Hôpital Femme Mère Enfant, Bron, France.

Dominique Vincent-Genod Sylvain Roche Aurélie Barrière Capucine de Lattre Marie Tinat

Among the 32 items of the Motor Function Measure scale, 3 assess hand function on a paper-based support. Their characteristics suggest that a tablet could replace the original paper-based support for completing them. This would make it possible to automate scoring and thereby reduce intra- and inter-individual variability.

The equivalent value (EV)-based workload assessment of primary healthcare workers in Beijing, China.

Hum Resour Health

January 2025

Health Development Research Department, Capital Institute of Pediatrics, Beijing, 100020, People's Republic of China.

Shasha Yuan Tao Yin Naijie Weng Zheng Wang Delu Yin

Background: Quantitative methods for estimating the workload of primary healthcare (PHC) workers are essential for improving the performance of PHC institutions. However, measuring the workload of PHC workers is challenging due to the diverse and complex range of services covered by PHC. This study aims to use an equivalent value (EV)-based approach to assess the workload of PHC workers and inform policymakers about the current workload burden in Beijing, China.

Analyzing gait data measured by wearable cyborg hybrid assistive limb during assisted walking: gait pattern clustering.

Front Med Technol

December 2024

Institute of Systems and Information Engineering, University of Tsukuba, Tsukuba, Japan.

Yasuko Namikawa Hiroaki Kawamoto Akira Uehara Yoshiyuki Sankai

Introduction: The wearable cyborg Hybrid Assistive Limb (HAL) is a therapeutic exoskeletal device that provides voluntary gait assistance using kinematic/kinetic gait data and bioelectrical signals. By utilizing the gait data automatically measured by HAL, we are developing a system to analyze the wearer's gait during the intervention, unlike conventional evaluations that compare pre- and post-treatment gait test results. Despite the potential use of the gait data from the HAL's sensor information, there is still a lack of analysis using such gait data and knowledge of gait patterns during HAL use.
