Item response theory (IRT) is the statistical paradigm underlying a dominant family of generative probabilistic models for test responses, used to quantify traits in individuals relative to target populations. The graded response model (GRM) is a particular IRT model that is used for ordered polytomous test responses. Both the development and the application of the GRM and other IRT models require statistical decisions. For formulating these models (calibration), one needs to decide on methodologies for item selection, inference, and regularization. For applying these models (test scoring), one needs to make similar decisions, often prioritizing computational tractability and/or interpretability. In many applications, such as in the Work Disability Functional Assessment Battery (WD-FAB), tractability implies approximating an individual's score distribution using estimates of mean and variance, and obtaining that score conditional on only point estimates of the calibrated model. In this manuscript, we evaluate the calibration and scoring of models under this common use-case using Bayesian cross-validation. Applied to the WD-FAB responses collected for the National Institutes of Health, we assess the predictive power of implementations of the GRM based on their ability to yield, on validation sets of respondents, ability estimates that are most predictive of patterns of item responses. Our main finding indicates that regularized Bayesian calibration of the GRM outperforms the regularization-free empirical Bayesian procedure of marginal maximum likelihood. We also motivate the use of compactly supported priors in test scoring.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8993025PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0266350PLOS

Publication Analysis

Top Keywords

regularized bayesian
8
bayesian calibration
8
calibration scoring
8
irt model
8
marginal maximum
8
maximum likelihood
8
models test
8
test responses
8
grm irt
8
test scoring
8

Similar Publications

Background: Multi-domain initiatives which target modifiable, lifestyle-associated, dementia risk factors are promising tools for dementia prevention. However, those at greatest risk of preventable dementia likely have the least capacity to enact change. Interventions may improve outcomes for those most vulnerable by looking up-stream.

View Article and Find Full Text PDF

Background: Elderly individuals living alone represent a vulnerable group with limited family support, making them more susceptible to mental health issues such as depression and anxiety. This study aims to construct a network model of depression and anxiety symptoms among older adults living alone, exploring the correlations and centrality of different symptoms. The goal is to identify core and bridging symptoms to inform clinical interventions.

View Article and Find Full Text PDF

The objectives of this study were to evaluate different machine learning algorithms for predicting body weight (BW) in Sujiang pigs using the following morphological traits: age, body length (BL), backfat thickness (BFT), chest circumference (CC), body height (BH), chest width (CW), and hip width (HW). Additionally, this study also investigated which machine learning algorithms could accurately and efficiently predict body weight in pigs using a limited set of morphological traits. For this purpose, morphological measurements of 365 mature (180 ± 5 days) Sujiang pigs from the Jiangsu Sujiang Pig Breeding Farm in Taizhou, Jiangsu Province, China were used.

View Article and Find Full Text PDF

Detecting brain tumours (BT) early improves treatment possibilities and increases patient survival rates. Magnetic resonance imaging (MRI) scanning offers more comprehensive information, such as better contrast and clarity, than any alternative scanning process. Manually separating BTs from several MRI images gathered in medical practice for cancer analysis is challenging and time-consuming.

View Article and Find Full Text PDF

Objective: Cancer survivors often face significant health-related quality of life (HRQoL) challenges. Although exercise has been proven to improve HRQoL in cancer survivors, the optimal dose and intensity of exercise for this population has not been fully determined. Adherence to exercise may vary based on exercise intensity, affecting results.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!