Prediction of true test scores from observed item scores and ancillary data.

Br J Math Stat Psychol

ETS, Princeton, New Jersey, USA.

Published: May 2015

In many educational tests that involve constructed responses, the traditional test score is the sum of item scores assigned holistically by trained human raters. For example, this practice was used until 2008 for GRE® General Analytical Writing and until 2009 for TOEFL® iBT Writing. Natural language processing makes it possible to obtain additional information about item responses from computer programs such as e-rater®. Other available information relevant to examinee performance may include scores on related tests. We suggest applying standard results from classical test theory to the available data to obtain best linear predictors of true traditional test scores. Such analysis requires estimates of the variances and covariances of measurement errors, which can be quite difficult to obtain for tests with few items and multiple measurements per item. We therefore suggest a new estimation method based on samples of examinees who have taken an assessment more than once. Because such samples are typically not random samples of the general population of examinees, we apply statistical adjustment methods to obtain the needed estimates of the variances and covariances of measurement errors. To examine the practical implications of the suggested methods, we apply them to GRE General Analytical Writing and TOEFL iBT Writing. The results indicate that substantial improvements are possible in both reliability of scoring and assessment reliability.
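The best linear predictor mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and it assumes the standard classical test theory setup in which measurement errors are uncorrelated with true scores, so that the covariance between the target true score and each observed measure equals the observed covariance minus the error covariance. The function names (`solve`, `blp_weights`, `predict_true_score`) are hypothetical, and the sketch takes the error covariances as given; it does not cover the repeater-sample estimation and adjustment method the paper proposes.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for k in range(col, n + 1):
                m[r][k] -= factor * m[col][k]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(m[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def blp_weights(cov_obs, cov_err, target=0):
    """Weights of the best linear predictor of the target measure's true score.

    cov_obs: covariance matrix of the observed measures (e.g. human score,
             automated score, related test score).
    cov_err: covariance matrix of their measurement errors (assumed known here).
    Assumption: errors are uncorrelated with true scores, so
    Cov(true target, observed j) = cov_obs[target][j] - cov_err[target][j].
    """
    n = len(cov_obs)
    cov_true_obs = [cov_obs[target][j] - cov_err[target][j] for j in range(n)]
    # cov_obs is symmetric, so solving cov_obs w = c gives w = cov_obs^{-1} c.
    return solve(cov_obs, cov_true_obs)


def predict_true_score(x, mean_obs, cov_obs, cov_err, target=0):
    """Predicted true score: mean of the target plus weighted deviations from the means."""
    w = blp_weights(cov_obs, cov_err, target)
    return mean_obs[target] + sum(wj * (xj - mj) for wj, xj, mj in zip(w, x, mean_obs))
```

With a single observed measure the weight reduces to the reliability coefficient, (observed variance minus error variance) divided by observed variance, which is Kelley's formula. Any covariance values passed to these functions in practice would have to come from estimates such as those the paper describes; none are taken from the paper here.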


Source
http://dx.doi.org/10.1111/bmsp.12052


