Reliability generalization (RG) is a meta-analytic technique that allows for the systematic examination of variation in score reliability for different samples of test takers; this procedure is based on the recognition that reliability is not a stable property of a test but is sample dependent. As a demonstration of an RG analysis, I obtained 63 reliability coefficients for each of the MMPI-2 (Butcher et al., 2001) Personality Psychopathology 5 (Harkness, McNulty, & Ben-Porath, 1995) scales. The overall variability of alpha coefficients supports the argument that reliability is sample dependent and underscores the need for researchers to calculate reliability estimates based on their research samples rather than simply citing published alpha coefficients as evidence of score reliability. I observed statistically significant mean reliability differences for scores across the 5 scales, with the highest level of reliability observed for scores on the measure of Negative Emotionality and the lowest levels of reliability observed for scores on the measures of Aggression and Disconstraint. There was no evidence that the sex-composition of a sample was systematically related to score reliability, and there were no statistically significant differences in reliability between scores obtained with the English version of the test and those obtained with translated forms. However, reliability was consistently lower for scores on some scales when the data were obtained in nonclinical settings as opposed to clinical ones. Sample size was not significantly correlated with reliability estimates. RG methods have the potential for deepening the level of understanding about the role of reliability in the evaluation and use of personality tests.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1080/00223890701293908 | DOI Listing |
J Med Internet Res
January 2025
Institute of Medical Teaching and Medical Education Research, University Hospital Würzburg, Würzburg, Germany.
Background: Objective structured clinical examinations (OSCEs) are a widely recognized and accepted method to assess clinical competencies but are often resource-intensive.
Objective: This study aimed to evaluate the feasibility and effectiveness of a virtual reality (VR)-based station (VRS) compared with a traditional physical station (PHS) in an already established curricular OSCE.
Methods: Fifth-year medical students participated in an OSCE consisting of 10 stations.
J Med Internet Res
January 2025
Department of Psychiatry, Yongin Severance Hospital, Yongin, Republic of Korea.
Background: The COVID-19 pandemic has accelerated the digitalization of modern society, extending digital transformation to daily life and psychological evaluation and treatment. However, the development of competencies and literacy in handling digital technology has not kept pace, resulting in a significant disparity among individuals. Existing measurements of digital literacy were developed before widespread information and communications technology device adoption, mainly focusing on one's perceptions of their proficiency and the utility of device operation.
View Article and Find Full Text PDFOtol Neurotol
February 2025
Department of Radiology, Yale School of Medicine, New Haven, CT.
Background: Vestibular schwannoma (VS) is a common intracranial tumor that affects patients' quality of life. Reliable imaging techniques for tumor volume assessment are essential for guiding management decisions. The study aimed to compare the ABC/2 method to the gold standard planimetry method for volumetric assessment of VS.
View Article and Find Full Text PDFOtol Neurotol
February 2025
Department of Radiology, Columbia University Irving Medical Center, New York, NY, USA.
Objective: To compare the diagnostic capability of Pöschl reformations created from temporal bone CT (TBCT) and high-resolution noncontrast CT head exams (HR-NECTH) to detect and classify superior semicircular canal (SSC) abnormalities.
Study Design: Retrospective case review.
Setting: Tertiary referral center.
Otol Neurotol
February 2025
Department of Otolaryngology-Head and Neck Surgery.
Objective: To compare fall risk scores of hearing aids embedded with inertial measurement units (IMU-HAs) and powered by artificial intelligence (AI) algorithms with scores by trained observers.
Study Design: Prospective, double-blinded, observational study of fall risk scores between trained observers and those of IMU-HAs.
Setting: Tertiary referral center.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!