Assessing inter-rater reliability when the raters are fixed: Two concepts and two estimates.

Biom J

Statistical Unit, Institute for Social and Preventive Medicine, University Hospital Center Lausanne, Route de la Corniche 2, CH-1066 Epalinges, Switzerland.

Published: May 2011

Intraclass correlation (ICC) is an established tool to assess inter-rater reliability. In a seminal paper published in 1979, Shrout and Fleiss considered three statistical models for inter-rater reliability data with a balanced design. In their first two models, an infinite population of raters was considered, whereas in their third model, the raters in the sample were considered to be the whole population of raters. In the present paper, we show that the two distinct estimates of ICC developed for the first two models can both be applied to the third model and we discuss their different interpretations in this context.

Download full-text PDF

Source
http://dx.doi.org/10.1002/bimj.201000066DOI Listing

Publication Analysis

Top Keywords

inter-rater reliability
12
population raters
8
third model
8
assessing inter-rater
4
raters
4
reliability raters
4
raters fixed
4
fixed concepts
4
concepts estimates
4
estimates intraclass
4

Similar Publications

Objectives: This article aims to evaluate the use and effects of an artificial intelligence system supporting a critical diagnostic task during radiology resident training, addressing a research gap in this field.

Materials And Methods: We involved eight residents evaluating 150 CXRs in three scenarios: no AI, on-demand AI, and integrated-AI. The considered task was the assessment of a multi-regional severity score of lung compromise in patients affected by COVID-19.

View Article and Find Full Text PDF

Objective: Spinopelvic sagittal balance ensures efficient posture and minimizes energy expenditure by aligning the spine, pelvis, and lower extremities. Deviations can cause clinical issues like back pain and functional limitations. Key radiographic parameters, including pelvic tilt (PT), pelvic incidence (PI), sacral slope (SS), and lumbar lordosis (LL), are essential for evaluating spinal pathologies and planning surgeries.

View Article and Find Full Text PDF

Serum KNG and FVIII may serve as potential biomarkers for depression.

Behav Brain Res

January 2025

Department of Psychiatry, Renmin Hospital of Wuhan University, Wuhan 430060, Hubei, PR China; Department of Psychiatry and Institute of Neuropsychiatry, Renmin Hospital of Wuhan University, Wuhan 430060, Hubei, PR China; Taikang Center for Life and Medical Sciences, Wuhan University, Wuhan, 430071, China. Electronic address:

Background: The global burden of major depressive disorder (MDD) is rising, with current diagnostic methods hindered by significant subjectivity and low inter-rater reliability. Several studies have implied underlying link between coagulation-related proteins, such as kininogen (KNG) and coagulation factor VIII (FVIII), and depressive symptoms, offering new insights into the exploration of depression biomarkers. This study aims to elucidate the roles of KNG and FVIII in depression, potentially providing a foundational basis for biomarker research in this field.

View Article and Find Full Text PDF

Background And Objectives: Telemedicine has become a mainstay of ALS clinical care, but there is currently no standardized approach for assessing and tracking changes to the neurologic examination in this format. The goal of this study was to create a standardized telemedicine-based motor examination scale to objectively and reliably track ALS progression and use Rasch methodology to validate the scale and improve its psychometric properties.

Methods: A draft telemedicine examination scale with 25 items assessing movement in the bulbar muscles, neck, trunk, and extremities was created by an ALS expert panel, incorporating input from patient advisors.

View Article and Find Full Text PDF

Background: Telestroke assessments are widely used to remotely assess adults with suspected stroke, although they have not been studied in children. SPOT, the Study of Performing the PedNIHSS Over Televideo, tested the feasibility of assessing the Pediatric National Institutes of Health Stroke Scale (PedNIHSS) by televideo in children.

Methods: Children aged 2 to 17 years with and without strokes were recruited and examined in the outpatient neurology clinic.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!