Inter-rater reliability in performance status assessment among health care professionals: a systematic review.

Ronald Chow Nicholas Chiu Eduardo Bruera Monica Krishnan Leonard Chiu Henry Lam Carlo DeAngelis Natalie Pulenzas Sherlyn Vuong Edward Chow

Ann Palliat Med

Sunnybrook Odette Cancer Centre, University of Toronto, Toronto, ON, Canada.

Published: April 2016

Background: Studies have reported that performance status (PS) is a good prognostic indicator in patients with advanced cancer. However, different health care professionals (HCPs) could grade PS differently. The purpose of this review is to investigate the PS scores evaluated by different HCPs as reported in the literature.

Methods: A literature search was conducted in Ovid MEDLINE and OLDMEDLINE from 1946 to Present (July 5, 2015), Embase Classic and Embase from 1947 to 2015 Week 26, and Cochrane Central Register of Controlled Trials up to May 2015. Information of interest was whether there was a difference of PS assessment between HCPs. Other statistical information provided to assess the agreement in ratings, such as Cohen's kappa coefficient, Krippendorff's alpha coefficient, Spearman Rank Coefficient, and Kendall's correlation, was noted.

Results: Of the fifteen articles, eleven compared PS assessments between HCPs of different disciplines, one between the attending and resident physician, two between similarly-specialized physicians, and one between two unspecified-specialty physicians. Three studies reported a lack of agreement (kappa =0.19-0.26; Krippendorff's alpha =0.61-0.63), four reported moderate inter-rater reliability (kappa =0.31-0.72), two reported mixed reliability, and six reported strong reliability (kappa =0.91-0.92; Spearman rank correlation =0.6-1.0; Kendall's correlation =0.75-0.82). Four studies reported that Karnofsky performance status (KPS) had better inter-rater reliability than both the Eastern Cooperative Oncology Group Performance Status (ECOG PS) and the palliative performance scale (PPS).

Conclusions: The existing literature cites both good and bad inter-rater reliability of PS scores. It is difficult to conclude which HCPs' PS assessments are more accurate.

Download full-text PDF	Source
http://dx.doi.org/10.21037/apm.2016.03.02	DOI Listing

Publication Analysis

Top Keywords

inter-rater reliability

performance status

studies reported

health care

care professionals

krippendorff's alpha

spearman rank

kendall's correlation

reliability kappa

reported

Similar Publications

Using artificial intelligence to evaluate adherence to best practices in one anastomosis gastric bypass: first steps in a real-world setting.

Surg Endosc

January 2025

Division of General Surgery, Bariatric Unit, Tel Aviv Medical Center, Affiliated to Sackler Faculty of Medicine, Tel Aviv University, 6, Weizman St, 6423906, Tel- Aviv, Israel.

Danit Dayan Eran Nizri Andrei Keidar

Background: Safety in one anastomosis gastric bypass (OAGB) is judged by outcomes, but it seems reasonable to utilize best practices for safety, whose performance can be evaluated and therefore improved. We aimed to test an artificial intelligence-based model in real world for the evaluation of adherence to best practices in OAGB.Please check and confirm that the authors and their respective affiliations have been correctly identified and amend if necessary.

View Article and Find Full Text PDF

Similar Publications

Concurrent Validity and Reliability of In-Person and Supervised Remote STEADI Fall Risk Assessment in Community-Dwelling Older Adults.

J Geriatr Phys Ther

January 2025

Department of Physical Therapy, University of St. Augustine for Health Sciences, St. Augustine, Florida.

Amie Jasper Rania Karim Arvie C Vitente Carmina Minnie Rafael Eleazar Tayag

Background And Purpose: Physical therapists play a vital role in preventing and managing falls in older adults. With advancements in digital health and technology, community fall prevention programs need to adopt valid and reliable telehealth-based assessments. The purpose of this study was to evaluate the validity and reliability of the telehealth-based timed up and go (TUG) test, 30-second chair stand test (30s-CST), and four-stage (4-stage) balance test as functional components of the Stopping Elderly Accidents, Deaths, and Injuries (STEADI) fall risk assessment.

View Article and Find Full Text PDF

Similar Publications

Novel approach for noninvasive pelvic floor muscle strength measurement using extracorporeal surface perineal pressure measurement and machine learning modeling.

Digit Health

January 2025

Department of Obstetrics and Gynecology, Seoul National University College of Medicine, Jongno-gu, Seoul, Korea.

Ui-Jae Hwang Sun-Hee Ahn Hyeon-Ju Lee Yurin Jeon Myung Jae Jeon

Objective: Accurate measurement of pelvic floor muscle (PFM) strength is crucial for the management of pelvic floor disorders. However, the current methods are invasive, uncomfortable, and lack standardization. This study aimed to introduce a novel noninvasive approach for precise PFM strength quantification by leveraging extracorporeal surface perineal pressure (ESPP) measurements and machine learning algorithms.

View Article and Find Full Text PDF

Similar Publications

Reliability of the Alberta Infant Motor Scale (AIMS) When Used via Telehealth for Neurodevelopmentally High-Risk Infants.

Phys Occup Ther Pediatr

January 2025

Physiotherapy Program, School of Health Sciences, Western Sydney University, NSW, Australia.

Serena Davies Barbara R Lucas Genevieve M Dwyer

Aims: To assess the reliability of the Alberta Infant Motor Scale (AIMS) when conducted recorded telehealth sessions by novice and expert raters.

Methods: Ten assessors (six novice, four expert) independently rated recorded telehealth assessments of 23 neurodevelopmentally high-risk infants twice. Inter- and intra-rater reliability of subscale scores, total score and percentile rankings were determined.

View Article and Find Full Text PDF

Similar Publications

Handheld dynamometry testing during dialysis: intra and inter-rater reliability study.

J Ren Nutr

January 2025

Department of Nursing and Physiotherapy, Universidad Cardenal Herrera-CEU, CEU Universities, C/ Santiago Ramón y Cajal s/n 46115, Alfara del Patriarca, Valencia, Spain.

Eva Segura-Orti Anna Junqué-Jiménez Valentin Dubuis Maycon M Reboredo Vicente Benavent-Caballer

Objective: The aim was to assess the intra and inter-rater reliability of the handheld dynamometry testing of lower limb muscles during hemodialysis.

Methods: This is a cross-sectional study including subjects undertaking hemodialysis for at least 3 months. Handheld dynamometer measurements of hip and ankle muscle strength (N) were registered on 4 different occasions, 2 trials by raters A and 2 by raters B, to evaluate the intra and inter-rater reliability.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!