Background: Studies have reported that performance status (PS) is a good prognostic indicator in patients with advanced cancer. However, different health care professionals (HCPs) could grade PS differently. The purpose of this review is to investigate the PS scores evaluated by different HCPs as reported in the literature.

Methods: A literature search was conducted in Ovid MEDLINE and OLDMEDLINE from 1946 to Present (July 5, 2015), Embase Classic and Embase from 1947 to 2015 Week 26, and Cochrane Central Register of Controlled Trials up to May 2015. Information of interest was whether there was a difference of PS assessment between HCPs. Other statistical information provided to assess the agreement in ratings, such as Cohen's kappa coefficient, Krippendorff's alpha coefficient, Spearman Rank Coefficient, and Kendall's correlation, was noted.

Results: Of the fifteen articles, eleven compared PS assessments between HCPs of different disciplines, one between the attending and resident physician, two between similarly-specialized physicians, and one between two unspecified-specialty physicians. Three studies reported a lack of agreement (kappa =0.19-0.26; Krippendorff's alpha =0.61-0.63), four reported moderate inter-rater reliability (kappa =0.31-0.72), two reported mixed reliability, and six reported strong reliability (kappa =0.91-0.92; Spearman rank correlation =0.6-1.0; Kendall's correlation =0.75-0.82). Four studies reported that Karnofsky performance status (KPS) had better inter-rater reliability than both the Eastern Cooperative Oncology Group Performance Status (ECOG PS) and the palliative performance scale (PPS).

Conclusions: The existing literature cites both good and bad inter-rater reliability of PS scores. It is difficult to conclude which HCPs' PS assessments are more accurate.

Download full-text PDF

Source
http://dx.doi.org/10.21037/apm.2016.03.02DOI Listing

Publication Analysis

Top Keywords

inter-rater reliability
16
performance status
16
studies reported
12
health care
8
care professionals
8
krippendorff's alpha
8
spearman rank
8
kendall's correlation
8
reliability kappa
8
reported
7

Similar Publications

Using artificial intelligence to evaluate adherence to best practices in one anastomosis gastric bypass: first steps in a real-world setting.

Surg Endosc

January 2025

Division of General Surgery, Bariatric Unit, Tel Aviv Medical Center, Affiliated to Sackler Faculty of Medicine, Tel Aviv University, 6, Weizman St, 6423906, Tel- Aviv, Israel.

Background: Safety in one anastomosis gastric bypass (OAGB) is judged by outcomes, but it seems reasonable to utilize best practices for safety, whose performance can be evaluated and therefore improved. We aimed to test an artificial intelligence-based model in real world for the evaluation of adherence to best practices in OAGB.Please check and confirm that the authors and their respective affiliations have been correctly identified and amend if necessary.

View Article and Find Full Text PDF

Background And Purpose: Physical therapists play a vital role in preventing and managing falls in older adults. With advancements in digital health and technology, community fall prevention programs need to adopt valid and reliable telehealth-based assessments. The purpose of this study was to evaluate the validity and reliability of the telehealth-based timed up and go (TUG) test, 30-second chair stand test (30s-CST), and four-stage (4-stage) balance test as functional components of the Stopping Elderly Accidents, Deaths, and Injuries (STEADI) fall risk assessment.

View Article and Find Full Text PDF

Objective: Accurate measurement of pelvic floor muscle (PFM) strength is crucial for the management of pelvic floor disorders. However, the current methods are invasive, uncomfortable, and lack standardization. This study aimed to introduce a novel noninvasive approach for precise PFM strength quantification by leveraging extracorporeal surface perineal pressure (ESPP) measurements and machine learning algorithms.

View Article and Find Full Text PDF

Aims: To assess the reliability of the Alberta Infant Motor Scale (AIMS) when conducted recorded telehealth sessions by novice and expert raters.

Methods: Ten assessors (six novice, four expert) independently rated recorded telehealth assessments of 23 neurodevelopmentally high-risk infants twice. Inter- and intra-rater reliability of subscale scores, total score and percentile rankings were determined.

View Article and Find Full Text PDF

Handheld dynamometry testing during dialysis: intra and inter-rater reliability study.

J Ren Nutr

January 2025

Department of Nursing and Physiotherapy, Universidad Cardenal Herrera-CEU, CEU Universities, C/ Santiago Ramón y Cajal s/n 46115, Alfara del Patriarca, Valencia, Spain.

Objective: The aim was to assess the intra and inter-rater reliability of the handheld dynamometry testing of lower limb muscles during hemodialysis.

Methods: This is a cross-sectional study including subjects undertaking hemodialysis for at least 3 months. Handheld dynamometer measurements of hip and ankle muscle strength (N) were registered on 4 different occasions, 2 trials by raters A and 2 by raters B, to evaluate the intra and inter-rater reliability.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!