On understanding reliability for diagnostic tests.

Interv Pain Med

The University of Newcastle, PO Box 431, East Maitland, NSW, 2323, Australia.

Published: August 2022

For professional practice to be responsible, any diagnostic tests used must be reliable. Therefore, the reliability of any diagnostic test needs to have been measured. The classical statistic for quantifying reliability is Kappa. Although Kappa can be promptly determined using a programmed calculator, using an algorithm to derive Kappa provides greater insight into what it is actually measuring and why. Kappa scores can be graded, with verbal descriptor applied to different grades. However, those descriptors do not necessarily reflect the degree of skill required to achieve different grades of Kappa. High levels of skill attract high Kappa scores, but Kappa scores described as fair or moderate are not necessarily flattering because they can be achieved with questionable levels of skill. Various corrections can be applied to the calculation of Kappa scores in order to raise their value, and to improve the verbal descriptors of their grade, but these may not be legitimate or necessary. Low Kappa scores do not condemn tests but they serve to raise questions about their reliability.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11372993	PMC
http://dx.doi.org/10.1016/j.inpm.2022.100124	DOI Listing

Publication Analysis

Top Keywords

kappa scores

kappa

reliability diagnostic

diagnostic tests

levels skill

scores

understanding reliability

tests professional

professional practice

practice responsible

Similar Publications

Sequential testing with Xpert MTB/RIF assay for diagnosis of tuberculous meningitis in Maharaj Nakorn Chiang Mai University Hospital.

Sci Rep

January 2025

Division of Infectious Diseases and Tropical Medicine, Department of Internal Medicine, Faculty of Medicine, Chiang Mai University, 110 Intavaroros Rd., Muaeng, Chiang Mai, 50200, Thailand.

Poramed Winichakoon Kanokwan Watcharasaksilp Sunisa Butphet Kanlaya Wongworapat Chansom Pantip

Early diagnosis and appropriate treatment are essential for reducing morbidity and mortality in tuberculous meningitis (TBM). This study aimed to evaluate the diagnostic performance of the Xpert MTB/RIF assay for the diagnosis of TBM in patients with subacute lymphocytic meningitis. This cross-sectional study included 65 cerebrospinal fluid (CSF) specimens from patients at Maharaj Nakorn Chiang Mai University Hospital, Thailand, between January 2015 and March 2016.

View Article and Find Full Text PDF

Similar Publications

The value of MRI in differentiating ovarian clear cell carcinoma from other adnexal masses with O-RADS MRI scores of 4-5.

Insights Imaging

January 2025

Department of Radiology, Shanghai First Maternity and Infant Hospital, School of Medicine, Tongji University, Shanghai, China.

Lingling Lin Le Fu Huawei Wu Saiming Cheng Guangquan Chen

Objective: To assess the utility of clinical and MRI features in distinguishing ovarian clear cell carcinoma (CCC) from adnexal masses with ovarian-adnexal reporting and data system (O-RADS) MRI scores of 4-5.

Methods: This retrospective study included 850 patients with indeterminate adnexal masses on ultrasound. Two radiologists evaluated all preoperative MRIs using the O-RADS MRI risk stratification system.

View Article and Find Full Text PDF

Similar Publications

Comparison between Global Leadership Initiative on Malnutrition criteria and protein-energy wasting in patients with kidney failure undergoing peritoneal dialysis.

J Ren Nutr

January 2025

Coordinación de Nutrición Clínica, Departamento de Áreas Críticas, Instituto Nacional de Enfermedades Respiratorias, Ciudad de México.

Gabriela Leal-Escobar Annabel Biruete Karla Berenice Cano-Escobar Magdalena Madero Ivan Armando Osuna-Padilla

Background: Protein-energy wasting (PEW) is the chronic kidney disease (CKD)-specific diagnosis encompassing malnutrition. PEW is associated with adverse outcomes, including those receiving peritoneal dialysis (PD). Identifying PEW requires accurate methods to improve diagnosis.

View Article and Find Full Text PDF

Similar Publications

Clinician-reported Gloucester Comfort Scale scores underestimate patient discomfort and pain during colonoscopy: insights from comparison with a patient-reported experience measure.

Endoscopy

January 2025

Gastroenterology and Hepatology, Amsterdam UMC Location VUmc, Amsterdam, Netherlands.

Querijn N E van Bokhorst Charmayne V Geerlings Manon van der Vlugt Karlijn J Nass Jos W Borkent

Background: Patient experience is a fundamental element of colonoscopy. The Gloucester Comfort Scale (GCS) is used by clinicians to report patient comfort. However, insights regarding the extent to which clinician-reported GCS scores represent the patient's experience are lacking.

View Article and Find Full Text PDF

Similar Publications

Quality assessment of clinical guidelines in the care of laryngitis and pharyngitis according to AGREE II.

Codas

January 2025

Departamento de Farmácia, Universidade de Brasília - UnB - Brasília (DF), Brasil.

Lucas Mateus Rodrigues Carvalho Ana Paula de Oliveira Barbosa Nara Amanda Laismann Débora Santos Lula Barros Rodrigo Fonseca Lima

Purpose: The study aimed to identify and assess the methodological quality of essential clinical guidelines for the management of laryngitis and pharyngitis.

Methods: A systematic search of clinical guidelines for the management of laryngitis and pharyngitis was performed in three databases. Methodological quality was assessed according to AGREE II, in which each item in its domains was scored by four independent evaluators.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!