Estimation and inference are two key components toward the solution of any statistical problem; however, the inferential issues of statistical assessment of agreement among two or more raters have not been well developed as compared to the development of estimation procedures in this area. The fundamental reason for this gap is the complex expression of the concordance correlation coefficient (CCC) that is frequently used in assessing agreement among raters. Large sample-based statistical tests for CCC often fail to produce desired results for small samples. Hence, inferential procedures for small samples are urgently needed to evaluate agreement between raters. We argue that hypothesis testing of CCC has little value in practice due to the absence of a gold standard of agreement. In this article, we construct the generalized confidence interval (GCI) for CCC utilizing a bivariate normal distribution of measurements, and also develop a large sample-based confidence interval (LSCI). We establish satisfactory performance of GCI by providing the desired coverage probability (CP) via simulation. Results of GCI and LSCI are illustrated and compared with a data set of a recent study performed at U.S. Department of Veterans Affairs, Hines.

Download full-text PDF

Source
http://dx.doi.org/10.1002/sim.8899DOI Listing

Publication Analysis

Top Keywords

agreement raters
16
confidence interval
12
generalized confidence
8
large sample-based
8
small samples
8
agreement
5
interval agreement
4
raters
4
raters estimation
4
estimation inference
4

Similar Publications

Evaluation of a Deep Learning Denoising Algorithm for Dose Reduction in Whole-Body Photon-Counting CT Imaging: A Cadaveric Study.

Acad Radiol

January 2025

Department of Radiology, University Hospital Tuebingen, Hoppe-Seyler-Str. 3, 72076 Tuebingen, Germany (R.D., J.M.B., B.S., J.M., S.G., P.K., S.W., J.H., K.N., S.A., A.B.).

Rationale And Objectives: Photon Counting CT (PCCT) offers advanced imaging capabilities with potential for substantial radiation dose reduction; however, achieving this without compromising image quality remains a challenge due to increased noise at lower doses. This study aims to evaluate the effectiveness of a deep learning (DL)-based denoising algorithm in maintaining diagnostic image quality in whole-body PCCT imaging at reduced radiation levels, using real intraindividual cadaveric scans.

Materials And Methods: Twenty-four cadaveric human bodies underwent whole-body CT scans on a PCCT scanner (NAEOTOM Alpha, Siemens Healthineers) at four different dose levels (100%, 50%, 25%, and 10% mAs).

View Article and Find Full Text PDF

Purpose: Arthrogenic muscle inhibition (AMI) is a reflexive shutdown of the quadriceps muscles following a knee injury or surgery that presents with or without hamstring contracture. This complication can be classified according to the SANTI classification, but the reproducibility of this clinical classification has not yet been demonstrated.

Methods: This single-centre longitudinal observational study included 140 patients who were within 6 weeks of an ACL rupture.

View Article and Find Full Text PDF

Background: Limitations in thumb radial abduction (i.e., carpometacarpal extension) are commonly experienced by persons with thumb carpometacarpal osteoarthritis.

View Article and Find Full Text PDF

Background Context: Clinical outcomes are directly related to patient selection and treatment indications for improved quality of life. With emphasis on quality and value, it is essential that treatment recommendations are optimized.

Purpose: The purpose of the North American Spine Society (NASS) Appropriate Use Criteria (AUC) is to determine the appropriate (ie, reasonable) multidisciplinary treatment recommendations for patients with metastatic neoplastic vertebral fractures across a spectrum of more common clinical scenarios.

View Article and Find Full Text PDF

Introduction: This study examines the ability of human readers, recurrence quantification analysis (RQA), and an online artificial intelligence (AI) detection tool (GPTZero) to distinguish between AI-generated and human-written personal statements in physical therapist education program applications.

Review Of Literature: The emergence of large language models such as ChatGPT and Google Gemini has raised concerns about the authenticity of personal statements. Previous studies have reported varying degrees of success in detecting AI-generated text.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!