The effect of measurement unreliability on sleep and respiratory variables.

Sleep

Health Services Research & Development Service, Veterans Affairs San Diego Healthcare System, San Diego, CA 92161, USA.

Published: August 2004

Introduction: Unreliability associated with scoring sleep variables is a potentially problematic issue in clinical and research studies. When scoring unreliability is unrecognized, it can contribute to the following: increase variability in the measures of interest, decrease a study's ability to detect important relationships, attenuate correlation coefficients, and increase clinical trial costs.

Methods: This paper first models the relationship between scoring variability and reliability in commonly studied sleep variables. The paper then models the relationship between unreliability and sample size requirements and statistical power. Standard methods are used to model reliability using the intraclass correlation coefficient.

Results: The analysis shows that when scoring unreliability is minimized (i.e., scoring reliability is maximized), correlation coefficients are more robust, sample size requirements are reduced, statistical power is increased, and clinical trial costs are reduced.

Discussion: When scoring unreliability is recognized, research studies can compensate by increasing the number of research subjects studied; however, it is at the cost of increasing the costs of research and exposing greater numbers of subjects to possible study risks. An effective solution is to implement rigorous initial and ongoing training efforts to maintain high inter-rater and intra-rater reliability coefficients.

Download full-text PDF

Source
http://dx.doi.org/10.1093/sleep/27.5.990DOI Listing

Publication Analysis

Top Keywords

scoring unreliability
12
sleep variables
8
correlation coefficients
8
clinical trial
8
paper models
8
models relationship
8
sample size
8
size requirements
8
statistical power
8
scoring
6

Similar Publications

Objective: Large language models (LLMs) have recently gained attention for clinical decision-making and diagnosis. This study evaluates the performance of the recently updated LLM (ChatGPT-4o) in predicting clinical outcomes in patients with status epilepticus and compares its prognostic performance to the Status Epilepticus Severity Score (STESS).

Methods: This retrospective single-center cohort study was performed at the University Hospital Basel (tertiary academic medical center) from January 2005 to December 2022.

View Article and Find Full Text PDF

Background: Interactive artificial intelligence tools such as ChatGPT have gained popularity, yet little is known about their reliability as a reference tool for healthcare-related information for healthcare providers and trainees. The objective of this study was to assess the consistency, quality, and accuracy of the responses generated by ChatGPT on healthcare-related inquiries.

Methods: A total of 18 open-ended questions including six questions in three defined clinical areas (2 each to address "what", "why", and "how", respectively) were submitted to ChatGPT v3.

View Article and Find Full Text PDF

Purpose: This study aimed to compare the effectiveness of 3 artificial intelligence language models-GPT-3.5, GPT-4o, and Gemini, in delivering patient-centered information about thyroid eye disease (TED). We evaluated their performance based on the accuracy and comprehensiveness of their responses to common patient inquiries regarding TED.

View Article and Find Full Text PDF

A Comprehensive Stereology Method for Quantitative Evaluation of Neuronal Injury, Neurodegeneration, and Neurogenesis in Brain Disorders.

Curr Protoc

December 2024

Department of Neuroscience and Experimental Therapeutics, Texas A&M University Health Science Center, College of Medicine, Bryan, Texas.

Neuronal injury, neurodegeneration, and neuroanatomical changes are key pathological features of various neurological disorders, including epilepsy, stroke, traumatic brain injury, Parkinson's disease, autism, and Alzheimer's disease. Accurate quantification of neurons and interneurons in different brain regions is critical for understanding the progression of neurodegenerative disorders in animal models. Traditional scoring methods are often superficial, biased, and unreliable for evaluating neuropathology.

View Article and Find Full Text PDF

Accurate multi-object tracking (MOT) is essential for autonomous vehicles, enabling them to perceive and interact with dynamic environments effectively. Single-modality 3D MOT algorithms often face limitations due to sensor constraints, resulting in unreliable tracking. Recent multi-modal approaches have improved performance but rely heavily on complex, deep-learning-based fusion techniques.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!