Obstructive sleep apnea diagnosis is based on the manual scoring of respiratory events. The agreement in the manual scoring of the respiratory events lacks an in-depth investigation as most of the previous studies reported only the apnea-hypopnea index or overall agreement, and not temporal, second-by-second or event subtype agreement. We hypothesized the temporal and subtype agreement to be low because the event duration or subtypes are not generally considered in current clinical practice. The data comprised 50 polysomnography recordings scored by 10 experts. The respiratory event agreement between the scorers was calculated using kappa statistics in a second-by-second manner. Obstructive sleep apnea severity categories (no obstructive sleep apnea/mild/moderate/severe) were compared between scorers. The Fleiss' kappa value for binary (event/no event) respiratory event scorings was 0.32. When calculated separately within N1, N2, N3 and R, the Fleiss' kappa values were 0.12, 0.23, 0.22 and 0.23, respectively. Binary analysis conducted separately for the event subtypes showed the highest Fleiss' kappa for hypopneas to be 0.26. In 34% of the participants, the obstructive sleep apnea severity category was the same regardless of the scorer, whereas in the rest of the participants the category changed depending on the scorer. Our findings indicate that the agreement of manual scoring of respiratory events depends on the event type and sleep stage. The manual scoring has discrepancies, and these differences affect the obstructive sleep apnea diagnosis. This is an alarming finding, as ultimately these differences in the scorings affect treatment decisions.

Download full-text PDF

Source
http://dx.doi.org/10.1111/jsr.14391DOI Listing

Publication Analysis

Top Keywords

manual scoring
20
obstructive sleep
20
scoring respiratory
16
respiratory events
16
sleep apnea
16
agreement manual
12
fleiss' kappa
12
apnea diagnosis
8
subtype agreement
8
respiratory event
8

Similar Publications

Longitudinal trajectories of digital upper limb biomarkers for multiple sclerosis.

Eur J Neurol

January 2025

Department of Neuroscience, Central Clinical School, Monash University, Melbourne, Victoria, Australia.

Background: Upper limb dysfunction is a common debilitating feature of relapsing-remitting multiple sclerosis (RRMS). We aimed to examine the longitudinal trajectory of the iPad®-based Manual Dexterity Test (MDT) and predictors of change over time.

Methods: We prospectively enrolled RRMS patients (limited to Expanded Disability Status Scale (EDSS) < 4).

View Article and Find Full Text PDF

The transition period from automation to manual, known as the takeover process, presents challenges for drivers due to the deficiency in collecting requisite contextual information. The current study collected drivers' eye movement in a simulated takeover experiment, and their Situation Awareness (SA) was assessed using the Situation Awareness Global Assessment Technique (SAGAT) method. The drivers' Stationary Gaze Entropy (SGE) was calculated based on the percentages of time they spent on six pre-defined Areas of Interests (AOIs).

View Article and Find Full Text PDF

Background: Hip dysplasia (HD) is characterized by insufficient acetabular coverage of the femoral head, leading to a predisposition for osteoarthritis. While radiographic measurements such as the lateral center edge angle (LCEA) and Tönnis angle are essential in evaluating HD severity, patient-reported outcome measures (PROMs) offer insights into the subjective health impact on patients.

Aim: To investigate the correlations between machine-learning automated and manual radiographic measurements of HD and PROMs with the hypothesis that artificial intelligence (AI)-generated HD measurements indicating less severe dysplasia correlate with better PROMs.

View Article and Find Full Text PDF

Engineered feature embeddings meet deep learning: A novel strategy to improve bone marrow cell classification and model transparency.

J Pathol Inform

December 2024

Computer Imaging and Medical Application Laboratory, Universidad Nacional de Colombia, Bogotá 111321, Colombia.

Cytomorphology evaluation of bone marrow cell is the initial step to diagnose different hematological diseases. This assessment is still manually performed by trained specialists, who may be a bottleneck within the clinical process. Deep learning algorithms are a promising approach to automate this bone marrow cell evaluation.

View Article and Find Full Text PDF

Background: Generative artificial intelligence (AI) models that can produce photorealistic images from text descriptions have many applications in medicine, including medical education and the generation of synthetic data. However, it can be challenging to evaluate their heterogeneous outputs and to compare between different models. There is a need for a systematic approach enabling image and model comparisons.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!