We examined the interrater reliability and generalizability of high-frequency oscillation (HFO) visual evaluations in the ripple (80-250 Hz) band, and established a framework for the transition of HFO analysis to routine clinical care. We were interested in the interrater reliability or epoch generalizability to describe how similar the evaluations were between reviewers, and in the reviewer generalizability to represent the consistency of the internal threshold each individual reviewer. We studied 41 adult epilepsy patients (mean age: 35.6 years) who underwent intracranial electroencephalography. A morphology detector was designed and used to detect candidate HFO events, lower-threshold events, and distractor events. These events were subsequently presented to six expert reviewers, who visually evaluated events for the presence of HFOs. Generalizability theory was used to characterize the epoch generalizability (interrater reliability) and reviewer generalizability (internal threshold consistency) of visual evaluations, as well as to project the numbers of epochs, reviewers, and datasets required to achieve strong generalizability (threshold of 0.8). The reviewer generalizability was almost perfect (0.983), indicating there were sufficient evaluations to determine the internal threshold of each reviewer. However, the interrater reliability for 6 reviewers (0.588) and pairwise interrater reliability (0.322) were both poor, indicating that the agreement of 6 reviewers is insufficient to reliably establish the presence or absence of individual HFOs. Strong interrater reliability (≥0.8) was projected as requiring a minimum of 17 reviewers, while strong reviewer generalizability could be achieved with <30 epoch evaluations per reviewer. This study reaffirms the poor reliability of using small numbers of reviewers to identify HFOs, and projects the number of reviewers required to overcome this limitation. It also provides a set of tools which may be used for training reviewers, tracking changes to interrater reliability, and for constructing a benchmark set of epochs that can serve as a generalizable gold standard, against which other HFO detection algorithms may be compared. This study represents an important step toward the reconciliation of important but discordant findings from HFO studies undertaken with different sets of HFOs, and ultimately toward transitioning HFO analysis into a meaningful part of the clinical epilepsy workup.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6031752PMC
http://dx.doi.org/10.3389/fneur.2018.00510DOI Listing

Publication Analysis

Top Keywords

interrater reliability
24
reviewer generalizability
16
internal threshold
12
generalizability
10
evaluations ripple
8
visual evaluations
8
epoch generalizability
8
threshold reviewer
8
interrater
6
reliability
6

Similar Publications

Purpose: This study investigates mental health-related content to delineate potentially deficient topics for improvement in future obstetrics and gynecology (OBGYN) resident educational curriculum initiatives.

Method: In this quantitative content analysis, educational resources commonly used by OBGYN residents were selected based on a 2020 multi-institutional survey of OBGYN residents and informal group discussion with 32 OBGYN residents from a New York academic institution in April 2020. After independent screening, the authors iteratively developed, tested, and implemented a coding scheme for relevant keywords.

View Article and Find Full Text PDF

Objectives: POCUS is a core emergency medicine skill and mainstay of early pregnancy assessment. The ultrasound competency assessment tool was developed as an entrustment-based assessment tool for use by content experts evaluating trainees performing multiple POCUS study types. The objective of this study was to evaluate the scoring and extrapolation inferences of the tool within Kane's validity framework when used to assess trainees performing an early pregnancy POCUS.

View Article and Find Full Text PDF

Background: Three-dimensional (3D) imaging enhances surgical planning and documentation in plastic surgery, but high costs limit accessibility. Mobile Light Detection and Ranging (LiDAR) technology offers a potential cost-effective alternative.

Objectives: To evaluate the accuracy and clinical utility of iPhone-based LiDAR scanning for breast measurements compared to traditional methods, and to establish standardized protocols for clinical implementation.

View Article and Find Full Text PDF

The Teitge test.

Dan Med J

November 2024

Sports Orthopedic Research Center - Copenhagen (SORC-C), Department of Orthopedic Surgery, Copenhagen University Hospital - Amager and Hvidovre Hospital, Denmark.

Introduction: High tibial osteotomy (HTO) is used to treat medial knee osteoarthritis (OA). A simple clinical test to select the patients most likely to benefit from the procedure was suggested by R. A.

View Article and Find Full Text PDF

Background: The Motricity Index (MI) is a commonly used method of measuring muscle strength in post-stroke hemiparesis. This study aimed to produce the MI Italian version (MI-IT) and assess its reliability in subjects with stroke.

Methods: Phase-1: stepwise approach to MI-IT production and pilot-testing with 10 health professionals to ensure clarity of each item and instructions for administration and scoring.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!