Inter-rater reliability data of classroom observation: Fidelity in large-scale randomized research in education.

Data Brief

Center for Research and Development in Dual Language and Literacy Acquisition, Department of Educational Psychology, Texas A&M University, College Station, TX 77843, United States.

Published: April 2020

AI Article Synopsis

  • This dataset stems from a large-scale randomized controlled trial focused on improving English learning for students and enhancing teachers' instructional skills.
  • It consists of classroom observations where coders rated video segments of ESL classes, evaluating six instructional domains within 20-second intervals.
  • The dataset is structured for detailed analysis of inter-rater reliability among different coders and domains, supporting research on observational measures in educational contexts.

Article Abstract

This dataset comes from a large-scale randomized controlled trial (RCT) in educational research targeting English-learning students and their teachers' instructional capacity. The dataset includes ratings from classroom observations of 45-minute English as a Second Language (ESL) blocks. Each coder rated 60 recorded video segments collected from each teacher. For each 20-second segment, ratings of six domains of teachers' instruction (i.e., ESL Strategies, Group, Activity Structure, Mode, Language Content, Language of Teacher, Language of Student) were collected. The dataset is organized by teacher, by coder, and by domain, so that researchers can analyze inter-rater reliability among coders within a domain and/or across domains. This data article is related to the research article by Tong et al. [3], "The determination of appropriate coefficient indices for inter-rater reliability: using classroom observation instruments as fidelity measures in large-scale randomized research".
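As a rough illustration of the kind of analysis the abstract describes, the sketch below computes Cohen's kappa for two coders who rated the same segments in one domain. Cohen's kappa is only one common agreement index and is not necessarily the coefficient recommended in the related article. The function name `cohen_kappa`, the codes "T" and "S", and the toy ratings are hypothetical and are not taken from the dataset.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two coders assigning nominal codes to the same segments."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("need two equal-length, non-empty rating lists")
    n = len(ratings_a)
    # Observed agreement: share of segments where both coders gave the same code.
    p_o = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Chance agreement, from each coder's marginal code frequencies.
    freq_a, freq_b = Counter(ratings_a), Counter(ratings_b)
    p_e = sum(freq_a[code] * freq_b[code] for code in freq_a) / (n * n)
    if p_e == 1.0:
        # Both coders used one identical code throughout; kappa is undefined,
        # so this sketch reports full agreement by convention.
        return 1.0
    return (p_o - p_e) / (1 - p_e)


# Hypothetical ratings for ten segments in a single domain (made-up codes).
coder_1 = ["T", "T", "S", "S", "T", "S", "T", "T", "S", "S"]
coder_2 = ["T", "T", "S", "T", "T", "S", "T", "S", "S", "S"]

kappa = cohen_kappa(coder_1, coder_2)
print(f"kappa = {kappa:.2f}")  # 8 of 10 segments agree, chance agreement is 0.5
```

With ratings organized by teacher, coder, and domain, the same function could be applied to each coder pair within each domain; how the actual files are laid out is not stated here, so any loading step is left out.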

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7044508
DOI: http://dx.doi.org/10.1016/j.dib.2020.105303

Similar Publications

Validity and Reliability of the Videoconference-Based Berg Balance Scale in Stroke Survivors: The Tele-Berg Balance Scale.

J Neurol Phys Ther

December 2024

Department of Physical Therapy, Motor Control Laboratory (LADECOM), Centre of Healthy and Sport Sciences, University of Santa Catarina State, Florianópolis, Santa Catarina, Brazil.

Background And Purpose: Telerehabilitation represents an alternative for individuals who have difficulty accessing services to receive care. Therefore, telerehabilitation measures must be studied for their reliability and validity. This study evaluated the validity and reliability of the videoconference-based Berg Balance Scale assessment in stroke survivors.

Introduction: This study aimed to develop and validate an aesthetic grading tool (AGT) for bilateral DIEP flap breast reconstruction and investigate the correlation of BREAST-Q scores with perceived aesthetic outcomes.

Methods: The AGT utilized a Likert scale to rate aesthetic outcomes based on photographs of post-reconstruction breasts. The validation involved iterative testing with healthcare providers and patients.

Objective: To test the intra- and inter-rater reliability, measurement error, and criterion and convergent validity of the Dualpex Plus (DP) for vaginal manometry in women with urinary incontinence (UI).

Design: This is a clinimetric properties study.

Setting: University Hospital in Brazil.

Reliability agreement in foul and penalty judgements between officials in the Swedish hockey league.

Front Sports Act Living

December 2024

Department of Health Sciences, Swedish Winter Sports Research Centre, Mid Sweden University, Östersund, Sweden.

Introduction: Officials are essential to player safety and injury prevention, especially in contact team sports such as ice hockey, where numerous fast-paced, high-force contacts occur. If against the rules, these collisions can result in penalties. However, there is limited literature on the inter-rater reliability of the officials' decisions.

Background: Behavioural marker systems are used across several healthcare disciplines to assess behavioural (non-technical) skills, but rater training is variable, and inter-rater reliability is generally poor. Inter-rater reliability provides data about the tool, but not the competence of individual raters. This study aimed to test the inter-rater reliability of a new behavioural marker system (PhaBS - pharmacists' behavioural skills) with clinically experienced faculty raters and near-peer raters.
