Evaluating observer agreement of scoring systems for foot integrity and footrot lesions in sheep.

BMC Vet Res

Quantitative Veterinary Epidemiology group, Wageningen Institute of Animal Sciences, Wageningen University, Wageningen, The Netherlands.

Published: May 2012

AI Article Synopsis

  • The study investigates observer agreement in scoring footrot and foot integrity in sheep using both a 5-point and a 4-point ordinal scale.
  • Observers showed more consistency in their scores when assessing their own observations compared to scoring against other observers, but disagreements were identified due to bias and varying threshold interpretations for certain scores.
  • The results indicate that using photographs led to more reliable scores than using video clips or physical foot specimens, highlighting the effectiveness of latent class modeling in analyzing scoring differences.

Article Abstract

Background: A scoring scale with five ordinal categories is used for visual diagnosis of footrot in sheep and to study its epidemiology and control. More recently a 4 point ordinal scale has been used by researchers to score foot integrity (wall and sole horn damage) in sheep. There is no information on observer agreement using either of these scales. Observer agreement for ordinal scores is usually estimated by single measure values such as weighted kappa or Kendall's coefficient of concordance which provide no information where the disagreement lies. Modeling techniques such as latent class models provide information on both observer bias and whether observers have different thresholds at which they change the score given. In this paper we use weighted kappa and located latent class modeling to explore observer agreement when scoring footrot lesions (using photographs and videos) and foot integrity (using post mortem specimens) in sheep. We used 3 observers and 80 photographs and videos and 80 feet respectively.

Results: Both footrot and foot integrity scoring scales were more consistent within observers than between. The weighted kappa values between observers for both footrot and integrity scoring scales ranged from moderate to substantial. There was disagreement between observers with both observer bias and different thresholds between score values. The between observer thresholds were different for scores 1 and 2 for footrot (using photographs and videos) and for all scores for integrity (both walls and soles). The within observer agreement was higher with weighted kappa values ranging from substantial to almost perfect. Within observer thresholds were also more consistent than between observer thresholds. Scoring using photographs was less variable than scoring using video clips or feet.

Conclusions: Latent class modeling is a useful method for exploring components of disagreement within and between observers and this information could be used when developing a scoring system to improve reliability.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3428656PMC
http://dx.doi.org/10.1186/1746-6148-8-65DOI Listing

Publication Analysis

Top Keywords

observer agreement
20
foot integrity
16
weighted kappa
16
latent class
12
photographs videos
12
observer thresholds
12
observer
9
scoring
8
agreement scoring
8
footrot lesions
8

Similar Publications

Prognostic significance of serum complement activation, neutrophil extracellular traps and extracellular DNA in newly diagnosed epithelial ovarian cancer.

Gynecol Oncol

January 2025

Departments of Internal Medicine and Immunology, Roswell Park Comprehensive Cancer Center, Buffalo, NY, United States of America; Department of Medicine, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, United States of America.

Purpose: We observed that the tumor microenvironment (TME) in metastatic epithelial ovarian cancer (EOC) and in other solid tumors can reprogram normal neutrophils to acquire a complement-dependent suppressor phenotype characterized by inhibition of stimulated T cell activation. This study aims to evaluate whether serum markers of neutrophil activation and complement at diagnosis of EOC would be associated with clinical outcomes.

Experimental Design: We conducted a two-center prospective study of patients with newly diagnosed EOC (N = 188).

View Article and Find Full Text PDF

Purpose: Acute fatty liver of pregnancy (AFLP) is a severe complication that can occur in the third trimester or immediately postpartum, characterized by rapid hepatic failure. This study aims to explore the changes in portal vein blood flow velocity and liver function during pregnancy, which may assist in the early diagnosis and management of AFLP.

Methods: This longitudinal study was conducted at a tertiary healthcare center with participants recruited from routine antenatal check-ups.

View Article and Find Full Text PDF

Purpose: Reliable image quality assessment is crucial for evaluating new motion correction methods for magnetic resonance imaging. In this work, we compare the performance of commonly used reference-based and reference-free image quality metrics on a unique dataset with real motion artifacts. We further analyze the image quality metrics' robustness to typical pre-processing techniques.

View Article and Find Full Text PDF

In this paper, we attempt to answer two questions: 1) which regions of the human brain, in terms of morphometry, are most strongly related to individual differences in domain-general cognitive functioning ( )? and 2) what are the underlying neurobiological properties of those regions? We meta-analyse vertex-wise -cortical morphometry (volume, surface area, thickness, curvature and sulcal depth) associations using data from 3 cohorts: the UK Biobank (UKB), Generation Scotland (GenScot), and the Lothian Birth Cohort 1936 (LBC1936), with the meta-analytic = 38,379 (age range = 44 to 84 years old). These morphometry associations vary in magnitude and direction across the cortex (|β| range = -0.12 to 0.

View Article and Find Full Text PDF

Electronic health records (EHR) are increasingly used in public health research. However, biases may exist when using EHR due to whether someone is captured in the data. Assessing the impact of bias in generating disparities identified with EHR data is difficult because information about healthcare-seeking behaviors is not included in the record.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!