An investigation into the risk of population bias in deep learning autocontouring.

Radiother Oncol

Mirada Medical Ltd, Oxford, United Kingdom; Inpictura Ltd, Oxford, United Kingdom.

Published: September 2023

Background and Purpose: To date, data used in the development of deep learning-based automatic contouring (DLC) algorithms have been largely sourced from single geographic populations. This study aimed to evaluate the risk of population-based bias by determining whether the performance of an autocontouring system is affected by geographic population.

Materials and Methods: Eighty deidentified head and neck CT scans were collected from four clinics in Europe (n = 2) and Asia (n = 2). A single observer manually delineated 16 organs-at-risk in each scan. The data were then contoured using a DLC solution trained on single-institution (European) data. Autocontours were compared to manual delineations using quantitative measures. A Kruskal-Wallis test was used to test for any difference between populations. Clinical acceptability of automatic and manual contours to observers from each participating institution was assessed using a blinded subjective evaluation.
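The quantitative comparison described above can be illustrated with a minimal sketch. The abstract does not name the similarity measures used, so the Dice coefficient here is an assumption, as are the function names `dice` and `kruskal_wallis_h` and the per-clinic scores, which are illustrative values and not data from the study. The Kruskal-Wallis H statistic is computed with average ranks for ties and without the tie correction factor.

```python
from itertools import chain


def dice(a: set, b: set) -> float:
    """Dice similarity between two voxel sets: 2|A & B| / (|A| + |B|)."""
    if not a and not b:
        return 1.0  # two empty contours are treated as identical
    return 2 * len(a & b) / (len(a) + len(b))


def kruskal_wallis_h(groups) -> float:
    """Kruskal-Wallis H statistic (average ranks for ties, no tie correction)."""
    pooled = sorted(chain.from_iterable(groups))
    n = len(pooled)
    # Map each distinct value to the mean of the 1-based ranks it occupies.
    rank_of = {}
    i = 0
    while i < n:
        j = i
        while j < n and pooled[j] == pooled[i]:
            j += 1
        rank_of[pooled[i]] = (i + 1 + j) / 2  # mean of ranks i+1 .. j
        i = j
    # Sum over groups of (rank sum squared) / (group size).
    total = sum(sum(rank_of[v] for v in g) ** 2 / len(g) for g in groups)
    return 12 / (n * (n + 1)) * total - 3 * (n + 1)


# Hypothetical manual and automatic contours as sets of (row, col) voxels.
manual = {(0, 0), (0, 1), (1, 0), (1, 1)}
auto = {(0, 1), (1, 1), (0, 2), (1, 2)}
print(dice(manual, auto))

# Hypothetical per-clinic Dice scores for one organ (not study data).
clinic_scores = [
    [0.81, 0.84, 0.79],
    [0.83, 0.80, 0.85],
    [0.76, 0.78, 0.74],
    [0.77, 0.82, 0.75],
]
print(kruskal_wallis_h(clinic_scores))
```

Under the null hypothesis, H is approximately chi-square distributed with one fewer degree of freedom than the number of groups, so a large H for a given organ would suggest that the similarity scores differ between populations.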

Results: Seven organs showed a significant difference in volume between groups. Four organs showed statistically significant differences in quantitative similarity measures. The qualitative test showed greater variation in acceptance of contouring between observers than between data from different origins, with greater acceptance by the South Korean observers.

Conclusion: Much of the statistical difference in quantitative performance could be explained by differences in organ volume, which affect the contour similarity measures, and by the small sample size. However, the qualitative assessment suggests that observer perception bias has a greater impact on the apparent clinical acceptability than quantitatively observed differences. This investigation of potential geographic bias should extend to more patients, populations, and anatomical regions in the future.
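The dependence of overlap measures on organ volume can be shown with a small worked example. This is a sketch under stated assumptions: the organ is modeled as a cube of voxels, the autocontour is the same cube displaced by one voxel along one axis, and the Dice coefficient stands in for the unnamed similarity measures. The helper names `cube` and `shifted_cube_dice` are hypothetical.

```python
def cube(side: int, offset: int = 0) -> set:
    """Voxel set of a cube with the given side length, shifted along x by offset."""
    return {
        (x + offset, y, z)
        for x in range(side)
        for y in range(side)
        for z in range(side)
    }


def shifted_cube_dice(side: int, shift: int = 1) -> float:
    """Dice between a cube and a copy of it shifted by `shift` voxels along x."""
    reference = cube(side)
    displaced = cube(side, offset=shift)
    overlap = len(reference & displaced)
    return 2 * overlap / (len(reference) + len(displaced))


# The same one-voxel boundary error costs a small structure far more overlap.
small = shifted_cube_dice(5)    # 125-voxel structure  -> 0.8
large = shifted_cube_dice(20)   # 8000-voxel structure -> 0.95
print(small, large)
```

For a cube of side s and a one-voxel shift, the score works out to (s - 1) / s, so an identical absolute contouring error yields a lower score for a smaller organ. If organ volumes differ between populations, similarity scores can therefore differ even when contouring accuracy does not.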


Source
http://dx.doi.org/10.1016/j.radonc.2023.109747


