Uncovering and Correcting Shortcut Learning in Machine Learning Models for Skin Cancer Diagnosis.

Diagnostics (Basel)

Institute for Artificial Intelligence in Medicine, University of Duisburg-Essen, 45131 Essen, Germany.

Published: December 2021

Machine learning models have been successfully applied for analysis of skin images. However, due to the black box nature of such deep learning models, it is difficult to understand their underlying reasoning. This prevents a human from validating whether the model is right for the right reasons. Spurious correlations and other biases in data can cause a model to base its predictions on such artefacts rather than on the true relevant information. These learned shortcuts can in turn cause incorrect performance estimates and can result in unexpected outcomes when the model is applied in clinical practice. This study presents a method to detect and quantify this shortcut learning in trained classifiers for skin cancer diagnosis, since it is known that dermoscopy images can contain artefacts. Specifically, we train a standard VGG16-based skin cancer classifier on the public ISIC dataset, for which colour calibration charts (elliptical, coloured patches) occur only in benign images and not in malignant ones. Our methodology artificially inserts those patches and uses inpainting to automatically remove patches from images to assess the changes in predictions. We find that our standard classifier partly bases its predictions of benign images on the presence of such a coloured patch. More importantly, by artificially inserting coloured patches into malignant images, we show that shortcut learning results in a significant increase in misdiagnoses, making the classifier unreliable when used in clinical practice. With our results, we, therefore, want to increase awareness of the risks of using black box machine learning models trained on potentially biased datasets. Finally, we present a model-agnostic method to neutralise shortcut learning by removing the bias in the training dataset by exchanging coloured patches with benign skin tissue using image inpainting and re-training the classifier on this de-biased dataset.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8774502PMC
http://dx.doi.org/10.3390/diagnostics12010040DOI Listing

Publication Analysis

Top Keywords

shortcut learning
16
learning models
16
machine learning
12
skin cancer
12
coloured patches
12
learning
8
cancer diagnosis
8
black box
8
clinical practice
8
benign images
8

Similar Publications

Unleashing the Potential of Pre-Trained Diffusion Models for Generalizable Person Re-Identification.

Sensors (Basel)

January 2025

College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China.

Domain-generalizable re-identification (DG Re-ID) aims to train a model on one or more source domains and evaluate its performance on unseen target domains, a task that has attracted growing attention due to its practical relevance. While numerous methods have been proposed, most rely on discriminative or contrastive learning frameworks to learn generalizable feature representations. However, these approaches often fail to mitigate shortcut learning, leading to suboptimal performance.

View Article and Find Full Text PDF

Proficiency-chasing and goalodicy: In prioritising checklists, are we gambling with the future of mental health nursing?

Nurse Educ Today

January 2025

School of Nursing and Midwifery, University of Central Lancashire, United Kingdom of Great Britain and Northern Ireland. Electronic address:

In this discussion paper, I take a critical approach to the use of standardised checklists in practice assessment documents as a valid method of assessing mental health nursing students in the UK. The game Bingo is applied here as a metaphor, highlighting the folly of using standardised cross-field checklists to assess mental health nursing students in practice. Such practices, I argue, amount to little more than a game of proficiency-chasing at the expense of seeking more meaningful learning experiences, especially where practice assessment documents currently prioritise physical health care skills above those required for successful mental health nursing.

View Article and Find Full Text PDF

Nonpregnant and pregnant women who present with acute pelvic pain can pose a diagnostic challenge in the emergency setting. The clinical presentation is often nonspecific, and the differential diagnosis may be very broad. These symptoms are often indications for pelvic US, which is the primary imaging modality when an obstetric or gynecologic cause is suspected.

View Article and Find Full Text PDF

Introduction: Addressing physician burnout is critical for healthcare systems. As electronic health record (EHR) workload and teamwork have been identified as major contributing factors to physician well-being, we aimed to mitigate burnout through EHR-based interventions and a compassion team practice (CTP), targeting EHR workload and team cohesion.

Methods: A modified stepped wedge-clustered randomized trial was conducted, involving specialties with heavy InBasket workloads.

View Article and Find Full Text PDF

Dysgraphia often goes unnoticed in schools, leading to delayed academic development and diminished self-esteem for affected students. This case report provides keyboarding instruction to a nine-year-old Japanese boy diagnosed with dysgraphia and observes its impact on his writing performance, including speed, accuracy, and composition, and mental burden. The patient was diagnosed with dysgraphia and refusal to write at school.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!