Computer-aided detection and diagnosis (CADe/CADx) systems in endoscopy are commonly trained on high-quality imagery, which is not representative of the heterogeneous input typically encountered in clinical practice. In endoscopy, image quality depends heavily on both the skills and experience of the endoscopist and the specifications of the system used for screening. Factors such as poor illumination, motion blur, and specific post-processing settings can significantly alter the quality and general appearance of these images. This so-called domain gap between the data used to develop a system and the data it encounters after deployment, and its impact on the performance of the deep neural networks (DNNs) underlying endoscopic CAD systems, remains largely unexplored. As many such systems, e.g., for polyp detection, are already being rolled out in clinical practice, this poses severe patient risks, particularly in community hospitals, where both the imaging equipment and operator experience vary considerably. This study therefore aims to evaluate the impact of this domain gap on the clinical performance of CADe/CADx for various endoscopic applications. To this end, we leverage two publicly available datasets (KVASIR-SEG and GIANA) and two in-house datasets. We investigate the performance of commonly used DNN architectures under synthetic, clinically calibrated image degradations and on a prospectively collected dataset of 342 endoscopic images of lower subjective quality. Additionally, we assess the influence of DNN architecture and complexity, data augmentation, and pretraining techniques on robustness. The results reveal a considerable performance decline of 11.6% (±1.5) relative to the reference within the clinically calibrated boundaries of image degradations. Nevertheless, employing more advanced DNN architectures and self-supervised in-domain pretraining effectively mitigates this drop to 7.7% (±2.03). Moreover, these enhancements yield the highest performance on the manually collected test set containing images of lower subjective quality. By comprehensively assessing the robustness of popular DNN architectures and training strategies across multiple datasets, this study provides valuable insights into their performance and limitations for endoscopic applications. The findings highlight the importance of including robustness evaluation when developing DNNs for endoscopy applications and propose strategies to mitigate performance loss.
DOI: http://dx.doi.org/10.1016/j.media.2024.103157
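The evaluation protocol described above lends itself to a simple illustration. The following is a minimal sketch, not the authors' code, of probing a trained model under synthetic degradations such as blur and reduced illumination; the severity values, the classification setup, and the `model`/`loader` objects are illustrative assumptions (the paper's tasks include detection and segmentation, and its degradations are clinically calibrated rather than the ad hoc values used here).

```python
# Hedged sketch: measure how a model's accuracy drops under synthetic
# image degradations (blur + under-illumination). Severities are assumptions.
import torch
import torchvision.transforms.functional as TF

def degrade(img: torch.Tensor, blur_sigma: float, brightness: float) -> torch.Tensor:
    """Apply a simple blur + illumination change; a stand-in for the
    clinically calibrated degradations described in the abstract."""
    kernel = int(2 * round(3 * blur_sigma) + 1)  # odd kernel covering ~3 sigma
    img = TF.gaussian_blur(img, kernel_size=kernel, sigma=blur_sigma)
    return TF.adjust_brightness(img, brightness)

@torch.no_grad()
def accuracy_under_degradation(model, loader, blur_sigma=2.0, brightness=0.6):
    """Accuracy on degraded inputs; compare against the clean baseline."""
    model.eval()
    correct = total = 0
    for images, labels in loader:  # assumed: float images in [0, 1], class labels
        preds = model(degrade(images, blur_sigma, brightness)).argmax(dim=1)
        correct += (preds == labels).sum().item()
        total += labels.numel()
    return correct / total
```

Sweeping `blur_sigma` and `brightness` over a grid and comparing against the undegraded baseline reproduces, in spirit, the kind of robustness curves the study reports.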
Philos Trans A Math Phys Eng Sci
January 2025
Microsystems Group, School of Engineering, Newcastle University, Newcastle upon Tyne NE1 7RU, UK.
The increasing demand for processing large volumes of data for machine learning (ML) models has pushed data bandwidth requirements beyond the capability of the traditional von Neumann architecture. In-memory computing (IMC) has recently emerged as a promising solution to address this gap by enabling distributed data storage and processing at the micro-architectural level, significantly reducing both latency and energy consumption. In this article, we present an In-Memory comPuting architecture based on Y-FlAsh technology for Coalesced Tsetlin machine inference (IMPACT), underpinned by a cutting-edge memory device, Y-Flash, fabricated in a 180 nm complementary metal oxide semiconductor (CMOS) process.
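For readers unfamiliar with the workload IMPACT accelerates, the sketch below shows Tsetlin machine clause inference in plain NumPy; the clause/literal layout, the array names, and the two-class sign vote are illustrative assumptions and do not reflect the paper's Y-Flash in-memory mapping.

```python
# Hedged sketch of Tsetlin machine inference: clauses are conjunctions of
# literals (inputs and their negations); a polarity-weighted vote decides.
import numpy as np

def tm_inference(x: np.ndarray, include_masks: np.ndarray, polarities: np.ndarray) -> int:
    """x: binary input of shape (n,); literals are [x, ~x] concatenated.
    include_masks: (clauses, 2n) boolean, True where a literal is included.
    polarities: (clauses,) in {+1, -1}. Returns the sign of the class vote."""
    literals = np.concatenate([x, 1 - x])              # positive and negated literals
    # A clause fires iff every included literal is 1 (a pure conjunction):
    # excluded positions are vacuously satisfied via ~include_masks.
    fired = np.all(literals[None, :] | ~include_masks, axis=1)
    # Class score is the polarity-weighted clause vote; its sign gives the class.
    return int(np.sign(polarities @ fired))
```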
Nat Commun
January 2025
Rudolf Peierls Centre for Theoretical Physics, University of Oxford, Oxford, UK.
The remarkable performance of overparameterized deep neural networks (DNNs) must arise from an interplay between network architecture, training algorithms, and structure in the data. To disentangle these three components for supervised learning, we apply a Bayesian picture based on the functions expressed by a DNN. The prior over functions is determined by the network architecture, which we vary by exploiting a transition between ordered and chaotic regimes.
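The "prior over functions" in this Bayesian picture can be estimated empirically by sampling network weights and tallying the boolean function each sample expresses. The sketch below is a hedged illustration of that idea; the architecture, input size, and sample count are assumptions, not the paper's setup.

```python
# Hedged sketch: estimate a network's prior over boolean functions by
# sampling random weights and counting which function each sample computes.
import itertools, collections
import numpy as np

n_bits, n_samples = 5, 10_000
inputs = np.array(list(itertools.product([0, 1], repeat=n_bits)), dtype=float)

counts = collections.Counter()
rng = np.random.default_rng(0)
for _ in range(n_samples):
    W1 = rng.normal(0, 1, size=(n_bits, 16))   # random hidden layer
    w2 = rng.normal(0, 1, size=16)             # random readout
    f = (np.maximum(inputs @ W1, 0) @ w2 > 0)  # ReLU net thresholded to a boolean function
    counts[f.tobytes()] += 1

# Simple (low-complexity) functions typically dominate this empirical prior.
print(counts.most_common(3))
```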
J Chem Inf Model
January 2025
Department of Chemical and Physical Biology, Vanderbilt University, Nashville, Tennessee 37232, United States.
Machine learning (ML) models now play a crucial role in predicting properties essential to drug development, such as a drug's log-scale acid-dissociation constant (pKa). Despite recent architectural advances, these models often generalize poorly to novel compounds due to a scarcity of ground-truth data. Furthermore, these models lack interpretability.
Viruses
November 2024
Faculty of Medical and Health Sciences, Tel Aviv University, Tel Aviv 6997801, Israel.
In this study, we introduce a novel approach that integrates interpretability techniques from both traditional machine learning (ML) and deep neural networks (DNNs) to quantify feature importance using global and local interpretation methods. Our method bridges the gap between interpretable ML models and powerful deep learning (DL) architectures, providing comprehensive insights into the key drivers behind model predictions, especially in detecting outliers within medical data. We applied this method to analyze COVID-19 pandemic data from 2020, yielding intriguing insights.
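As a concrete, hedged example of the kind of global importance measure such a pipeline can include, the snippet below computes permutation importance on synthetic data; the model choice and the data are illustrative assumptions, standing in for only the global half of the combined ML/DNN method described.

```python
# Hedged sketch: permutation importance as a global feature-importance measure.
# Synthetic data; only features 0 and 1 actually drive the label.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 4))
y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(int)

model = RandomForestClassifier(random_state=0).fit(X, y)
result = permutation_importance(model, X, y, n_repeats=10, random_state=0)
for i, imp in enumerate(result.importances_mean):
    print(f"feature {i}: importance {imp:.3f}")  # features 0 and 1 should dominate
```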
Sensors (Basel)
December 2024
Department of Electrical and Computer Engineering, University of Missouri, Columbia, MO 65211, USA.
Multi-modal systems extract information about the environment using specialized sensors, each optimized for the wavelengths of the phenomenology and material interactions involved. To maximize entropy, complementary systems operating in non-overlapping wavelength regions are optimal. VIS-IR (visible-infrared) systems have been at the forefront of multi-modal fusion research and are used extensively to represent information in all-day, all-weather applications.