Testing convolutional neural network based deep learning systems: a statistical metamorphic approach.

PeerJ Comput Sci

Gianforte School of Computing, Montana State University, Bozeman, Montana, United States.

Published: January 2025

Machine learning technology spans many areas and today plays a significant role in addressing a wide range of problems in critical domains, ., healthcare, autonomous driving, finance, manufacturing, cybersecurity, . Metamorphic testing (MT) is considered a simple but very powerful approach in testing such computationally complex systems for which either an oracle is not available or is available but difficult to apply. Conventional metamorphic testing techniques have certain limitations in verifying deep learning-based models (., convolutional neural networks (CNNs)) that have a stochastic nature (because of randomly initializing the network weights) in their training. In this article, we attempt to address this problem by using a statistical metamorphic testing (SMT) technique that does not require software testers to worry about fixing the random seeds (to get deterministic results) to verify the metamorphic relations (MRs). We propose seven MRs combined with different statistical methods to statistically verify whether the program under test adheres to the relation(s) specified in the MR(s). We further use mutation testing techniques to show the usefulness of the proposed approach in the healthcare space and test two CNN-based deep learning models (used for pneumonia detection among patients). The empirical results show that our proposed approach uncovers 85.71% of the implementation faults in the classifiers under test (CUT). Furthermore, we also propose an MRs minimization algorithm for the CUT, thus saving computational costs and organizational testing resources.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11888866PMC
http://dx.doi.org/10.7717/peerj-cs.2658DOI Listing

Publication Analysis

Top Keywords

metamorphic testing
12
convolutional neural
8
deep learning
8
statistical metamorphic
8
testing techniques
8
proposed approach
8
testing
7
metamorphic
5
testing convolutional
4
neural network
4

Similar Publications

Machine learning technology spans many areas and today plays a significant role in addressing a wide range of problems in critical domains, ., healthcare, autonomous driving, finance, manufacturing, cybersecurity, . Metamorphic testing (MT) is considered a simple but very powerful approach in testing such computationally complex systems for which either an oracle is not available or is available but difficult to apply.

View Article and Find Full Text PDF

At ontogenetic transitions, animals often exhibit plastic variation in development, behavior and physiology in response to environmental conditions. Most terrestrial-breeding frogs have aquatic larval periods. Some species can extend their initial terrestrial period, as either a plastic embryonic response to balance trade-offs across environments or an enforced wait for rain that allows larvae to access aquatic habitats.

View Article and Find Full Text PDF

The Haba Snow Mountain Tunnel experienced severe deformation caused by foliated metamorphic basalt. This rock has high metamorphism, easy weathering, and low strength. This study examines the rock's microscopic characteristics and mechanical properties to understand its impact on tunnel stability and offer guidance for similar engineering challenges.

View Article and Find Full Text PDF

It has frequently been hypothesized that among-individual variation in behavior and physiology will correlate with life history traits, yet the nature of these correlations can vary. Such variability may arise from plasticity in trait development, which can amplify or attenuate trait correlations across different environments. Using the Mexican spadefoot toad (Spea multiplicata), we tested whether relationships between larval growth rate and post-metamorphic behavior or physiology are influenced by a key mediator of developmental plasticity: larval diet type.

View Article and Find Full Text PDF

Purpose: Typical clinical "in use" conditions for topical semisolids involve their application as a thin film, often with rubbing that can induce metamorphic stress. Yet, product quality and performance tests often characterize the manufactured product, and may not consider product metamorphosis (e.g.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!