Model explainability is one of the crucial ingredients for building trustable AI systems, especially in the applications requiring reliability such as automated driving and diagnosis. Many explainability methods have been studied in the literature. Among many others, this article focuses on a research line that tries to visually explain a pre-trained image classification model such as Convolutional Neural Network by discovering concepts learned by the model, which is so-called the concept-based explanation. Previous concept-based explanation methods rely on the human definition of concepts (e.g., the Broden dataset) or semantic segmentation techniques like Slic (Simple Linear Iterative Clustering). However, we argue that the concepts identified by those methods may show image parts which are more in line with a human perspective or cropped by a segmentation method, rather than purely reflect a model's own perspective. We propose Model-Oriented Concept Extraction (MOCE), a novel approach to extracting key concepts based solely on a model itself, thereby being able to capture its unique perspectives which are not affected by any external factors. Experimental results on various pre-trained models confirmed the advantages of extracting concepts by truly representing the model's point of view.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2024.3357717DOI Listing

Publication Analysis

Top Keywords

concept-based explanation
8
concepts
6
model
5
model at?
4
at? extracting
4
extracting model-oriented
4
model-oriented concepts
4
concepts explaining
4
explaining deep
4
deep neural
4

Similar Publications

Article Synopsis
  • Deep learning has become the top choice for computer vision and decision-making tasks in the last decade, but its complex models raise concerns about fairness and understanding their decisions.
  • Existing methods provide insights into how low-level features relate to outcomes but struggle to connect these to higher-level human reasoning, leading to irrelevant concept extraction and limited local explanations.
  • The study introduces CA-SoftNet, a two-part model blending fast pattern recognition and clear reasoning, which shows improved accuracy on diverse datasets and offers a better way to extract and explain important concepts in a human-understandable manner.
View Article and Find Full Text PDF

MulCPred: Learning Multi-Modal Concepts for Explainable Pedestrian Action Prediction.

Sensors (Basel)

October 2024

Graduate School of Informatics, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8601, Japan.

Article Synopsis
  • The paper introduces MulCPred, a new framework designed to provide explainable predictions for pedestrian action, which is essential for applications like autonomous driving.
  • It addresses limitations of existing methods by using a linear aggregator for multi-modal concept integration, a channel-wise recalibration module for focusing on detailed input areas, and a regularization loss to capture diverse patterns.
  • Evaluation on various datasets shows that MulCPred enhances the explainability of predictions without significantly harming accuracy, and by filtering out unrecognizable concepts, it improves performance across different datasets.
View Article and Find Full Text PDF

Concept-based AI interpretability in physiological time-series data: Example of abnormality detection in electroencephalography.

Comput Methods Programs Biomed

December 2024

Institute for Biomedical Informatics, Faculty of Medicine, University Hospital Cologne, University of Cologne, Cologne, Germany.

Article Synopsis
  • Despite advancements in deep learning, its complex and opaque nature hinders widespread clinical adoption, prompting interest in concept-based interpretability, specifically using techniques like Testing with Concept Activation Vectors (TCAV).
  • This study applies TCAV to abnormality detection in electroencephalography (EEG), utilizing the XceptionTime model on multi-channel physiological data to enhance interpretability and analyze concepts linked to EEG pathologies.
  • The results indicate that TCAV scores align with clinical expectations, demonstrating its potential for improving interpretability in deep learning models and identifying biases in medical data.
View Article and Find Full Text PDF

Existing deep learning methods have achieved remarkable results in diagnosing retinal diseases, showcasing the potential of advanced AI in ophthalmology. However, the black-box nature of these methods obscures the decision-making process, compromising their trustworthiness and acceptability. Inspired by the concept-based approaches and recognizing the intrinsic correlation between retinal lesions and diseases, we regard retinal lesions as concepts and propose an inherently interpretable framework designed to enhance both the performance and explainability of diagnostic models.

View Article and Find Full Text PDF

End-to-end deep learning models are increasingly applied to safety-critical human activity recognition (HAR) applications, e.g., healthcare monitoring and smart home control, to reduce developer burden and increase the performance and robustness of prediction models.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!