AI Article Synopsis

Article Abstract

Deep convolutional neural networks (CNNs) have revolutionized computer vision research and have seen unprecedented adoption for multiple tasks, such as classification, detection, and caption generation. However, they offer little transparency into their inner workings and are often treated as black boxes that deliver excellent performance. In this paper, we aim to alleviate this opaqueness of CNNs by providing visual explanations for the network's predictions. Our approach can analyze a variety of CNN-based models trained for computer vision applications, such as object recognition and caption generation. Unlike existing methods, we achieve this by unraveling the forward-pass operation. The proposed method exploits feature dependencies across the layer hierarchy and uncovers the discriminative image locations that guide the network's predictions. We name these locations CNN fixations, loosely analogous to human eye fixations. Our approach is a generic method that requires no architectural changes, additional training, or gradient computation to compute the important image locations (CNN fixations). We demonstrate through a variety of applications that our approach can localize the discriminative image locations across different network architectures, diverse vision tasks, and data modalities.
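The backtracking idea sketched in the abstract can be illustrated with a toy example. The snippet below, in plain NumPy, traces a prediction back through fully connected layers by keeping, at each step, only the preceding units whose contribution (weight times activation) to the currently important units is positive. This is a minimal sketch under simplifying assumptions (fully connected layers only, positive-contribution selection); the function names and the toy network are hypothetical illustrations, not the authors' implementation, which also handles convolution, pooling, and other layer types.

```python
import numpy as np

def backtrack_fc(W, prev_acts, important):
    # Hypothetical helper: for each important neuron j in the current
    # layer, keep the preceding-layer units whose contribution
    # (weight * activation) to j is positive, i.e. its supporting evidence.
    keep = set()
    for j in important:
        contrib = W[j] * prev_acts
        keep.update(np.flatnonzero(contrib > 0).tolist())
    return sorted(keep)

def cnn_fixations(layer_weights, layer_acts, class_idx):
    # Trace the predicted class neuron back to the input units by
    # repeatedly keeping only the positively contributing predecessors.
    important = [class_idx]
    for W, a in zip(reversed(layer_weights), reversed(layer_acts[:-1])):
        important = backtrack_fc(W, a, important)
    return important  # indices of input units that drove the prediction

# Toy two-layer network (all values random, for illustration only).
rng = np.random.default_rng(0)
x = rng.random(8)                      # toy 8-unit "image"
W1 = rng.normal(size=(6, 8))
W2 = rng.normal(size=(3, 6))
h = np.maximum(W1 @ x, 0.0)            # ReLU hidden layer
logits = W2 @ h
acts = [x, h, logits]                  # activations, input first

fixations = cnn_fixations([W1, W2], acts, int(np.argmax(logits)))
print(fixations)                       # input indices supporting the prediction
```

On this toy network, the returned indices mark the input units that supplied positive evidence for the predicted class, which is the role the paper's CNN fixations play at the level of image locations.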

Source
http://dx.doi.org/10.1109/TIP.2018.2881920

Publication Analysis

Top Keywords

cnn fixations (12)
discriminative image (12)
image locations (12)
computer vision (8)
caption generation (8)
network's predictions (8)
locations cnn (8)
fixations unraveling (4)
approach (4)
unraveling approach (4)
