CNN fixations : an unraveling approach to visualize the discriminative image regions

Deep convolutional neural networks (CNNs) have revolutionized the computer vision research and have seen unprecedented adoption for multiple tasks, such as classification, detection, and caption generation. However, they offer little transparency into their inner workings and are often treated as bl...

Full description

Saved in:

Bibliographic Details
Main Authors:	Mopuri, Konda Reddy, Garg, Utsav, Babu, R. Venkatesh
Other Authors:	School of Computer Science and Engineering
Format:	Article
Language:	English
Published:	2020
Subjects:	Engineering::Computer science and engineering Explainable AI CNN Visualization
Online Access:	https://hdl.handle.net/10356/142317
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-142317
record_format	dspace
spelling	sg-ntu-dr.10356-1423172020-06-19T03:09:49Z CNN fixations : an unraveling approach to visualize the discriminative image regions Mopuri, Konda Reddy Garg, Utsav Babu, R. Venkatesh School of Computer Science and Engineering Engineering::Computer science and engineering Explainable AI CNN Visualization Deep convolutional neural networks (CNNs) have revolutionized the computer vision research and have seen unprecedented adoption for multiple tasks, such as classification, detection, and caption generation. However, they offer little transparency into their inner workings and are often treated as black boxes that deliver excellent performance. In this paper, we aim at alleviating this opaqueness of CNNs by providing visual explanations for the network's predictions. Our approach can analyze a variety of CNN-based models trained for computer vision applications, such as object recognition and caption generation. Unlike the existing methods, we achieve this via unraveling the forward pass operation. The proposed method exploits feature dependencies across the layer hierarchy and uncovers the discriminative image locations that guide the network's predictions. We name these locations CNN fixations, loosely analogous to human eye fixations. Our approach is a generic method that requires no architectural changes, additional training, or gradient computation, and computes the important image locations (CNN fixations). We demonstrate through a variety of applications that our approach is able to localize the discriminative image locations across different network architectures, diverse vision tasks, and data modalities. 2020-06-19T03:09:48Z 2020-06-19T03:09:48Z 2018 Journal Article Mopuri, K. R., Garg, U., & Babu, R. V. (2019). CNN fixations : an unraveling approach to visualize the discriminative image regions. IEEE Transactions on Image Processing, 28(5), 2116-2125. doi:10.1109/TIP.2018.2881920 1057-7149 https://hdl.handle.net/10356/142317 10.1109/TIP.2018.2881920 30452367 2-s2.0-85056698918 5 28 2116 2125 en IEEE Transactions on Image Processing © 2018 IEEE. All rights reserved.
institution	Nanyang Technological University
building	NTU Library
country	Singapore
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering Explainable AI CNN Visualization
spellingShingle	Engineering::Computer science and engineering Explainable AI CNN Visualization Mopuri, Konda Reddy Garg, Utsav Babu, R. Venkatesh CNN fixations : an unraveling approach to visualize the discriminative image regions
description	Deep convolutional neural networks (CNNs) have revolutionized the computer vision research and have seen unprecedented adoption for multiple tasks, such as classification, detection, and caption generation. However, they offer little transparency into their inner workings and are often treated as black boxes that deliver excellent performance. In this paper, we aim at alleviating this opaqueness of CNNs by providing visual explanations for the network's predictions. Our approach can analyze a variety of CNN-based models trained for computer vision applications, such as object recognition and caption generation. Unlike the existing methods, we achieve this via unraveling the forward pass operation. The proposed method exploits feature dependencies across the layer hierarchy and uncovers the discriminative image locations that guide the network's predictions. We name these locations CNN fixations, loosely analogous to human eye fixations. Our approach is a generic method that requires no architectural changes, additional training, or gradient computation, and computes the important image locations (CNN fixations). We demonstrate through a variety of applications that our approach is able to localize the discriminative image locations across different network architectures, diverse vision tasks, and data modalities.
author2	School of Computer Science and Engineering
author_facet	School of Computer Science and Engineering Mopuri, Konda Reddy Garg, Utsav Babu, R. Venkatesh
format	Article
author	Mopuri, Konda Reddy Garg, Utsav Babu, R. Venkatesh
author_sort	Mopuri, Konda Reddy
title	CNN fixations : an unraveling approach to visualize the discriminative image regions
title_short	CNN fixations : an unraveling approach to visualize the discriminative image regions
title_full	CNN fixations : an unraveling approach to visualize the discriminative image regions
title_fullStr	CNN fixations : an unraveling approach to visualize the discriminative image regions
title_full_unstemmed	CNN fixations : an unraveling approach to visualize the discriminative image regions
title_sort	cnn fixations : an unraveling approach to visualize the discriminative image regions
publishDate	2020
url	https://hdl.handle.net/10356/142317
_version_	1681059066216972288

CNN fixations : an unraveling approach to visualize the discriminative image regions

Similar Items