Techniques in enhancing computation and understanding of convolutional neural networks

Convolutional Neural Networks (CNNs) are effective in solving a large number of complex tasks. The performance of CNNs is currently equaling or even surpassing the human performance level in a wide range of real-world problems. Such high performance is achieved at the cost of high computational and...

Bibliographic Details
Main Author: Abdiyeva, Kamila
Other Authors: Yap Kim Hui
Format: Thesis-Master by Research
Language: English
Published: Nanyang Technological University 2021
Subjects: Engineering::Computer science and engineering
Online Access: https://hdl.handle.net/10356/154072
Institution: Nanyang Technological University
id sg-ntu-dr.10356-154072
record_format dspace
spelling sg-ntu-dr.10356-154072
2023-07-04T17:41:47Z
Techniques in enhancing computation and understanding of convolutional neural networks
Abdiyeva, Kamila
Yap Kim Hui
School of Electrical and Electronic Engineering
A*STAR
EKHYap@ntu.edu.sg
Engineering::Computer science and engineering
Master of Engineering
2021-12-17T03:44:16Z
2021-12-17T03:44:16Z
2021
Thesis-Master by Research
Abdiyeva, K. (2021). Techniques in enhancing computation and understanding of convolutional neural networks. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/154072
https://hdl.handle.net/10356/154072
10.32657/10356/154072
en
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).
application/pdf
Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
description Convolutional Neural Networks (CNNs) are effective at solving a large number of complex tasks. Their performance now equals or even surpasses human performance on a wide range of real-world problems. This performance comes at the cost of high computational and storage requirements. To satisfy these requirements, specialized hardware such as Graphics Processing Units (GPUs) or Tensor Processing Units (TPUs) is required. Moreover, CNNs are mainly used as black-box tools, and only a few attempts have been made to understand them. This thesis presents two studies that address the lack of understanding and the high computational requirements of CNNs. The first study, introduced in Chapter 3, proposes a method for enhancing CNN computation by reducing the number of computational operations performed. The method, Quantized-by-Lookup Network (QL-Net), substitutes Multiply and Accumulate (MAC) operations with a codebook lookup and combines three concepts: (i) codebook construction, (ii) a layer-wise retraining strategy, and (iii) substitution of the MAC operations with a lookup of the convolution responses at inference time. QL-Net achieves good performance on datasets such as MNIST and CIFAR-10. The second study improves the understanding of CNNs by examining the importance of each learned feature for the recognition of individual object classes. The experimental work in Chapter 4 extends the current understanding of the roles of CNN filters, their mutual interactions, and their relationship to classification accuracy. Additionally, the study shows that the classification accuracy of some classes in the target object set can be improved by removing the subset of filters that contribute least to those classes.
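The idea of replacing MAC operations with a codebook lookup can be illustrated with a small sketch. This record does not describe how QL-Net builds or indexes its codebook, so everything below is an assumption made for illustration: the codebook comes from plain k-means over flattened input patches, the lookup table stores each filter's response to each codeword, and the function names (`build_codebook`, `build_table`, `lookup_responses`) are hypothetical. The layer-wise retraining step is not shown.

```python
import random


def dot(u, v):
    """Multiply-and-accumulate of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def nearest(patch, codebook):
    """Index of the codeword closest to `patch` (squared Euclidean distance)."""
    return min(
        range(len(codebook)),
        key=lambda j: sum((p - c) ** 2 for p, c in zip(patch, codebook[j])),
    )


def build_codebook(patches, k, iters=10, seed=0):
    """Plain k-means over flattened patches (an assumed stand-in, not the thesis's method)."""
    rng = random.Random(seed)
    codebook = [list(p) for p in rng.sample(patches, k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in patches:
            groups[nearest(p, codebook)].append(p)
        for j, members in enumerate(groups):
            if members:  # an empty cluster keeps its previous codeword
                codebook[j] = [sum(col) / len(members) for col in zip(*members)]
    return codebook


def build_table(codebook, filters):
    """table[j][f] = response of filter f to codeword j, computed once offline."""
    return [[dot(c, w) for w in filters] for c in codebook]


def lookup_responses(patches, codebook, table):
    """Approximate each patch's filter responses by a table lookup instead of per-filter MACs."""
    return [table[nearest(p, codebook)] for p in patches]


# Tiny hand-made example: two codewords, two filters, one input patch.
codebook = [[0.0, 0.0], [1.0, 1.0]]
filters = [[1.0, -1.0], [0.5, 0.5]]
table = build_table(codebook, filters)
demo = lookup_responses([[0.9, 1.1]], codebook, table)
print(demo)  # [[0.0, 1.0]]; the exact responses would be about [-0.2, 1.0]
```

In this sketch the nearest-codeword search still performs arithmetic, so the saving comes only from sharing one table row across all filters of a layer. The approximation error depends on how well the codebook covers the input patches, which is presumably what a retraining step would compensate for.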
author2 Yap Kim Hui
author_facet Yap Kim Hui
Abdiyeva, Kamila
format Thesis-Master by Research
author Abdiyeva, Kamila
title Techniques in enhancing computation and understanding of convolutional neural networks
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/154072
_version_ 1772826218475814912