Assessing rotational invariance of graph convolution neural networks for computer vision

This project aims to assess the property of rotational invariance within graph convolution neural networks (graph CNNs) for learning on images. Standard CNNs possess the property of translational invariance due to the sliding nature of the convolution operation and rotational invariance for small an...

Full description

Saved in:
Bibliographic Details
Main Author: Singh, Priyanshu Kumar
Other Authors: Xavier Bresson
Format: Final Year Project
Language:English
Published: 2018
Subjects:
Online Access:http://hdl.handle.net/10356/74246
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-74246
record_format dspace
spelling sg-ntu-dr.10356-742462023-03-03T20:33:44Z Assessing rotational invariance of graph convolution neural networks for computer vision Singh, Priyanshu Kumar Xavier Bresson School of Computer Science and Engineering DRNTU::Engineering This project aims to assess the property of rotational invariance within graph convolution neural networks (graph CNNs) for learning on images. Standard CNNs possess the property of translational invariance due to the sliding nature of the convolution operation and rotational invariance for small angles, due to the pooling operation. But there is still a need for a CNN model which is able to learn truly rotationally invariant filters so that it is able to perform well on image datasets where the images preserve their identity irrespective of their orientation. Graphs are known to be isotropic in nature and hence, their edges do not have a sense of direction. This project leverages that property to prove that if images are converted into grid graphs and then graph CNNs are used to learn spectral filters over them, then those filters are rotationally invariant. A couple of architectures based on Lenet-5 and VGG16 respectively, were used to conduct comparative experiments between graph CNNs and standard CNNs and it was found that graph CNNs indeed showed much more invariance to rotation than the standard CNNs. The experiments were conducted on the MNIST and CIFAR-10 datasets and it was observed that graph CNNs were able to learn rotationally invariant spectral filters for both the datasets. Furthermore, it was seen that deeper architectures were able to encode more rotational invariance within their learned filters as compared to shallower ones. Bachelor of Engineering (Computer Science) 2018-05-14T05:23:33Z 2018-05-14T05:23:33Z 2018 Final Year Project (FYP) http://hdl.handle.net/10356/74246 en Nanyang Technological University 55 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering
spellingShingle DRNTU::Engineering
Singh, Priyanshu Kumar
Assessing rotational invariance of graph convolution neural networks for computer vision
description This project aims to assess the property of rotational invariance within graph convolution neural networks (graph CNNs) for learning on images. Standard CNNs possess the property of translational invariance due to the sliding nature of the convolution operation and rotational invariance for small angles, due to the pooling operation. But there is still a need for a CNN model which is able to learn truly rotationally invariant filters so that it is able to perform well on image datasets where the images preserve their identity irrespective of their orientation. Graphs are known to be isotropic in nature and hence, their edges do not have a sense of direction. This project leverages that property to prove that if images are converted into grid graphs and then graph CNNs are used to learn spectral filters over them, then those filters are rotationally invariant. A couple of architectures based on Lenet-5 and VGG16 respectively, were used to conduct comparative experiments between graph CNNs and standard CNNs and it was found that graph CNNs indeed showed much more invariance to rotation than the standard CNNs. The experiments were conducted on the MNIST and CIFAR-10 datasets and it was observed that graph CNNs were able to learn rotationally invariant spectral filters for both the datasets. Furthermore, it was seen that deeper architectures were able to encode more rotational invariance within their learned filters as compared to shallower ones.
author2 Xavier Bresson
author_facet Xavier Bresson
Singh, Priyanshu Kumar
format Final Year Project
author Singh, Priyanshu Kumar
author_sort Singh, Priyanshu Kumar
title Assessing rotational invariance of graph convolution neural networks for computer vision
title_short Assessing rotational invariance of graph convolution neural networks for computer vision
title_full Assessing rotational invariance of graph convolution neural networks for computer vision
title_fullStr Assessing rotational invariance of graph convolution neural networks for computer vision
title_full_unstemmed Assessing rotational invariance of graph convolution neural networks for computer vision
title_sort assessing rotational invariance of graph convolution neural networks for computer vision
publishDate 2018
url http://hdl.handle.net/10356/74246
_version_ 1759852954685276160