Assessing rotational invariance of graph convolution neural networks for computer vision

This project aims to assess the property of rotational invariance within graph convolution neural networks (graph CNNs) for learning on images. Standard CNNs possess the property of translational invariance due to the sliding nature of the convolution operation and rotational invariance for small an...

全面介紹

Saved in:
書目詳細資料
主要作者: Singh, Priyanshu Kumar
其他作者: Xavier Bresson
格式: Final Year Project
語言:English
出版: 2018
主題:
在線閱讀:http://hdl.handle.net/10356/74246
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Nanyang Technological University
語言: English
id sg-ntu-dr.10356-74246
record_format dspace
spelling sg-ntu-dr.10356-742462023-03-03T20:33:44Z Assessing rotational invariance of graph convolution neural networks for computer vision Singh, Priyanshu Kumar Xavier Bresson School of Computer Science and Engineering DRNTU::Engineering This project aims to assess the property of rotational invariance within graph convolution neural networks (graph CNNs) for learning on images. Standard CNNs possess the property of translational invariance due to the sliding nature of the convolution operation and rotational invariance for small angles, due to the pooling operation. But there is still a need for a CNN model which is able to learn truly rotationally invariant filters so that it is able to perform well on image datasets where the images preserve their identity irrespective of their orientation. Graphs are known to be isotropic in nature and hence, their edges do not have a sense of direction. This project leverages that property to prove that if images are converted into grid graphs and then graph CNNs are used to learn spectral filters over them, then those filters are rotationally invariant. A couple of architectures based on Lenet-5 and VGG16 respectively, were used to conduct comparative experiments between graph CNNs and standard CNNs and it was found that graph CNNs indeed showed much more invariance to rotation than the standard CNNs. The experiments were conducted on the MNIST and CIFAR-10 datasets and it was observed that graph CNNs were able to learn rotationally invariant spectral filters for both the datasets. Furthermore, it was seen that deeper architectures were able to encode more rotational invariance within their learned filters as compared to shallower ones. Bachelor of Engineering (Computer Science) 2018-05-14T05:23:33Z 2018-05-14T05:23:33Z 2018 Final Year Project (FYP) http://hdl.handle.net/10356/74246 en Nanyang Technological University 55 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering
spellingShingle DRNTU::Engineering
Singh, Priyanshu Kumar
Assessing rotational invariance of graph convolution neural networks for computer vision
description This project aims to assess the property of rotational invariance within graph convolution neural networks (graph CNNs) for learning on images. Standard CNNs possess the property of translational invariance due to the sliding nature of the convolution operation and rotational invariance for small angles, due to the pooling operation. But there is still a need for a CNN model which is able to learn truly rotationally invariant filters so that it is able to perform well on image datasets where the images preserve their identity irrespective of their orientation. Graphs are known to be isotropic in nature and hence, their edges do not have a sense of direction. This project leverages that property to prove that if images are converted into grid graphs and then graph CNNs are used to learn spectral filters over them, then those filters are rotationally invariant. A couple of architectures based on Lenet-5 and VGG16 respectively, were used to conduct comparative experiments between graph CNNs and standard CNNs and it was found that graph CNNs indeed showed much more invariance to rotation than the standard CNNs. The experiments were conducted on the MNIST and CIFAR-10 datasets and it was observed that graph CNNs were able to learn rotationally invariant spectral filters for both the datasets. Furthermore, it was seen that deeper architectures were able to encode more rotational invariance within their learned filters as compared to shallower ones.
author2 Xavier Bresson
author_facet Xavier Bresson
Singh, Priyanshu Kumar
format Final Year Project
author Singh, Priyanshu Kumar
author_sort Singh, Priyanshu Kumar
title Assessing rotational invariance of graph convolution neural networks for computer vision
title_short Assessing rotational invariance of graph convolution neural networks for computer vision
title_full Assessing rotational invariance of graph convolution neural networks for computer vision
title_fullStr Assessing rotational invariance of graph convolution neural networks for computer vision
title_full_unstemmed Assessing rotational invariance of graph convolution neural networks for computer vision
title_sort assessing rotational invariance of graph convolution neural networks for computer vision
publishDate 2018
url http://hdl.handle.net/10356/74246
_version_ 1759852954685276160