Image similarity assessment based on Siamese network

In recent years, the deep learning algorithm has made breakthroughs in the field of image processing. It extracts the externally input data such as sound, image and text from low-level to high-level features by simulating and building the hierarchical architecture of the human brain. Therefore, the...

Full description

Saved in:
Bibliographic Details
Main Author: Chen, Cheng
Other Authors: Chen Lihui
Format: Theses and Dissertations
Language:English
Published: 2019
Subjects:
Online Access:http://hdl.handle.net/10356/78603
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-78603
record_format dspace
spelling sg-ntu-dr.10356-786032023-07-04T16:15:52Z Image similarity assessment based on Siamese network Chen, Cheng Chen Lihui School of Electrical and Electronic Engineering DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision In recent years, the deep learning algorithm has made breakthroughs in the field of image processing. It extracts the externally input data such as sound, image and text from low-level to high-level features by simulating and building the hierarchical architecture of the human brain. Therefore, the result is more accurate and closes to the semantic features. Convolutional Neural Network(CNN) is the most widely used deep learning model with high recognition rate in the field of image processing. It performs convolution with the image pixels by trained kernels and can extract specific feature automatically. Besides, its weights sharing characteristic and pooling operation greatly reduce the parameters of the model and improve the training efficiency. The CNN model in this project is based on the VGG-like network, which is a classic CNN model and uses multiple stacked small kernels to extract the image feature with high accuracy. The conventional VGG model is simplified to reduce the training parameters in the experiment. Furthermore, Spatial Pyramid Pooling layer and dropout technique are combined with the simplified VGG model to increase the performance. Siamese network is a special neural network model which consists of two CNN branches with shared weights. The input of Siamese network is a pair of images and the two CNN branches extract the feature vectors and reduce the dimension of the image pair. Then, the distance of two vectors is calculated to assess the similarity of two images. In this project, the distance calculation and loss function is improved comparing to the original model. The experiment network model is built by TensorFlow with the API of Python. The dataset used for the experimental study is the screenshots of more than 10,000 APPs' interface. The APPs with similar interface screenshot can be considered as one category. Therefore, the similarity assessment model can be used to cluster different APPs. In this project, the loss curve of training and the accuracy curve are used as evaluation methods for the model. Experiments show that the model achieves acceptable performance on the given validation dataset. Master of Science (Signal Processing) 2019-06-24T06:02:47Z 2019-06-24T06:02:47Z 2019 Thesis http://hdl.handle.net/10356/78603 en 75 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
spellingShingle DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Chen, Cheng
Image similarity assessment based on Siamese network
description In recent years, the deep learning algorithm has made breakthroughs in the field of image processing. It extracts the externally input data such as sound, image and text from low-level to high-level features by simulating and building the hierarchical architecture of the human brain. Therefore, the result is more accurate and closes to the semantic features. Convolutional Neural Network(CNN) is the most widely used deep learning model with high recognition rate in the field of image processing. It performs convolution with the image pixels by trained kernels and can extract specific feature automatically. Besides, its weights sharing characteristic and pooling operation greatly reduce the parameters of the model and improve the training efficiency. The CNN model in this project is based on the VGG-like network, which is a classic CNN model and uses multiple stacked small kernels to extract the image feature with high accuracy. The conventional VGG model is simplified to reduce the training parameters in the experiment. Furthermore, Spatial Pyramid Pooling layer and dropout technique are combined with the simplified VGG model to increase the performance. Siamese network is a special neural network model which consists of two CNN branches with shared weights. The input of Siamese network is a pair of images and the two CNN branches extract the feature vectors and reduce the dimension of the image pair. Then, the distance of two vectors is calculated to assess the similarity of two images. In this project, the distance calculation and loss function is improved comparing to the original model. The experiment network model is built by TensorFlow with the API of Python. The dataset used for the experimental study is the screenshots of more than 10,000 APPs' interface. The APPs with similar interface screenshot can be considered as one category. Therefore, the similarity assessment model can be used to cluster different APPs. In this project, the loss curve of training and the accuracy curve are used as evaluation methods for the model. Experiments show that the model achieves acceptable performance on the given validation dataset.
author2 Chen Lihui
author_facet Chen Lihui
Chen, Cheng
format Theses and Dissertations
author Chen, Cheng
author_sort Chen, Cheng
title Image similarity assessment based on Siamese network
title_short Image similarity assessment based on Siamese network
title_full Image similarity assessment based on Siamese network
title_fullStr Image similarity assessment based on Siamese network
title_full_unstemmed Image similarity assessment based on Siamese network
title_sort image similarity assessment based on siamese network
publishDate 2019
url http://hdl.handle.net/10356/78603
_version_ 1772827201798930432