Open resource-aided image analytics

Image description is currently an active research field. Most image description generation networks are trained on a single data set and then used to describe input images. However, because different data sets have different distributions, the network trained o...

Bibliographic Details
Main Author: Zheng, Weihua
Other Authors: Mao Kezhi
Format: Theses and Dissertations
Language: English
Published: 2019
Subjects: Engineering::Electrical and electronic engineering
Online Access: http://hdl.handle.net/10356/78706
Institution: Nanyang Technological University
id sg-ntu-dr.10356-78706
record_format dspace
spelling sg-ntu-dr.10356-78706 2023-07-04T16:07:20Z Open resource-aided image analytics Zheng, Weihua Mao Kezhi School of Electrical and Electronic Engineering Engineering::Electrical and electronic engineering Image description is currently an active research field. Most image description generation networks are trained on a single data set and then used to describe input images. However, because different data sets have different distributions, a network trained on one data set rarely performs well on another. The main purpose of this project is to improve descriptions on the current data set by using additional information available on the network. In this way, the performance of any local network on different data sets can be improved. We have also added an adaptive attention mechanism to the LSTM network. Whenever the network is about to generate a word, this adaptive mechanism can be used to determine whether or not to consider the image features. This mechanism can make the sentences generated by the local network more reasonable and more consistent with the image content. Master of Science (Signal Processing) 2019-06-26T01:53:13Z 2019-06-26T01:53:13Z 2019 Thesis http://hdl.handle.net/10356/78706 en 89 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
spellingShingle Engineering::Electrical and electronic engineering
Zheng, Weihua
Open resource-aided image analytics
description Image description is currently an active research field. Most image description generation networks are trained on a single data set and then used to describe input images. However, because different data sets have different distributions, a network trained on one data set rarely performs well on another. The main purpose of this project is to improve descriptions on the current data set by using additional information available on the network. In this way, the performance of any local network on different data sets can be improved. We have also added an adaptive attention mechanism to the LSTM network. Whenever the network is about to generate a word, this adaptive mechanism can be used to determine whether or not to consider the image features. This mechanism can make the sentences generated by the local network more reasonable and more consistent with the image content.
author2 Mao Kezhi
author_facet Mao Kezhi
Zheng, Weihua
format Theses and Dissertations
author Zheng, Weihua
author_sort Zheng, Weihua
title Open resource-aided image analytics
title_sort open resource-aided image analytics
publishDate 2019
url http://hdl.handle.net/10356/78706
_version_ 1772828603708342272