Open resource-aided image analytics
Image description is currently a hot research field. Most image description generation networks use only a certain data set to train a neural network, and then use the neural network to describe the input image. However, due to the different distribution of different data sets, the network trained o...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Theses and Dissertations |
Language: | English |
Published: |
2019
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/78706 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-78706 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-787062023-07-04T16:07:20Z Open resource-aided image analytics Zheng, Weihua Mao Kezhi School of Electrical and Electronic Engineering Engineering::Electrical and electronic engineering Image description is currently a hot research field. Most image description generation networks use only a certain data set to train a neural network, and then use the neural network to describe the input image. However, due to the different distribution of different data sets, the network trained on one training set is difficult to perform well on another data set. The main purpose of this project is to improve the description of the current data set by using additional information on the network. The performance of any local network on different data sets can be improved. At the same time, we have added an adaptive attention mechanism to the LSTM network. Whenever a neural network wants to generate a word, this adaptive mechanism can be used to determine whether or not to consider the characteristics of the image. This mechanism can make the statements generated by the local network more reasonable and conform to the image content. Master of Science (Signal Processing) 2019-06-26T01:53:13Z 2019-06-26T01:53:13Z 2019 Thesis http://hdl.handle.net/10356/78706 en 89 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering |
spellingShingle |
Engineering::Electrical and electronic engineering Zheng, Weihua Open resource-aided image analytics |
description |
Image description is currently a hot research field. Most image description generation networks use only a certain data set to train a neural network, and then use the neural network to describe the input image. However, due to the different distribution of different data sets, the network trained on one training set is difficult to perform well on another data set. The main purpose of this project is to improve the description of the current data set by using additional information on the network. The performance of any local network on different data sets can be improved. At the same time, we have added an adaptive attention mechanism to the LSTM network. Whenever a neural network wants to generate a word, this adaptive mechanism can be used to determine whether or not to consider the characteristics of the image. This mechanism can make the statements generated by the local network more reasonable and conform to the image content. |
author2 |
Mao Kezhi |
author_facet |
Mao Kezhi Zheng, Weihua |
format |
Theses and Dissertations |
author |
Zheng, Weihua |
author_sort |
Zheng, Weihua |
title |
Open resource-aided image analytics |
title_short |
Open resource-aided image analytics |
title_full |
Open resource-aided image analytics |
title_fullStr |
Open resource-aided image analytics |
title_full_unstemmed |
Open resource-aided image analytics |
title_sort |
open resource-aided image analytics |
publishDate |
2019 |
url |
http://hdl.handle.net/10356/78706 |
_version_ |
1772828603708342272 |