Neural image and video captioning (NIVC)
A common problem linking computer vision and natural language processing is the ability to generate accurate captioning for a given image. Researchers have spent decades trying to perfect the state of art image captioning. In this paper, various approaches of image captioning models towards ach...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/156511 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-156511 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1565112022-04-19T06:07:35Z Neural image and video captioning (NIVC) Lee, Jeremy Kian Kiat Zhang Hanwang School of Computer Science and Engineering hanwangzhang@ntu.edu.sg Engineering::Computer science and engineering A common problem linking computer vision and natural language processing is the ability to generate accurate captioning for a given image. Researchers have spent decades trying to perfect the state of art image captioning. In this paper, various approaches of image captioning models towards achieving a state of the art results are studied. After the various approaches are studied, the best approaches are then extracted and then recombined into a new single model in hopes of achieving a new state of the art model. Furthermore, this paper proposes a sharing platform that allows users to apply the prediction model built as a real-world use case. Live captioning is proposed to utilize the inceptionV4 model to provide a description of an image. The platform comes in the form of a mobile application and is equipped with valuable functionalities to caption an image and share the inspiration on the free platform for different individuals to exchange their ideas Bachelor of Engineering (Computer Science) 2022-04-19T06:07:35Z 2022-04-19T06:07:35Z 2022 Final Year Project (FYP) Lee, J. K. K. (2022). Neural image and video captioning (NIVC). Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156511 https://hdl.handle.net/10356/156511 en SCSE21-0520 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering |
spellingShingle |
Engineering::Computer science and engineering Lee, Jeremy Kian Kiat Neural image and video captioning (NIVC) |
description |
A common problem linking computer vision and natural language processing is the ability to
generate accurate captioning for a given image. Researchers have spent decades trying to
perfect the state of art image captioning.
In this paper, various approaches of image captioning models towards achieving a state of the
art results are studied. After the various approaches are studied, the best approaches are then
extracted and then recombined into a new single model in hopes of achieving a new state of
the art model.
Furthermore, this paper proposes a sharing platform that allows users to apply the prediction
model built as a real-world use case. Live captioning is proposed to utilize the inceptionV4
model to provide a description of an image. The platform comes in the form of a mobile
application and is equipped with valuable functionalities to caption an image and share the
inspiration on the free platform for different individuals to exchange their ideas |
author2 |
Zhang Hanwang |
author_facet |
Zhang Hanwang Lee, Jeremy Kian Kiat |
format |
Final Year Project |
author |
Lee, Jeremy Kian Kiat |
author_sort |
Lee, Jeremy Kian Kiat |
title |
Neural image and video captioning (NIVC) |
title_short |
Neural image and video captioning (NIVC) |
title_full |
Neural image and video captioning (NIVC) |
title_fullStr |
Neural image and video captioning (NIVC) |
title_full_unstemmed |
Neural image and video captioning (NIVC) |
title_sort |
neural image and video captioning (nivc) |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/156511 |
_version_ |
1731235734727163904 |