Neural image and video captioning (NIVC)

A common problem linking computer vision and natural language processing is the ability to generate accurate captioning for a given image. Researchers have spent decades trying to perfect the state of art image captioning. In this paper, various approaches of image captioning models towards ach...

Full description

Saved in:
Bibliographic Details
Main Author: Lee, Jeremy Kian Kiat
Other Authors: Zhang Hanwang
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/156511
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-156511
record_format dspace
spelling sg-ntu-dr.10356-1565112022-04-19T06:07:35Z Neural image and video captioning (NIVC) Lee, Jeremy Kian Kiat Zhang Hanwang School of Computer Science and Engineering hanwangzhang@ntu.edu.sg Engineering::Computer science and engineering A common problem linking computer vision and natural language processing is the ability to generate accurate captioning for a given image. Researchers have spent decades trying to perfect the state of art image captioning. In this paper, various approaches of image captioning models towards achieving a state of the art results are studied. After the various approaches are studied, the best approaches are then extracted and then recombined into a new single model in hopes of achieving a new state of the art model. Furthermore, this paper proposes a sharing platform that allows users to apply the prediction model built as a real-world use case. Live captioning is proposed to utilize the inceptionV4 model to provide a description of an image. The platform comes in the form of a mobile application and is equipped with valuable functionalities to caption an image and share the inspiration on the free platform for different individuals to exchange their ideas Bachelor of Engineering (Computer Science) 2022-04-19T06:07:35Z 2022-04-19T06:07:35Z 2022 Final Year Project (FYP) Lee, J. K. K. (2022). Neural image and video captioning (NIVC). Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156511 https://hdl.handle.net/10356/156511 en SCSE21-0520 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
spellingShingle Engineering::Computer science and engineering
Lee, Jeremy Kian Kiat
Neural image and video captioning (NIVC)
description A common problem linking computer vision and natural language processing is the ability to generate accurate captioning for a given image. Researchers have spent decades trying to perfect the state of art image captioning. In this paper, various approaches of image captioning models towards achieving a state of the art results are studied. After the various approaches are studied, the best approaches are then extracted and then recombined into a new single model in hopes of achieving a new state of the art model. Furthermore, this paper proposes a sharing platform that allows users to apply the prediction model built as a real-world use case. Live captioning is proposed to utilize the inceptionV4 model to provide a description of an image. The platform comes in the form of a mobile application and is equipped with valuable functionalities to caption an image and share the inspiration on the free platform for different individuals to exchange their ideas
author2 Zhang Hanwang
author_facet Zhang Hanwang
Lee, Jeremy Kian Kiat
format Final Year Project
author Lee, Jeremy Kian Kiat
author_sort Lee, Jeremy Kian Kiat
title Neural image and video captioning (NIVC)
title_short Neural image and video captioning (NIVC)
title_full Neural image and video captioning (NIVC)
title_fullStr Neural image and video captioning (NIVC)
title_full_unstemmed Neural image and video captioning (NIVC)
title_sort neural image and video captioning (nivc)
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/156511
_version_ 1731235734727163904