Visual questioning and answering

With the rise of artificial intelligence and machine learning, a growing number of tasks previously thought impossible for machines can now be implemented and automated. This project examines the implementation and automation of one such task, a Visua...

Bibliographic Details
Main Author: Ong, Zavier Jian Le
Other Authors: Hanwang Zhang
Format: Final Year Project
Language: English
Published: Nanyang Technological University 2024
Subjects: Computer and Information Science; Visual question and answering; Computer science; Engineering
Online Access: https://hdl.handle.net/10356/175465
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-175465
record_format dspace
spelling sg-ntu-dr.10356-175465 2024-04-26T15:45:16Z Visual questioning and answering Ong, Zavier Jian Le Hanwang Zhang School of Computer Science and Engineering hanwangzhang@ntu.edu.sg Computer and Information Science Visual question and answering Computer science Engineering With the rise of artificial intelligence and machine learning, a growing number of tasks previously thought impossible for machines can now be implemented and automated. This project examines the implementation and automation of one such task, a Visual Question and Answering system. The system accepts a natural-language query and an input image, and outputs a response that is likewise in natural language. The system demonstrates the challenge of making computer vision and natural language processing work together to produce an accurate output. The project therefore presents an architectural design for the system and the implementation method used to demonstrate it. Bachelor's degree 2024-04-24T08:03:25Z 2024-04-24T08:03:25Z 2024 Final Year Project (FYP) Ong, Z. J. L. (2024). Visual questioning and answering. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175465 https://hdl.handle.net/10356/175465 en SCSE23-0216 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Visual question and answering
Computer science
Engineering
spellingShingle Computer and Information Science
Visual question and answering
Computer science
Engineering
Ong, Zavier Jian Le
Visual questioning and answering
description With the rise of artificial intelligence and machine learning, a growing number of tasks previously thought impossible for machines can now be implemented and automated. This project examines the implementation and automation of one such task, a Visual Question and Answering system. The system accepts a natural-language query and an input image, and outputs a response that is likewise in natural language. The system demonstrates the challenge of making computer vision and natural language processing work together to produce an accurate output. The project therefore presents an architectural design for the system and the implementation method used to demonstrate it.
author2 Hanwang Zhang
author_facet Hanwang Zhang
Ong, Zavier Jian Le
format Final Year Project
author Ong, Zavier Jian Le
author_sort Ong, Zavier Jian Le
title Visual questioning and answering
title_short Visual questioning and answering
title_full Visual questioning and answering
title_fullStr Visual questioning and answering
title_full_unstemmed Visual questioning and answering
title_sort visual questioning and answering
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/175465
_version_ 1800916138128834560