Visual questioning and answering
With the rising trend of artificial intelligence and machine learning, more and more intelligent tasks thought to be previously impossible are now feasible and capable to be implemented and automated by machines. This project will look into the implementation and automation of one such task, a Visua...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/175465 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-175465 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1754652024-04-26T15:45:16Z Visual questioning and answering Ong, Zavier Jian Le Hanwang Zhang School of Computer Science and Engineering hanwangzhang@ntu.edu.sg Computer and Information Science Visual question and answering Computer science Engineering With the rising trend of artificial intelligence and machine learning, more and more intelligent tasks thought to be previously impossible are now feasible and capable to be implemented and automated by machines. This project will look into the implementation and automation of one such task, a Visual Question and Answering system. The system accepts a query in the form of a natural language and an input image, and outputs a response similarly in the form of a natural language. This system demonstrates the challenges of computer vision and natural language processing tied to work together harmoniously to produce an accurate output. Thus, this project will showcase an architectural design of this system as well as the implementation method used to demo the aforementioned. Bachelor's degree 2024-04-24T08:03:25Z 2024-04-24T08:03:25Z 2024 Final Year Project (FYP) Ong, Z. J. L. (2024). Visual questioning and answering. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175465 https://hdl.handle.net/10356/175465 en SCSE23-0216 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Computer and Information Science Visual question and answering Computer science Engineering |
spellingShingle |
Computer and Information Science Visual question and answering Computer science Engineering Ong, Zavier Jian Le Visual questioning and answering |
description |
With the rising trend of artificial intelligence and machine learning, more and more intelligent tasks thought to be previously impossible are now feasible and capable to be implemented and automated by machines. This project will look into the implementation and automation of one such task, a Visual Question and Answering system. The system accepts a query in the form of a natural language and an input image, and outputs a response similarly in the form of a natural language. This system demonstrates the challenges of computer vision and natural language processing tied to work together harmoniously to produce an accurate output. Thus, this project will showcase an architectural design of this system as well as the implementation method used to demo the aforementioned. |
author2 |
Hanwang Zhang |
author_facet |
Hanwang Zhang Ong, Zavier Jian Le |
format |
Final Year Project |
author |
Ong, Zavier Jian Le |
author_sort |
Ong, Zavier Jian Le |
title |
Visual questioning and answering |
title_short |
Visual questioning and answering |
title_full |
Visual questioning and answering |
title_fullStr |
Visual questioning and answering |
title_full_unstemmed |
Visual questioning and answering |
title_sort |
visual questioning and answering |
publisher |
Nanyang Technological University |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/175465 |
_version_ |
1800916138128834560 |