VISUAL QUESTION ANSWERING REASONING SYNTHETIC DATA GENERATION USING LARGE VISION LANGUAGE MODEL
In the realm of Visual Question Answering (VQA), a substantial amount of data with reasoning aspects is required to ensure the development of systems capable of generating rational and reliable outputs. However, the large resources needed to create VQA reasoning data have driven the exploration o...
Saved in:
Main Author: | Amadeus Irawan, Patrick |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/86165 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
Similar Items
-
TRRNet : tiered relation reasoning for compositional visual question answering
by: Yang, Xiaofeng, et al.
Published: (2020) -
T-SciQ: Teaching multimodal Chain-of-Thought reasoning via large language model signals for science question answering
by: WANG, Lei, et al.
Published: (2024) -
Visual questioning and answering
by: Ong, Zavier Jian Le
Published: (2024) -
Question answer system for numerical reasoning in finance
by: Kothari, Khush Milan
Published: (2023) -
Debiasing visual question and answering with answer preference
by: Zhang, Xinye
Published: (2020)