Modularized zero-shot VQA with pre-trained models

Large-scale pre-trained models (PTMs) show great zero-shot capabilities. In this paper, we study how to leverage them for zero-shot visual question answering (VQA).Our approach is motivated by a few observations. First, VQA questions often require multiple steps of reasoning, which is still a capabi...

Full description

Saved in:
Bibliographic Details
Main Authors: CAO, Rui, JIANG, Jing
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8307
https://ink.library.smu.edu.sg/context/sis_research/article/9310/viewcontent/ACL_Findings_Camera_Ready.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English