Modularized zero-shot VQA with pre-trained models
Large-scale pre-trained models (PTMs) show great zero-shot capabilities. In this paper, we study how to leverage them for zero-shot visual question answering (VQA).Our approach is motivated by a few observations. First, VQA questions often require multiple steps of reasoning, which is still a capabi...
Saved in:
Main Authors: | CAO, Rui, JIANG, Jing |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2023
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/8307 https://ink.library.smu.edu.sg/context/sis_research/article/9310/viewcontent/ACL_Findings_Camera_Ready.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
Similar Items
-
Knowledge generation for zero-shot knowledge-based VQA
by: CAO, Rui, et al.
Published: (2024) -
Zero-shot out-of-distribution detection with outlier label exposure
by: DING, Choubo, et al.
Published: (2024) -
Zero-shot object counting with good exemplars
by: ZHU, Huilin, et al.
Published: (2024) -
Learning adversarial semantic embeddings for zero-shot recognition in open worlds
by: LI, Tianqi, et al.
Published: (2024) -
FEW-SHOT IMAGE RECOGNITION AND OBJECT DETECTION
by: LI YITING
Published: (2023)