Modularized zero-shot VQA with pre-trained models

Modularized zero-shot VQA with pre-trained models

Large-scale pre-trained models (PTMs) show great zero-shot capabilities. In this paper, we study how to leverage them for zero-shot visual question answering (VQA).Our approach is motivated by a few observations. First, VQA questions often require multiple steps of reasoning, which is still a capabi...

Full description

Saved in:

Bibliographic Details
Main Authors:	CAO, Rui, JIANG, Jing
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2023
Subjects:	Computational linguistics Zero-shot learning Object detection Artificial Intelligence and Robotics
Online Access:	https://ink.library.smu.edu.sg/sis_research/8307 https://ink.library.smu.edu.sg/context/sis_research/article/9310/viewcontent/ACL_Findings_Camera_Ready.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

Similar Items

Knowledge generation for zero-shot knowledge-based VQA
by: CAO, Rui, et al.
Published: (2024)

Zero-shot out-of-distribution detection with outlier label exposure
by: DING, Choubo, et al.
Published: (2024)

Zero-shot object counting with good exemplars
by: ZHU, Huilin, et al.
Published: (2024)

Expandable-RCNN: toward high-efficiency incremental few-shot object detection
by: Li, Y, et al.
Published: (2024)

Learning adversarial semantic embeddings for zero-shot recognition in open worlds
by: LI, Tianqi, et al.
Published: (2024)

FEW-SHOT IMAGE RECOGNITION AND OBJECT DETECTION
by: LI YITING
Published: (2023)

Contextual human object interaction understanding from pre-trained large language model
by: Gao ,Jianjun, et al.
Published: (2025)

Improving zero-shot learning baselines with commonsense knowledge
by: Roy, Abhinaba, et al.
Published: (2023)

AnomalyCLIP: Object-agnostic prompt learning for zero-shot anomaly detection
by: ZHOU, Qihang, et al.
Published: (2024)

Zero-shot learning via category-specific visual-semantic mapping and label refinement
by: Niu, Li, et al.
Published: (2020)

Meta-transfer learning for few-shot learning
by: SUN, Qianru, et al.
Published: (2019)

CHRONOS: Time-aware zero-shot identification of libraries from vulnerability reports
by: LYU, Yunbo, et al.
Published: (2023)

Prompt to be consistent is better than self-consistent? Few-shot and zero-shot fact verification with pre-trained language models
by: ZENG, Fengzhu, et al.
Published: (2023)

Learning to self-train for semi-supervised few-shot classification
by: LI, Xinzhe, et al.
Published: (2019)

Virtual prompt pre-training for prototype-based few-shot relation extraction
by: He, Kai, et al.
Published: (2023)

Holistically associated transductive zero-shot learning
by: XU, Yangyang, et al.
Published: (2022)

PNPDet : efficient few-shot detection without forgetting via Plug-and-Play sub-networks
by: Zhang, Gongjie, et al.
Published: (2021)

Zero-shot text classification via self-supervised tuning
by: Liu, Chaoqun, et al.
Published: (2023)

Transductive zero-shot action recognition via visually connected graph convolutional networks
by: XU, Yangyang, et al.
Published: (2021)

Relative and absolute location embedding for few-shot node classification on graph
by: LIU, Zemin, et al.
Published: (2021)

Context-aware adapter tuning for few-shot relation learning in knowledge graphs
by: LIU, Ran, et al.
Published: (2024)

ROME: Evaluating pre-trained vision-language models on reasoning beyond visual common sense
by: ZHOU, Kankan, et al.
Published: (2023)

NumGPT: Improving numeracy ability of generative pre-trained models
by: JIN, Zhihua, et al.
Published: (2023)

Zero-shot ingredient recognition by multi-relational graph convolutional network
by: CHEN, Jingjing, et al.
Published: (2020)

Shot change detection using scene-based constraint
by: Cheong, L.-F., et al.
Published: (2014)

Shot change detection using scene-based constraint
by: Cheong, L.-F., et al.
Published: (2014)

SINet: A scale-insensitive convolutional neural network for fast vehicle detection
by: HU, Xiaowei, et al.
Published: (2019)

Learning to Self-Train for Semi-Supervised Few-Shot Classification
by: Xinzhe Li, et al.
Published: (2020)

Counterfactual zero-shot and open-set visual recognition
by: YUE, Zhongqi, et al.
Published: (2021)

Few-shot vision recognition and generation for the open-world
by: Song, Nan
Published: (2024)

INTELLIGENT MULTI SHOT MOLD-PARTING DESIGN
by: LI MING
Published: (2010)

Few-shot learning in Wi-Fi-based indoor positioning
by: Xie, Feng, et al.
Published: (2024)

Toward generalist anomaly detection via in-context residual learning with few-shot sample prompts
by: ZHU, Jiawen, et al.
Published: (2024)

Unlocking the capabilities of explainable few‑shot learning in remote sensing
by: Lee, Gao Yu, et al.
Published: (2024)

Meta-transfer learning through hard tasks
by: SUN, Qianru, et al.
Published: (2022)

ZeroBN : learning compact neural networks for latency-critical edge systems
by: Huai, Shuo, et al.
Published: (2022)

Semantic reasoning in zero example video event retrieval
by: DE BOER, M. H. T., et al.
Published: (2017)

Terrace-based food counting and segmentation
by: NGUYEN, Huu-Thanh, et al.
Published: (2021)

Surface integrity study of shot peened IN718
by: Nur Harith Bin Sazeli
Published: (2024)

Multiobjective linear ensembles for robust and sparse training of few-bit neural networks
by: BERNARDELLI, Ambrogio Maria, et al.
Published: (2024)