Declaration-based prompt tuning for visual question answering

Declaration-based prompt tuning for visual question answering

In recent years, the pre-training-then-fine-tuning paradigm has yielded immense success on a wide spectrum of cross-modal tasks, such as visual question answering (VQA), in which a visual-language (VL) model is first optimized via self-supervised task objectives, e.g., masked language modeling (MLM)...

Full description

Saved in:

Bibliographic Details
Main Authors:	LIU, Yuhang, WEI, Wei, ZHU, Feida
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2022
Subjects:	Machine Learning: Multi-modal learning Computer Vision: Transfer low-shot semi- and un- supervised learning Computer Vision: Vision and language Natural Language Processing: Question Answering Databases and Information Systems
Online Access:	https://ink.library.smu.edu.sg/sis_research/7752 https://ink.library.smu.edu.sg/context/sis_research/article/8755/viewcontent/declaration.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

Similar Items

Discovering high quality answers in community question answering archives using a hierarchy of classifiers
by: Toba, H., et al.
Published: (2014)

Question answering using evolving networks
by: See, Solomon Lim, et al.
Published: (2004)

Knowledge base question answering with topic units
by: LAN, Yunshi, et al.
Published: (2019)

ON ANNOTATION EFFICIENT LEARNING FOR COMPUTER VISION TASKS AND ITS APPLICATION ON MEDICAL IMAGE DATASETS
by: ATIN GHOSH
Published: (2022)

Retrieving questions and answers in community-based question answering services
by: WANG KAI
Published: (2011)

Joint learning of answer selection and answer summary generation in community question answering
by: DENG, Yang, et al.
Published: (2020)

CHALLENGES OF LEARNING UNDER DIFFERENT LEVELS OF SUPERVISION FOR IMAGES AND VIDEOS
by: RAHUL RAHAMAN
Published: (2023)

QUESTION ANSWERING USING DEEP NEURAL NETWORKS: SINGLE TURN AND BEYOND
by: SOUVIK KUNDU
Published: (2020)

Unifying text, tables, and images for multimodal question answering
by: LUO, Haohao, et al.
Published: (2023)

Answers or no answers : studying question answerability in stack overflow
by: Chua, Alton Yeow Kuan, et al.
Published: (2020)

Who You Are Decides How You Tell
by: WU SHUANG, et al.
Published: (2020)

SAMPLE EFFICIENT REPRESENTATION LEARNING FOR VISUAL RECOGNITION
by: KANG BINGYI
Published: (2021)

Aggregated community question answering
by: Snehasish Banerjee, et al.
Published: (2015)

Multimedia question answering
by: NIE LIQIANG
Published: (2013)

Deep-learning-based automated building construction progress monitoring for prefabricated prefinished volumetric construction
by: Chua, Wei Png, et al.
Published: (2025)

Technical Q8A site answer recommendation via question boosting
by: GAO, Zhipeng, et al.
Published: (2021)

Dynamic fusion with intra-and inter-modality attention flow for visual question answering
by: GAO, Peng, et al.
Published: (2019)

Resource-efficient learning for vision-capable neural models
by: Tiong, Anthony Meng Huat
Published: (2024)

Applying semantic analysis to finding similar questions in community question answering systems
by: NGUYEN LE NGUYEN
Published: (2010)

Visual questioning and answering
by: Ong, Zavier Jian Le
Published: (2024)

From text question-answering to multimedia QA on web-scale media resources
by: Chua, T.-S., et al.
Published: (2013)

TOWARDS AUTOMATED AND ANNOTATION-EFFICIENT MEDICAL IMAGE ANALYSIS
by: ZHU LEI
Published: (2022)

Synthesis of annotated images as dataset for vehicle counting neural networks using semi-supervised learning
by: Chan, Patrick Matthew J.
Published: (2022)

Learning to teach and learn for semi-supervised few-shot image classification
by: LI, Xinzhe, et al.
Published: (2021)

Modelling domain relationships for transfer learning on retrieval-based question answering systems in e-commerce
by: YU, Jianfei, et al.
Published: (2018)

Multi-hop knowledge base question answering with an iterative sequence matching model
by: LAN, Yunshi, et al.
Published: (2019)

IMAGE INFORMATION LOAD AND ONLINE SALES
by: ZHANG KANGHUA
Published: (2023)

Interesting nuggets and their impact on definitional question answering
by: Kor, K.-W., et al.
Published: (2013)

Segmentation of multi-sentence questions: Towards effective question retrieval in cQA services
by: Wang, K., et al.
Published: (2013)

Soft matching for question answering
by: CUI HANG
Published: (2010)

TOWARDS ADVERSARIAL ROBUSTNESS OF DEEP VISION ALGORITHMS
by: YAN HANSHU
Published: (2022)

Soft pattern matching models for definitional question answering
by: Cui, H., et al.
Published: (2013)

Video reference: A video question answering engine
by: Gao, L., et al.
Published: (2013)

Complex knowledge base question answering: A survey
by: LAN, Yunshi, et al.
Published: (2023)

Quality-aware collaborative Question Answering: Methods and evaluation
by: SURYANTO, Maggy Anastasia, et al.
Published: (2009)

LOVA3 : Learning to visual question answering, asking and assessment
by: ZHAO, Henry Hengyuan, et al.
Published: (2024)

Robotic grasping of novel objects based on a feature detection algorithm trained on minimal data
by: Khor, Kai Sherng
Published: (2024)

FACIAL LANDMARK DETECTION TOWARDS ROBUSTNESS
by: XIAO SHENGTAO
Published: (2017)

Video reference: Question answering on YouTube
by: Li, G., et al.
Published: (2013)

Using pre-trained models for vision-language understanding tasks
by: CAO, Rui
Published: (2024)