LOVA3 : Learning to visual question answering, asking and assessment
Question answering, asking, and assessment are three innate human traits crucial for understanding the world and acquiring knowledge. By enhancing these capabilities, humans can more effectively utilize data, leading to better comprehension and learning outcomes. Current Multimodal Large Language Mo...
Saved in:
Main Authors: | ZHAO, Henry Hengyuan, ZHOU, Pan, GAO, Difei, SHOU, BAI, SHOU, Mike Zheng |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2024
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/9730 https://ink.library.smu.edu.sg/context/sis_research/article/10730/viewcontent/LoVA.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
Similar Items
-
Genixer : Empowering multimodal Large Language Models as a powerful data generator
by: ZHAO, Henry Hengyuan, et al.
Published: (2024) -
Knowledge base question answering with topic units
by: LAN, Yunshi, et al.
Published: (2019) -
Context modeling with evidence filter for multiple choice question answering
by: YU, Sicheng, et al.
Published: (2022) -
QUESTION ANSWERING USING DEEP NEURAL NETWORKS: SINGLE TURN AND BEYOND
by: SOUVIK KUNDU
Published: (2020) -
Snap-and-ask: Answering multimodal question by naming visual instance
by: ZHANG, Wei, et al.
Published: (2012)