ROME: Evaluating pre-trained vision-language models on reasoning beyond visual common sense

ROME: Evaluating pre-trained vision-language models on reasoning beyond visual common sense

Humans possess a strong capability for reasoning beyond common sense. For example, given an unconventional image of a goldfish laying on the table next to an empty fishbowl, a human would effortlessly determine that the fish is not inside the fishbowl. The case, however, may be different for a visio...

Full description

Saved in:

Bibliographic Details
Main Authors:	ZHOU, Kankan, LAI, Eason, YEONG, Au Wei Bin, MOURATIDIS, Kyriakos, JIANG, Jing
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2023
Subjects:	Artificial Intelligence and Robotics
Online Access:	https://ink.library.smu.edu.sg/sis_research/8352 https://ink.library.smu.edu.sg/context/sis_research/article/9355/viewcontent/2023.findings_emnlp.683.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

Similar Items

VLStereoSet: A study of stereotypical bias in pre-trained vision-language models
by: ZHOU, Kankan, et al.
Published: (2022)

Using pre-trained models for vision-language understanding tasks
by: CAO, Rui
Published: (2024)

Position-guided text prompt for vision-language pre-training
by: WANG, Alex Jinpeng, et al.
Published: (2023)

Enhancing visual grounding in vision-language pre-training with position-guided text prompts
by: WANG, Alex Jinpeng, et al.
Published: (2024)

Letter From Rome
by: Calderone, S.J.
Published: (1963)

Sentic activation: A two-level affective common sense reasoning framework
by: Cambria, E., et al.
Published: (2014)

Evaluating vision-language models long-chain reasoning ability with multiple ground truths
by: Setiadharma, Christopher Arif
Published: (2024)

Robust optimization made easy with ROME
by: Goh, J., et al.
Published: (2013)

PRE-INDUSTRIAL DESIGN OF ROME BEAUTY APPLE (MALUS SYLVESTRIS MILL.) WITH EDIBLE COATING OF CARRAGEENAN AND GLYCEROL: EDIBLE COATING AT VARIOUS STORAGE TEMPERATURES ON THE QUALITY OF ROME BEAUTY APPLES (MALUS SYLVESTRIS MILL.)
by: Kevin Budisetyono, Christopher

Sentic neural networks: A novel cognitive model for affective common sense reasoning
by: Mazzocco, T., et al.
Published: (2014)

Switching between different ways to think: Multiple approaches to affective common sense reasoning
by: Cambria, E., et al.
Published: (2014)

VISUAL QUESTION ANSWERING REASONING SYNTHETIC DATA GENERATION USING LARGE VISION LANGUAGE MODEL
by: Amadeus Irawan, Patrick

Atopic patients who fulfilled Rome III criteria for irritable bowel syndRome had higher animal danders sensitization
by: Siah, K.T.H., et al.
Published: (2021)

Malaysia and the Rome Statute : Domestic Debate Over?
by: Waikar, Prashant
Published: (2019)

Rome at its Height: Roman Imperial Civilization
by: Malley, William J.
Published: (1960)

A common approach for consumer and provider fairness in recommendations
by: Sacharidis, Dimitris, et al.
Published: (2019)

On the transferability of pre-trained language models for low-resource programming languages
by: CHEN, Fuxiang, et al.
Published: (2022)

A Reporter in Rome: How the Catholic Church is Governed
by: Aguilar, Pablo V.
Published: (1961)

Choice of law for unjust enrichment/restitution and the Rome II regulation
by: CHONG, Adeline
Published: (2008)

Irritable bowel syndrome and the Rome III criteria: For better or for worse?
by: Gwee, K.-A.
Published: (2016)

A Commons beyond the Human
by: Er, Yanbing
Published: (2023)

Injecting descriptive meta-information into pre-trained language models with hypernetworks
by: DUAN, Wenying, et al.
Published: (2021)

Pre-training model based on the transfer learning in natural language processing
by: Tang, Jiayi
Published: (2019)

European news vision : 1993 and beyond
by: Lansipuro, Yrjo
Published: (2008)

Do pre-trained models benefit knowledge graph completion? A reliable evaluation and a reasonable approach
by: LV, Xin, et al.
Published: (2022)

Helminthic invasion of the central nervous system: Many roads lead to Rome
by: Juri Katchanov, et al.
Published: (2018)

Beyond Minority Report: Pre-Crime, Pre-punishment and Pre-desert
by: WILLIAMS, John N.
Published: (2012)

Beyond Minority Report: Pre-Crime, Pre-Punishment and Pre-Desert
by: WILLIAMS, John N.
Published: (2006)

On the usage of continual learning for out-of-distribution generalization in pre-trained language models of code
by: WEYSSOW, Martin, et al.
Published: (2023)

Fake review detection by fusing parameter efficient adapters in pre-trained language model
by: Ho, See Cheng
Published: (2024)

Synergizing Large Language Models and pre-trained smaller models for conversational intent discovery
by: LIANG, Jinggui, et al.
Published: (2024)

Contextual human object interaction understanding from pre-trained large language model
by: Gao ,Jianjun, et al.
Published: (2025)

KONSTRUKSI TAKSONOMI OTOMATIS MENGGUNAKAN PRE-TRAINED LANGUAGE MODEL UNTUK BAHASA INDONESIA
by: Faturrahman, Ridwan

LOW-RESOURCE CLICKBAIT SPOILING FOR INDONESIAN USING MULTILINGUAL PRE-TRAINED LANGUAGE MODELS
by: Putu Intan Maharani, Ni

Emergent semantic segmentation: training-free dense-label-free extraction from vision-language models
by: Luo, Jiayun
Published: (2024)

Vision language representation learning
by: Yang, Xiaofeng
Published: (2023)

Reasoning about complex agent knowledge - Ontologies, Uncertainty, rules and beyond
by: FENG YUZHANG
Published: (2011)

Reliability and validity of Thai version ROME III questionnaire for children with functional gastrointestinal disorders
by: Thitima Ngoenmak, et al.
Published: (2018)

Validity and reliability of the Thai version of Rome IV diagnostic questionnaires for pediatric gastrointestinal disorders
by: S. Siajunboriboon, et al.
Published: (2020)

THE PROBLEMS WITH INTERNATIONAL CRIMINAL JUSTICE: THE AFRICAN NATIONS AND THEIR PUSH FOR WITHDRAWAL FROM THE ROME STATUTE
by: HARSH MAHASETH
Published: (2020)