Simple image-level classification improves open-vocabulary object detection

Open-Vocabulary Object Detection (OVOD) aims to detect novel objects beyond a given set of base categories on which the detection model is trained. Recent OVOD methods focus on adapting the image-level pre-trained vision-language models (VLMs), such as CLIP, to a region-level object detection task v...

Bibliographic Details
Main Authors: FANG, Ruohuan, PANG, Guansong, BAI, Xiao
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 2024
Online Access: https://ink.library.smu.edu.sg/sis_research/8744
https://ink.library.smu.edu.sg/context/sis_research/article/9747/viewcontent/27939_Article_Text_31993_1_2_20240324.pdf
Institution: Singapore Management University