Contextual object detection with multimodal large language models

Contextual object detection with multimodal large language models

Recent Multimodal Large Language Models (MLLMs) are remarkable in vision-language tasks, such as image captioning and question answering, but lack the essential perception ability, i.e., object detection. In this work, we address this limitation by introducing a novel research problem of contextual...

Full description

Saved in:

Bibliographic Details
Main Authors:	Zang, Yuhang, Li, Wei, Han, Jun, Zhou, Kaiyang, Loy, Chen Change
Other Authors:	College of Computing and Data Science
Format:	Article
Language:	English
Published:	2024
Subjects:	Computer and Information Science Image segmentation Object detection
Online Access:	https://hdl.handle.net/10356/181063
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

Similar Items

Semi-supervised and long-tailed object detection with Cascadematch
by: Zang, Yuhang, et al.
Published: (2023)

Information-theoretic analysis of input strokes in visual object cutout
by: Mu, Y., et al.
Published: (2014)

Car cabin object detection using artificial intelligence (multimodal object detection)
by: Li, Ying
Published: (2024)

Hierarchical object groups for scene classification
by: Sadovnik A., et al.
Published: (2018)

Efficient salient region detection with soft image abstraction
by: CHENG, Ming-Ming, et al.
Published: (2013)

Contextual human object interaction understanding from pre-trained large language model
by: Gao ,Jianjun, et al.
Published: (2025)

Towards Unified Object Analytics
by: DONG JIAN
Published: (2014)

Development of robotic grasping of object detection model
by: Koh, Aloysius Jun Jie
Published: (2024)

Object detection and tracking (in car cabin)
by: Huda, Md Tanvirul
Published: (2024)

Mixed-dish recognition with contextual relation networks
by: DENG, Lixi, et al.
Published: (2019)

Object detection for mobile robots in adverse conditions
by: Liu, Tingtao
Published: (2024)

Video categorization using Object of Interest detection
by: Kowdle A., et al.
Published: (2018)

Open world object detection: a survey
by: Li, Yiming, et al.
Published: (2025)

Learning long-term structural dependencies for video salient object detection
by: WANG, Bo, et al.
Published: (2020)

OW-Mamba: Mamba for open world object detection
by: Sun, Heyuan
Published: (2024)

Example-based depth generation from single image for 3D content
by: Liu K.-C., et al.
Published: (2018)

Object detection in car cabin environment
by: Aarathy Ajay
Published: (2024)

Real-time object detection by cameras mounted on mobile robots on campus
by: Xu, Haozhe
Published: (2025)

Exploring tiny images: The roles of appearance and contextual information for machine and human object recognition
by: Parikh D., et al.
Published: (2018)

TOWARDS EFFICIENT OBJECT DETECTION WITH DEEP LEARNING
by: WANG, TAO
Published: (2023)

Edge Distraction-aware Salient Object Detection
by: REN, Sucheng, et al.
Published: (2023)

Fast and efficient method for fire detection using image processing
by: Celik, T.
Published: (2014)

Contour-based object detection as dominant set computation
by: Yang, X., et al.
Published: (2013)

Exploring different dehazing algorithms for object detection in foggy weather conditions for autonomous vehicles
by: Kim, Chae Yoon
Published: (2024)

Joint optimization of background subtraction and object detection for night surveillance
by: Li C., et al.
Published: (2018)

Delving into salient object subitizing and detection
by: HE, Shengfeng, et al.
Published: (2017)

Assemble new object detector with few examples
by: Yang, K., et al.
Published: (2013)

Seed Image Selection in interactive cosegmentation
by: Batra D., et al.
Published: (2018)

S-CNN : subcategory-aware convolutional networks for object detection
by: Chen, Tao, et al.
Published: (2020)

GEOMETRIC PATTERN DETECTION FOR COMPLEX IMAGE ANALYSIS INSPIRED BY HUMAN PERCEPTION
by: ZHENG XIAOXU
Published: (2020)

Reciprocal transformations for unsupervised video object segmentation
by: REN, Sucheng, et al.
Published: (2021)

FEW-SHOT IMAGE RECOGNITION AND OBJECT DETECTION
by: LI YITING
Published: (2023)

Multi-Objective Optimization Based Image Segmentation: Method and Applications
by: PERIASAMY KARTHIK RAJA
Published: (2012)

Simple image-level classification improves open-vocabulary object detection
by: FANG, Ruohuan, et al.
Published: (2024)

Learning deep networks for video object segmentation
by: Lim, Jun Rong
Published: (2024)

Paying attention to video object pattern understanding
by: WANG, Wenguan, et al.
Published: (2021)

Real-world object detection
by: Zang, Yuhang
Published: (2023)

Background cutout with automatic object discovery
by: Liu D., et al.
Published: (2018)

LiDAR-based 3D object detection and tracking for autonomous driving
by: Luo, Zhipeng
Published: (2024)

Improving object color categorization with shapes
by: Zhang Y., et al.
Published: (2018)