Grounding referring expression in computer vision

Grounding referring expression in computer vision

This project studies the integration of language and vision in computer vision, focusing on Grounding Referring Expressions utilising the state-of-the-art GroundingDINO model. We address the topic of object identification and segmentation, emphasising zero-shot models’ ability to recognise items...

Full description

Saved in:

Bibliographic Details
Main Author:	Yuen, Shaun Chien Wee
Other Authors:	Hanwang Zhang
Format:	Final Year Project
Language:	English
Published:	Nanyang Technological University 2024
Subjects:	Computer and Information Science Computer vision Grounding Artificial intelligence
Online Access:	https://hdl.handle.net/10356/174979
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

Similar Items

Grounding referring expressions in images by variational context
by: Zhang, Hanwang, et al.
Published: (2020)

PrefAce: face-centric pretraining with self-structure aware distillation
by: Hu, Siyuan
Published: (2024)

Road detection using intrinsic colors in a stereo vision system
by: DONG SI TUE CUONG
Published: (2010)

Enhancing visual grounding in vision-language pre-training with position-guided text prompts
by: WANG, Alex Jinpeng, et al.
Published: (2024)

Grounding referring expressions in images with neural module tree network
by: Tan, Kuan Yeow
Published: (2022)

Deconfounded visual grounding
by: HUANG, Jianqiang, et al.
Published: (2022)

Skin beauty adviser assistant based on large language model and computer vision
by: Jiang, Yuwei
Published: (2025)

Are vision language models multimodal learners?
by: Lee, Gyeonggeon
Published: (2024)

Learning language to symbol and language to vision mapping for visual grounding
by: He, Su, et al.
Published: (2022)

Car cabin surveillance using computer vision
by: Soegeng, Andrew Ivan
Published: (2022)

Digital Theremin with computer vision
by: Chua, Ryuichi
Published: (2024)

Retrofitting a legacy cutlery washing machine using computer vision
by: FWA, Hua Leong
Published: (2024)

A binocular vision system for object tracking and distance perception based on optical convergence
by: Ke, Gaston Anthony N., et al.
Published: (2006)

Data efficient learning for 3D computer vision
by: Wei, Jiacheng
Published: (2023)

Computer Vision Systems
by: Gasteratos, Antonios ; Vincze, Markus ; Tsotsos, John K.
Published: (2017)

Abdominal palpation characterization using computer vision
by: Rosaldo, Aeysol F.
Published: (2016)

Computer Vision and Computer Graphics. Theory and Applications
Published: (2017)

Augmenting teacher noticing in science experiments: using computer vision to extract student activity information for science teachers
by: Chng, Edwin
Published: (2024)

A computer vision and smartphone-based tool for onsite color correction and dimensional measurements
by: Xiong, Wenxi
Published: (2024)

Transformers for computer vision
by: Deng, Yaojun
Published: (2022)

A computer vision sensor for efficient object detection under varying lighting conditions
by: Cuhadar, Can, et al.
Published: (2022)

Benchmarking neuromorphic vision: Lessons learnt from computer vision
by: Tan, C, et al.
Published: (2020)

Accelerating computer vision algorithms on heterogeneous edge computing platforms
by: Prakash, Alok, et al.
Published: (2021)

Vision-language-model-based video quality assessment
by: Zhang, Erli
Published: (2024)

An empirical study on adaptation methods for large-scale vision-language models
by: Wang, Annan
Published: (2023)

Sparsity Analysis for Computer Vision Applications
by: CHENG BIN
Published: (2013)

Enhancing performance in video grounding tasks through the use of captions
by: Liu, Xinran
Published: (2024)

YOLO-BAM: Integrating CBAM to the YOLOv3 Model for Pedestrian Detection in Images
by: Eclarinal, Jason, et al.
Published: (2023)

Vision language representation learning
by: Yang, Xiaofeng
Published: (2023)

Computer vision
by: Bondoc, Jaime, et al.
Published: (1990)

Vision transformer as image fusion model
by: Zhao, Fengye
Published: (2023)

Zero-shot object detection and referring expression comprehension using vision-language models
by: A Manicka, Praveen
Published: (2024)

Vergence control for a biologically inspired binocular active vision system
by: Zhang, Xuejie
Published: (2012)

Computer vision optimization on embedded GPU board
by: Li, Ziyang
Published: (2022)

Cat detection using computer vision
by: Chen, Zhe.
Published: (2012)

Cat detection using computer vision
by: Zhou, Ruihong.
Published: (2013)

STUDY OF GROUND SETTLEMENT INDUCED BY TUNNELLING AND DATA ENHANCEMENT FOR SETTLEMENT MONITORING USING LIDAR
by: HUANG LAN
Published: (2024)

Automatic recognition of facial expressions
by: Gee, Cheng Mun
Published: (2020)

Toward a grounded theory of game development work in the Philippines
by: Serrano, Elcid A., et al.
Published: (2018)

Vision System for Hand Gesture Recognition (VISOR)
by: Enero, Jerome M., et al.
Published: (2006)