Grounding referring expression in computer vision
This project studies the integration of language and vision in computer vision, focusing on Grounding Referring Expressions utilising the state-of-the-art GroundingDINO model. We address the topic of object identification and segmentation, emphasising zero-shot models’ ability to recognise items...
Saved in:
Main Author: | Yuen, Shaun Chien Wee |
---|---|
Other Authors: | Hanwang Zhang |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/174979 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Similar Items
-
Grounding referring expressions in images by variational context
by: Zhang, Hanwang, et al.
Published: (2020) -
PrefAce: face-centric pretraining with self-structure aware distillation
by: Hu, Siyuan
Published: (2024) -
Road detection using intrinsic colors in a stereo vision system
by: DONG SI TUE CUONG
Published: (2010) -
Enhancing visual grounding in vision-language pre-training with position-guided text prompts
by: WANG, Alex Jinpeng, et al.
Published: (2024) -
Grounding referring expressions in images with neural module tree network
by: Tan, Kuan Yeow
Published: (2022)