GroundNLQ @ Ego4D natural language queries challenge 2023
In this report, we present our champion solution for Ego4D Natural Language Queries (NLQ) Challenge in CVPR 2023. Essentially, to accurately ground in a video, an effective egocentric feature extractor and a powerful grounding model are required. Motivated by this, we leverage a two-stage pre-traini...
Saved in:
Main Authors: | HOU, Zhijian, JI, Lei, GAO, Difei, ZHONG, Wanjun, YAN, Kun, NGO, Chong-wah, CHAN, Wing-Kwong, NGO, Chong-Wah, DUAN, Nan, SHOU, Mike Zheng |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2023
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/8416 https://ink.library.smu.edu.sg/context/sis_research/article/9419/viewcontent/2306.15255.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
Similar Items
-
CONE: An efficient COarse-to-fiNE alignment framework for long video temporal grounding
by: HOU, Zhijian, et al.
Published: (2023) -
CONQUER: Contextual query-aware ranking for video corpus moment retrieval
by: HOU, Zhijian, et al.
Published: (2021) -
(Un)likelihood training for interpretable embedding
by: WU, Jiaxin, et al.
Published: (2023) -
Indexing and matching of polyphonic songs for query-by-singing system
by: LEUNG, Tat-Wan, et al.
Published: (2004) -
Clip-based similarity measure for query-dependent clip retrieval and video summarization
by: PENG, Yuxin, et al.
Published: (2006)