Triadic temporal-semantic alignment for weakly-supervised video moment retrieval

Video Moment Retrieval (VMR) aims to identify specific event moments within untrimmed videos based on natural language queries. Existing VMR methods have been criticized for relying heavily on moment annotation bias rather than true multi-modal alignment reasoning. Weakly supervised VMR approaches i...

Bibliographic Details
Main Authors: LIU, Jin; XIE, JiaLong; ZHOU, Fengyu; HE, Shengfeng
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 2024
Online Access: https://ink.library.smu.edu.sg/sis_research/9286
https://ink.library.smu.edu.sg/context/sis_research/article/10286/viewcontent/ssrn_4726553.pdf
Institution: Singapore Management University