Temporal sentence grounding in videos: a survey and future directions

Temporal sentence grounding in videos (TSGV), a.k.a., natural language video localization (NLVL) or video moment retrieval (VMR), aims to retrieve a temporal moment that semantically corresponds to a language query from an untrimmed video. Connecting computer vision and natural language, TSGV has dr...

全面介紹

Saved in:

書目詳細資料
Main Authors:	Zhang, Hao, Sun, Aixin, Jing, Wei, Zhou, Joey Tianyi
其他作者:	School of Computer Science and Engineering
格式:	Article
語言:	English
出版:	2023
主題:	Engineering::Computer science and engineering Cross-Modal Video Retrieval Multimodal Learning
在線閱讀:	https://hdl.handle.net/10356/172187
標簽:	添加標簽沒有標簽, 成為第一個標記此記錄!

因特網

https://hdl.handle.net/10356/172187

Temporal sentence grounding in videos: a survey and future directions

因特網

相似書籍