Temporal sentence grounding in videos: a survey and future directions

Temporal sentence grounding in videos (TSGV), a.k.a., natural language video localization (NLVL) or video moment retrieval (VMR), aims to retrieve a temporal moment that semantically corresponds to a language query from an untrimmed video. Connecting computer vision and natural language, TSGV has dr...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلفون الرئيسيون:	Zhang, Hao, Sun, Aixin, Jing, Wei, Zhou, Joey Tianyi
مؤلفون آخرون:	School of Computer Science and Engineering
التنسيق:	مقال
اللغة:	English
منشور في:	2023
الموضوعات:	Engineering::Computer science and engineering Cross-Modal Video Retrieval Multimodal Learning
الوصول للمادة أونلاين:	https://hdl.handle.net/10356/172187
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!

الانترنت

https://hdl.handle.net/10356/172187

Temporal sentence grounding in videos: a survey and future directions

الانترنت

مواد مشابهة