Compositional prompting video-language models to understand procedure in instructional videos

Instructional videos are very useful for completing complex daily tasks, which naturally contain abundant clip-narration pairs. Existing works for procedure understanding are keen on pretraining various video-language models with these pairs and then fine-tuning downstream classifiers and localizers...

Full description

Saved in:

Bibliographic Details
Main Authors:	Hu, Guyue, He, Bin, Zhang, Hanwang
Other Authors:	School of Computer Science and Engineering
Format:	Article
Language:	English
Published:	2023
Subjects:	Engineering::Computer science and engineering Prompt Learning Instructional Videos
Online Access:	https://hdl.handle.net/10356/168985
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

Internet

https://hdl.handle.net/10356/168985

Compositional prompting video-language models to understand procedure in instructional videos

Internet

Similar Items