Boosting video representation learning with multi-faceted integration

Video content is multifaceted, consisting of objects, scenes, interactions, and actions. Existing datasets mostly label only one of these facets for model training, resulting in a video representation that is biased toward a single facet, depending on the training dataset. There is no study yet on how to...

Bibliographic Details
Main Authors: QIU, Zhaofan, YAO, Ting, NGO, Chong-wah, ZHANG, Xiao-Ping, WU, Dong, MEI, Tao
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 2021
Online Access: https://ink.library.smu.edu.sg/sis_research/6808
https://ink.library.smu.edu.sg/context/sis_research/article/7811/viewcontent/cvpr21.pdf
Institution: Singapore Management University