Boosting video representation learning with multi-faceted integration
Video content is multifaceted, consisting of objects, scenes, interactions or actions. The existing datasets mostly label only one of the facets for model training, resulting in the video representation that biases to only one facet depending on the training dataset. There is no study yet on how to...
Saved in:
Main Authors: | QIU, Zhaofan, TING, Yao, NGO, Chong-wah, ZHANG, Xiao-Ping, WU, Dong, MEI, Tao |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2021
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/6808 https://ink.library.smu.edu.sg/context/sis_research/article/7811/viewcontent/cvpr21.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
Similar Items
-
Condensing a sequence to one informative frame for video recognition
by: QIU. Zhaofan,, et al.
Published: (2021) -
Learning spatio-temporal representation with local and global diffusion
by: QIU, Zhaofan, et al.
Published: (2019) -
Dynamic temporal filtering in video models
by: LONG, Fuchen, et al.
Published: (2022) -
Click-boosting multi-modality graph-based reranking for image search
by: YANG, Xiaopeng, et al.
Published: (2015) -
Multi-facet in heterogeneous information network representation learning
by: Zhao, Tianqi
Published: (2020)