Align and Tell: Boosting Text-Video Retrieval With Local Alignment and Fine-Grained Supervision
10.1109/TMM.2022.3204444
Saved in:
Main Authors: | Wang, X, Zhu, L, Zheng, Z, Xu, M, Yang, Y |
---|---|
Other Authors: | DEPARTMENT OF COMPUTER SCIENCE |
Format: | Article |
Published: |
Institute of Electrical and Electronics Engineers (IEEE)
2023
|
Online Access: | https://scholarbank.nus.edu.sg/handle/10635/245914 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | National University of Singapore |
Similar Items
-
Triadic temporal-semantic alignment for weakly-supervised video moment retrieval
by: LIU, Jin, et al.
Published: (2024) -
Paired cross-modal data augmentation for fine-grained image-to-text retrieval
by: Wang, Hao, et al.
Published: (2023) -
Automatic music video summarization based on audio-visual-text analysis and alignment
by: Xu, C., et al.
Published: (2014) -
Topical summarization of web videos by visual-text time-dependent alignment
by: TAN, Song, et al.
Published: (2010) -
CONE: An efficient COarse-to-fiNE alignment framework for long video temporal grounding
by: HOU, Zhijian, et al.
Published: (2023)