Enhancing performance in video grounding tasks through the use of attention module
This report investigates improving video grounding tasks through the use of attention mechanisms, tackling the issue of sparse annotations in video datasets. Drawing inspiration from the MMN model \cite{wang2021_negative_2dmap}, we developed a modified model based on the open-source MMN codebase and...
Saved in:
Main Author: | Do Duc Anh |
---|---|
Other Authors: | Sun Aixin |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/181703 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Similar Items
-
Enhancing performance in video grounding tasks through the use of captions
by: Liu, Xinran
Published: (2024) -
Enhancing perceptual and attentional skills requires common demands between the action video games and transfer tasks
by: Oei, Adam C., et al.
Published: (2015) -
Learning unsupervised video object segmentation through visual attention
by: WANG, Wenguan, et al.
Published: (2019) -
Effects of task-based attentional modulation on reaction time (RT) of basketball players
by: Tan, Jing Yi
Published: (2022) -
Finding visual attention regions in videos
by: Ang, Kenny Wen Bin
Published: (2010)