Enhancing performance in video grounding tasks through the use of attention module

Enhancing performance in video grounding tasks through the use of attention module

This report investigates improving video grounding tasks through the use of attention mechanisms, tackling the issue of sparse annotations in video datasets. Drawing inspiration from the MMN model \cite{wang2021_negative_2dmap}, we developed a modified model based on the open-source MMN codebase and...

Full description

Saved in:

Bibliographic Details
Main Author:	Do Duc Anh
Other Authors:	Sun Aixin
Format:	Final Year Project
Language:	English
Published:	Nanyang Technological University 2024
Subjects:	Computer and Information Science
Online Access:	https://hdl.handle.net/10356/181703
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

Similar Items

Enhancing performance in video grounding tasks through the use of captions
by: Liu, Xinran
Published: (2024)

Enhancing perceptual and attentional skills requires common demands between the action video games and transfer tasks
by: Oei, Adam C., et al.
Published: (2015)

Learning unsupervised video object segmentation through visual attention
by: WANG, Wenguan, et al.
Published: (2019)

Effects of task-based attentional modulation on reaction time (RT) of basketball players
by: Tan, Jing Yi
Published: (2022)

Finding visual attention regions in videos
by: Ang, Kenny Wen Bin
Published: (2010)

Towards efficient video-based action recognition: context-aware memory attention network
by: Koh, Thean Chun, et al.
Published: (2024)

Towards temporal sentence grounding in videos
by: Zhang, Hao
Published: (2022)

Detection of visual attention regions in images and videos
by: Hu, Yiqun
Published: (2009)

Aggregating intrinsic information to enhance BCI performance through federated learning
by: Liu, Rui, et al.
Published: (2024)

Generalizability of EEG-based mental attention modeling with multiple cognitive tasks
by: Phyo Wai, Aung Aung, et al.
Published: (2021)

Paying attention to video object pattern understanding
by: WANG, Wenguan, et al.
Published: (2021)

A novel transformer for attention decoding using EEG
by: Lee, Joon Hei
Published: (2024)

Multimodal transformer networks for end-to-end video-grounded dialogue systems
by: LE, Hung, et al.
Published: (2019)

Enhancement of learning through the use of video protocols.
by: Ong, Eunice Mei Jing.
Published: (2013)

EFFECT OF WORKING MEMORY CONTENT ON ATTENTION CAPTURE TASK PERFORMANCE
by: FOONG KAI JING CLARENCE
Published: (2018)

Multi-task learning with multi-view attention for answer selection and knowledge base question answering
by: DENG, Yang, et al.
Published: (2019)

Enhanced multi-task learning architecture for detecting pedestrian at far distance
by: Zhou, Chengju, et al.
Published: (2024)

Enhancing play-out performance for internet video communications
by: Yip, See Wai.
Published: (2011)

Deep learning for video-grounded dialogue systems
by: LE, Hung
Published: (2022)

Attention-based histological image analysis
by: Wang, Jerome Jie Rui
Published: (2024)

Synthetic image generation and the use of virtual environments for image enhancement tasks
by: Del Gallego, Neil Patrick
Published: (2023)

SegEQA : video segmentation based visual attention for embodied question answering
by: Luo, Haonan, et al.
Published: (2020)

MuLAN: multi-level attention-enhanced matching network for few-shot knowledge graph completion
by: Li, Qianyu, et al.
Published: (2024)

Grounding referring expressions in images with neural module tree network
by: Tan, Kuan Yeow
Published: (2022)

Graph neural network with self-attention and multi-task learning for credit default risk prediction
by: LI, Zihao, et al.
Published: (2022)

Enhancing the performance of selectivity estimation with machine learning techniques
by: Meng, Zizhong
Published: (2024)

Personality Correlates of Sustained Attention Performance in a Low Loading Task
by: EVANIA TAN LI MIN
Published: (2013)

Designing a brain-computer interface platform for attention training
by: Zou, Zeren
Published: (2024)

Neurofeedback games to enhance human attentiveness
by: Xiang, Qiuyu
Published: (2014)

Attention-based sound classification pipeline with sound spectrum
by: Tan, Ki In, et al.
Published: (2024)

Neural image and video captioning
by: Lam, Ting En
Published: (2024)

CNN-Based Classification for Highly Similar Vehicle Model Using Multi-Task Learning
by: Avianto, Donny, et al.
Published: (2022)

Grounding referring expression in computer vision
by: Yuen, Shaun Chien Wee
Published: (2024)

Evaluating vision-language models long-chain reasoning ability with multiple ground truths
by: Setiadharma, Christopher Arif
Published: (2024)

Social-enhanced Attentive Group Recommendation
by: Da Cao, et al.
Published: (2020)

Delving deep into many-to-many attention for few-shot video object segmentation
by: CHEN, Haoxin, et al.
Published: (2021)

MAE-VQA: an efficient and accurate end-to-end video quality assessment method for user generated content videos
by: Wang, Chuhan
Published: (2024)

Modulo video recovery with deep learning
by: Li,Zike
Published: (2024)

Deep pixel-level matching via attention for video co-segmentation
by: LI, Junliang, et al.
Published: (2020)

SchEDUhelp: a personalised task & schedule manager for SCSE
by: Goh, Jamie Jie Min
Published: (2024)