SMART FOOTBALL: DEEP LEARNING FOR ACTION RECOGNITION IN FOOTBALL VIDEOS

In the world of football, video analysis of matches, particularly the identification of actions within football matches, is a crucial aspect for observing and comprehending gameplay patterns, analyzing tactics, player training, and team strategy considerations. Traditionally, video analysis has b...

Full description

Saved in:
Bibliographic Details
Main Author: Ibnu Syah Hafizh, M.
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/76664
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:76664
spelling id-itb.:766642023-08-17T08:22:47ZSMART FOOTBALL: DEEP LEARNING FOR ACTION RECOGNITION IN FOOTBALL VIDEOS Ibnu Syah Hafizh, M. Indonesia Final Project Football, Action Recognition, Deep Learning, Temporal Context Aggregation. INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/76664 In the world of football, video analysis of matches, particularly the identification of actions within football matches, is a crucial aspect for observing and comprehending gameplay patterns, analyzing tactics, player training, and team strategy considerations. Traditionally, video analysis has been manually conducted by analysts or coaches, demanding significant time and effort. However, with technological advancements, especially in the field of machine learning and pattern recognition, this analysis process can be optimized, and its efficiency enhanced. One potential technology that could play a role in addressing this issue is deep learning. Deep learning is a subfield of artificial intelligence. Currently, several deep learning models exist for recognizing actions in football match videos, such as NetVLAD, AudioVid, and Context-Aware Loss Function (CALF). In this research, a novel approach is proposed, called Temporal Context Aggregation (TCA), a deep learning model inspired by NetVLAD and CALF. TCA considers the temporal context in action learning and recognition. The temporal context surrounding an action is a critical factor in understanding the actions occurring in a video, as it contains information about the sequence and duration of the actions. In this research, the TCA model is trained using football match video data and action labels obtained from the SoccerNet dataset. Subsequently, a series of experiments are conducted on the trained model to test the performance of the TCA model. The results show that this model achieves an average mAP (mean Average Precision) score of 51.62%, outperforming existing action recognition methods. Through this study, it is hoped to provide new insights and broaden the scope of deep learning technology applications in football match video analysis. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description In the world of football, video analysis of matches, particularly the identification of actions within football matches, is a crucial aspect for observing and comprehending gameplay patterns, analyzing tactics, player training, and team strategy considerations. Traditionally, video analysis has been manually conducted by analysts or coaches, demanding significant time and effort. However, with technological advancements, especially in the field of machine learning and pattern recognition, this analysis process can be optimized, and its efficiency enhanced. One potential technology that could play a role in addressing this issue is deep learning. Deep learning is a subfield of artificial intelligence. Currently, several deep learning models exist for recognizing actions in football match videos, such as NetVLAD, AudioVid, and Context-Aware Loss Function (CALF). In this research, a novel approach is proposed, called Temporal Context Aggregation (TCA), a deep learning model inspired by NetVLAD and CALF. TCA considers the temporal context in action learning and recognition. The temporal context surrounding an action is a critical factor in understanding the actions occurring in a video, as it contains information about the sequence and duration of the actions. In this research, the TCA model is trained using football match video data and action labels obtained from the SoccerNet dataset. Subsequently, a series of experiments are conducted on the trained model to test the performance of the TCA model. The results show that this model achieves an average mAP (mean Average Precision) score of 51.62%, outperforming existing action recognition methods. Through this study, it is hoped to provide new insights and broaden the scope of deep learning technology applications in football match video analysis.
format Final Project
author Ibnu Syah Hafizh, M.
spellingShingle Ibnu Syah Hafizh, M.
SMART FOOTBALL: DEEP LEARNING FOR ACTION RECOGNITION IN FOOTBALL VIDEOS
author_facet Ibnu Syah Hafizh, M.
author_sort Ibnu Syah Hafizh, M.
title SMART FOOTBALL: DEEP LEARNING FOR ACTION RECOGNITION IN FOOTBALL VIDEOS
title_short SMART FOOTBALL: DEEP LEARNING FOR ACTION RECOGNITION IN FOOTBALL VIDEOS
title_full SMART FOOTBALL: DEEP LEARNING FOR ACTION RECOGNITION IN FOOTBALL VIDEOS
title_fullStr SMART FOOTBALL: DEEP LEARNING FOR ACTION RECOGNITION IN FOOTBALL VIDEOS
title_full_unstemmed SMART FOOTBALL: DEEP LEARNING FOR ACTION RECOGNITION IN FOOTBALL VIDEOS
title_sort smart football: deep learning for action recognition in football videos
url https://digilib.itb.ac.id/gdl/view/76664
_version_ 1822995010903080960