Video analytics based on deep learning and information fusion technologies

In recent years, video analytics has risen to become a popular topic in the field of Artificial Intelligence. With the advancement in high-speed connection, machine learning algorithms and IoT technologies, the applications of video analytics using multiple modalities and information fusion technolo...

Full description

Saved in:

Bibliographic Details
Main Author:	Lee, Zheng Han
Other Authors:	Mao Kezhi
Format:	Final Year Project
Language:	English
Published:	Nanyang Technological University 2020
Subjects:	Engineering::Electrical and electronic engineering
Online Access:	https://hdl.handle.net/10356/139262
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-139262
record_format	dspace
spelling	sg-ntu-dr.10356-1392622023-07-07T17:55:49Z Video analytics based on deep learning and information fusion technologies Lee, Zheng Han Mao Kezhi School of Electrical and Electronic Engineering ekzmao@ntu.edu.sg Engineering::Electrical and electronic engineering In recent years, video analytics has risen to become a popular topic in the field of Artificial Intelligence. With the advancement in high-speed connection, machine learning algorithms and IoT technologies, the applications of video analytics using multiple modalities and information fusion technologies is becoming a commodity to everyone in the Information Age and the coming future. Most studies done in this topic previously focused on pushing the boundaries of algorithms for the applications of information fusion, such as Audio-visual correspondence task (AVC) and video-scene segmentation. This study aims to explore the optimization of video analytics based on information fusion technologies by using C3D-based action recognition function as the benchmark for video analytics performance. By scrutinizing and testing the mechanisms and architectures of the C3D-based action model, the best performing elements and the reasons behind their performances are explored. The types of pooling, optimizer and scheduler and their respective accuracies with the dataset used are recorded. The different methods of fusion of visual-audio information and their introduction into the action recognition model are explored. Their executions and respective accuracies are studied to get insights on how they affect the model’s performance. The feature extraction methods for the audio modality with their respective performance are also studied. Different self-attention mechanisms involving the modalities and channels are implemented in the model and the resulting accuracies studied. These explorations provide understandings on how they affect the performance of video analytics based on information fusion and subsequently help to unleash its full potential. Bachelor of Engineering (Electrical and Electronic Engineering) 2020-05-18T07:26:07Z 2020-05-18T07:26:07Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/139262 en A1121-191 application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Electrical and electronic engineering
spellingShingle	Engineering::Electrical and electronic engineering Lee, Zheng Han Video analytics based on deep learning and information fusion technologies
description	In recent years, video analytics has risen to become a popular topic in the field of Artificial Intelligence. With the advancement in high-speed connection, machine learning algorithms and IoT technologies, the applications of video analytics using multiple modalities and information fusion technologies is becoming a commodity to everyone in the Information Age and the coming future. Most studies done in this topic previously focused on pushing the boundaries of algorithms for the applications of information fusion, such as Audio-visual correspondence task (AVC) and video-scene segmentation. This study aims to explore the optimization of video analytics based on information fusion technologies by using C3D-based action recognition function as the benchmark for video analytics performance. By scrutinizing and testing the mechanisms and architectures of the C3D-based action model, the best performing elements and the reasons behind their performances are explored. The types of pooling, optimizer and scheduler and their respective accuracies with the dataset used are recorded. The different methods of fusion of visual-audio information and their introduction into the action recognition model are explored. Their executions and respective accuracies are studied to get insights on how they affect the model’s performance. The feature extraction methods for the audio modality with their respective performance are also studied. Different self-attention mechanisms involving the modalities and channels are implemented in the model and the resulting accuracies studied. These explorations provide understandings on how they affect the performance of video analytics based on information fusion and subsequently help to unleash its full potential.
author2	Mao Kezhi
author_facet	Mao Kezhi Lee, Zheng Han
format	Final Year Project
author	Lee, Zheng Han
author_sort	Lee, Zheng Han
title	Video analytics based on deep learning and information fusion technologies
title_short	Video analytics based on deep learning and information fusion technologies
title_full	Video analytics based on deep learning and information fusion technologies
title_fullStr	Video analytics based on deep learning and information fusion technologies
title_full_unstemmed	Video analytics based on deep learning and information fusion technologies
title_sort	video analytics based on deep learning and information fusion technologies
publisher	Nanyang Technological University
publishDate	2020
url	https://hdl.handle.net/10356/139262
_version_	1772827881154543616

Video analytics based on deep learning and information fusion technologies

Similar Items