Human action recognition in the dark based on transferring learning and LSTM

Today, information technology is pervasive worldwide. Throughout the years, it has undergone continuous transformation and will continue to evolve. From the initial invention of the computer to the development of software and artificial intelligence, a new era in human life has been unveiled. Bas...

Full description

Saved in:
Bibliographic Details
Main Author: Yang, Fan
Other Authors: Mohammed Yakoob Siyal
Format: Thesis-Master by Coursework
Language:English
Published: Nanyang Technological University 2023
Subjects:
Online Access:https://hdl.handle.net/10356/171887
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:Today, information technology is pervasive worldwide. Throughout the years, it has undergone continuous transformation and will continue to evolve. From the initial invention of the computer to the development of software and artificial intelligence, a new era in human life has been unveiled. Based on deep learning technology, a modern concept called “ smart city ” is proposed. A smart city is a municipality that uses information and communication technologies(ICT) to increase operational efficiency, share information with the public and improve both the quality of government services and citizen welfare. This study introduce an innovative method for conducting video classification in low-light conditions, specifically targeting Human Action Recognition. The majority of prior research has primarily concentrated on datasets characterized by ample brightness levels, facilitating ideal classification conditions. In contrast, the approach revolves around a series of hybrid classifiers, comprising both feature extraction and classification components. The deep learning network of this study, customized for Human Action Recognition in low-light environments, is founded on transfer learning and LSTM (Long Short-Term Memory), effectively balancing accuracy and execution speed requirements. This project encompasses the creation and deployment of a set of hybrid networks composed of diverse components, followed by a performance comparison. These networks harness classic deep learning architectures to extract essential feature vectors, contributing to the advancement of 'smart city' initiatives, ultimately enhancing social security while reducing the expenses associated with public security maintenance. The algorithm is implemented in MATLAB, enabling efficient and reliable computations. Subsequently, data analysis was carried out to assess the comparative effectiveness of various methodologies. The results revealed that the choice of logic significantly impacted accuracy and speed, resulting in superior levels of both. In the realm of classification networks, the author opted for BiLSTM (Bidirectional Long Short-Term Memory) instead of the standard LSTM. BiLSTM processes input sequences bidirectionally, harnessing information from both ends of the sequence, which contributes to enhanced accuracy and efficiency. The author conducted a thorough analysis of the achieved results and offered potential explanations. Additionally, this study have provided recommendations for future research endeavors aimed at addressing areas for improvement.