Deep historical long short-term memory network for action recognition

Human action recognition technology has received increasing interest recently. The technology is very useful in sports video analysis. Most of the action recognition methods in sports mainly focus on recognizing which sport is being performed. However, recognizing of the specific action in videos is...

Full description

Saved in:
Bibliographic Details
Main Authors: Cai, Jiaxin, Hu, Junlin, Tang, Xin, Hung,Tzu-Yi, Tan, Yap Peng
Other Authors: School of Electrical and Electronic Engineering
Format: Article
Language:English
Published: 2022
Subjects:
Online Access:https://hdl.handle.net/10356/160970
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-160970
record_format dspace
spelling sg-ntu-dr.10356-1609702022-08-10T01:32:11Z Deep historical long short-term memory network for action recognition Cai, Jiaxin Hu, Junlin Tang, Xin Hung,Tzu-Yi Tan, Yap Peng School of Electrical and Electronic Engineering Engineering::Electrical and electronic engineering Action Recognition Deep Learning Human action recognition technology has received increasing interest recently. The technology is very useful in sports video analysis. Most of the action recognition methods in sports mainly focus on recognizing which sport is being performed. However, recognizing of the specific action in videos is important for the analysis of some sports video such as tennis matches. Hence, in this paper, we proposed a deep historical long short-term memory network for video-based tennis action recognition and general action recognition. First, the spatial representations are extracted from each frame using a pre-trained convolutional neural network (CNN). To describe the temporal information, a stacked multi-layer long short-term memory network (LSTM) was used. The historical information of the past frames is important for modeling the action. So we propose a historical information layer that is added to the top of the multi-layered LSTM network. A historical feature of each video is generated for classification by hybridizing the hidden state of LSTM at time t and the historical updated feature at time t-1 with an updating scheme and utilized for classification. Experiments on the benchmark datasets demonstrate that our method that using only simple raw RGB video can outperform the state-of-the-art baselines for both general action recognition and tennis action recognition. National Research Foundation (NRF) This work was conducted within the Delta-NTU Corporate Lab for Cyber-Physical Systems with funding support from Delta Electronics Inc and the National Research Foundation (NRF) Singapore under the Corp Lab@University Scheme. This work was also supported by the Education and scientific research project for young and middle-aged teachers of Fujian Province (No.JAT190666), the Xiamen University of Technology Scientific Research Climbing Project (No. XPDKQ19003) and the Xiamen University of Technology High Level Talents Project (No. YKJ15018R). 2022-08-10T01:32:11Z 2022-08-10T01:32:11Z 2020 Journal Article Cai, J., Hu, J., Tang, X., Hung, T. & Tan, Y. P. (2020). Deep historical long short-term memory network for action recognition. Neurocomputing, 407, 428-438. https://dx.doi.org/10.1016/j.neucom.2020.03.111 0925-2312 https://hdl.handle.net/10356/160970 10.1016/j.neucom.2020.03.111 2-s2.0-85086499696 407 428 438 en Neurocomputing © 2020 Elsevier B.V. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
Action Recognition
Deep Learning
spellingShingle Engineering::Electrical and electronic engineering
Action Recognition
Deep Learning
Cai, Jiaxin
Hu, Junlin
Tang, Xin
Hung,Tzu-Yi
Tan, Yap Peng
Deep historical long short-term memory network for action recognition
description Human action recognition technology has received increasing interest recently. The technology is very useful in sports video analysis. Most of the action recognition methods in sports mainly focus on recognizing which sport is being performed. However, recognizing of the specific action in videos is important for the analysis of some sports video such as tennis matches. Hence, in this paper, we proposed a deep historical long short-term memory network for video-based tennis action recognition and general action recognition. First, the spatial representations are extracted from each frame using a pre-trained convolutional neural network (CNN). To describe the temporal information, a stacked multi-layer long short-term memory network (LSTM) was used. The historical information of the past frames is important for modeling the action. So we propose a historical information layer that is added to the top of the multi-layered LSTM network. A historical feature of each video is generated for classification by hybridizing the hidden state of LSTM at time t and the historical updated feature at time t-1 with an updating scheme and utilized for classification. Experiments on the benchmark datasets demonstrate that our method that using only simple raw RGB video can outperform the state-of-the-art baselines for both general action recognition and tennis action recognition.
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
Cai, Jiaxin
Hu, Junlin
Tang, Xin
Hung,Tzu-Yi
Tan, Yap Peng
format Article
author Cai, Jiaxin
Hu, Junlin
Tang, Xin
Hung,Tzu-Yi
Tan, Yap Peng
author_sort Cai, Jiaxin
title Deep historical long short-term memory network for action recognition
title_short Deep historical long short-term memory network for action recognition
title_full Deep historical long short-term memory network for action recognition
title_fullStr Deep historical long short-term memory network for action recognition
title_full_unstemmed Deep historical long short-term memory network for action recognition
title_sort deep historical long short-term memory network for action recognition
publishDate 2022
url https://hdl.handle.net/10356/160970
_version_ 1743119500770803712