Deep historical long short-term memory network for action recognition
Human action recognition technology has received increasing interest recently. The technology is very useful in sports video analysis. Most of the action recognition methods in sports mainly focus on recognizing which sport is being performed. However, recognizing of the specific action in videos is...
Saved in:
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/160970 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-160970 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1609702022-08-10T01:32:11Z Deep historical long short-term memory network for action recognition Cai, Jiaxin Hu, Junlin Tang, Xin Hung,Tzu-Yi Tan, Yap Peng School of Electrical and Electronic Engineering Engineering::Electrical and electronic engineering Action Recognition Deep Learning Human action recognition technology has received increasing interest recently. The technology is very useful in sports video analysis. Most of the action recognition methods in sports mainly focus on recognizing which sport is being performed. However, recognizing of the specific action in videos is important for the analysis of some sports video such as tennis matches. Hence, in this paper, we proposed a deep historical long short-term memory network for video-based tennis action recognition and general action recognition. First, the spatial representations are extracted from each frame using a pre-trained convolutional neural network (CNN). To describe the temporal information, a stacked multi-layer long short-term memory network (LSTM) was used. The historical information of the past frames is important for modeling the action. So we propose a historical information layer that is added to the top of the multi-layered LSTM network. A historical feature of each video is generated for classification by hybridizing the hidden state of LSTM at time t and the historical updated feature at time t-1 with an updating scheme and utilized for classification. Experiments on the benchmark datasets demonstrate that our method that using only simple raw RGB video can outperform the state-of-the-art baselines for both general action recognition and tennis action recognition. National Research Foundation (NRF) This work was conducted within the Delta-NTU Corporate Lab for Cyber-Physical Systems with funding support from Delta Electronics Inc and the National Research Foundation (NRF) Singapore under the Corp Lab@University Scheme. This work was also supported by the Education and scientific research project for young and middle-aged teachers of Fujian Province (No.JAT190666), the Xiamen University of Technology Scientific Research Climbing Project (No. XPDKQ19003) and the Xiamen University of Technology High Level Talents Project (No. YKJ15018R). 2022-08-10T01:32:11Z 2022-08-10T01:32:11Z 2020 Journal Article Cai, J., Hu, J., Tang, X., Hung, T. & Tan, Y. P. (2020). Deep historical long short-term memory network for action recognition. Neurocomputing, 407, 428-438. https://dx.doi.org/10.1016/j.neucom.2020.03.111 0925-2312 https://hdl.handle.net/10356/160970 10.1016/j.neucom.2020.03.111 2-s2.0-85086499696 407 428 438 en Neurocomputing © 2020 Elsevier B.V. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering Action Recognition Deep Learning |
spellingShingle |
Engineering::Electrical and electronic engineering Action Recognition Deep Learning Cai, Jiaxin Hu, Junlin Tang, Xin Hung,Tzu-Yi Tan, Yap Peng Deep historical long short-term memory network for action recognition |
description |
Human action recognition technology has received increasing interest recently. The technology is very useful in sports video analysis. Most of the action recognition methods in sports mainly focus on recognizing which sport is being performed. However, recognizing of the specific action in videos is important for the analysis of some sports video such as tennis matches. Hence, in this paper, we proposed a deep historical long short-term memory network for video-based tennis action recognition and general action recognition. First, the spatial representations are extracted from each frame using a pre-trained convolutional neural network (CNN). To describe the temporal information, a stacked multi-layer long short-term memory network (LSTM) was used. The historical information of the past frames is important for modeling the action. So we propose a historical information layer that is added to the top of the multi-layered LSTM network. A historical feature of each video is generated for classification by hybridizing the hidden state of LSTM at time t and the historical updated feature at time t-1 with an updating scheme and utilized for classification. Experiments on the benchmark datasets demonstrate that our method that using only simple raw RGB video can outperform the state-of-the-art baselines for both general action recognition and tennis action recognition. |
author2 |
School of Electrical and Electronic Engineering |
author_facet |
School of Electrical and Electronic Engineering Cai, Jiaxin Hu, Junlin Tang, Xin Hung,Tzu-Yi Tan, Yap Peng |
format |
Article |
author |
Cai, Jiaxin Hu, Junlin Tang, Xin Hung,Tzu-Yi Tan, Yap Peng |
author_sort |
Cai, Jiaxin |
title |
Deep historical long short-term memory network for action recognition |
title_short |
Deep historical long short-term memory network for action recognition |
title_full |
Deep historical long short-term memory network for action recognition |
title_fullStr |
Deep historical long short-term memory network for action recognition |
title_full_unstemmed |
Deep historical long short-term memory network for action recognition |
title_sort |
deep historical long short-term memory network for action recognition |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/160970 |
_version_ |
1743119500770803712 |