Effective action recognition with embedded key point shifts
Temporal feature extraction is an essential technique in video-based action recognition. Key points have been utilized in skeleton-based action recognition methods but they require costly key point annotation. In this paper, we propose a novel temporal feature extraction module, named Key Point Shif...
Saved in:
Main Authors: | , , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/162581 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-162581 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1625812022-10-31T05:25:16Z Effective action recognition with embedded key point shifts Cao, Haozhi Xu, Yuecong Yang, Jianfei Mao, Kezhi Yin, Jianxiong See, Simon School of Electrical and Electronic Engineering Engineering::Electrical and electronic engineering Action Recognition Temporal Feature Temporal feature extraction is an essential technique in video-based action recognition. Key points have been utilized in skeleton-based action recognition methods but they require costly key point annotation. In this paper, we propose a novel temporal feature extraction module, named Key Point Shifts Embedding Module (KPSEM), to adaptively extract channel-wise key point shifts across video frames without key point annotation. Key points are adaptively extracted as feature points with maximum feature values at split regions and key point shifts are the spatial displacements of corresponding key points. The key point shifts are encoded as the overall temporal features via linear embedding layers in a multi-set manner. Our method achieves competitive performance through embedding key point shifts with trivial computational cost, achieving the state-of-the-art performance of 78.81% on Mini-Kinetics and competitive performance on UCF101, Something-Something-v1 and HMDB51 datasets. 2022-10-31T05:25:16Z 2022-10-31T05:25:16Z 2021 Journal Article Cao, H., Xu, Y., Yang, J., Mao, K., Yin, J. & See, S. (2021). Effective action recognition with embedded key point shifts. Pattern Recognition, 120, 108172-. https://dx.doi.org/10.1016/j.patcog.2021.108172 0031-3203 https://hdl.handle.net/10356/162581 10.1016/j.patcog.2021.108172 2-s2.0-85111070128 120 108172 en Pattern Recognition © 2021 Elsevier Ltd. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering Action Recognition Temporal Feature |
spellingShingle |
Engineering::Electrical and electronic engineering Action Recognition Temporal Feature Cao, Haozhi Xu, Yuecong Yang, Jianfei Mao, Kezhi Yin, Jianxiong See, Simon Effective action recognition with embedded key point shifts |
description |
Temporal feature extraction is an essential technique in video-based action recognition. Key points have been utilized in skeleton-based action recognition methods but they require costly key point annotation. In this paper, we propose a novel temporal feature extraction module, named Key Point Shifts Embedding Module (KPSEM), to adaptively extract channel-wise key point shifts across video frames without key point annotation. Key points are adaptively extracted as feature points with maximum feature values at split regions and key point shifts are the spatial displacements of corresponding key points. The key point shifts are encoded as the overall temporal features via linear embedding layers in a multi-set manner. Our method achieves competitive performance through embedding key point shifts with trivial computational cost, achieving the state-of-the-art performance of 78.81% on Mini-Kinetics and competitive performance on UCF101, Something-Something-v1 and HMDB51 datasets. |
author2 |
School of Electrical and Electronic Engineering |
author_facet |
School of Electrical and Electronic Engineering Cao, Haozhi Xu, Yuecong Yang, Jianfei Mao, Kezhi Yin, Jianxiong See, Simon |
format |
Article |
author |
Cao, Haozhi Xu, Yuecong Yang, Jianfei Mao, Kezhi Yin, Jianxiong See, Simon |
author_sort |
Cao, Haozhi |
title |
Effective action recognition with embedded key point shifts |
title_short |
Effective action recognition with embedded key point shifts |
title_full |
Effective action recognition with embedded key point shifts |
title_fullStr |
Effective action recognition with embedded key point shifts |
title_full_unstemmed |
Effective action recognition with embedded key point shifts |
title_sort |
effective action recognition with embedded key point shifts |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/162581 |
_version_ |
1749179156894056448 |