Recognizing human actions as the evolution of pose estimation maps
Most video-based action recognition approaches choose to extract features from the whole video to recognize actions. The cluttered background and non-action motions limit the performances of these methods, since they lack the explicit modeling of human body movements. With recent advances of human p...
Saved in:
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/143208 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-143208 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1432082020-08-12T07:07:25Z Recognizing human actions as the evolution of pose estimation maps Liu, Mengyuan Yuan, Junsong School of Electrical and Electronic Engineering 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Engineering::Electrical and electronic engineering Pose Estimation Shape Most video-based action recognition approaches choose to extract features from the whole video to recognize actions. The cluttered background and non-action motions limit the performances of these methods, since they lack the explicit modeling of human body movements. With recent advances of human pose estimation, this work presents a novel method to recognize human action as the evolution of pose estimation maps. Instead of relying on the inaccurate human poses estimated from videos, we observe that pose estimation maps, the byproduct of pose estimation, preserve richer cues of human body to benefit action recognition. Specifically, the evolution of pose estimation maps can be decomposed as an evolution of heatmaps, e.g., probabilistic maps, and an evolution of estimated 2D human poses, which denote the changes of body shape and body pose, respectively. Considering the sparse property of heatmap, we develop spatial rank pooling to aggregate the evolution of heatmaps as a body shape evolution image. As body shape evolution image does not differentiate body parts, we design body guided sampling to aggregate the evolution of poses as a body pose evolution image. The complementary properties between both types of images are explored by deep convolutional neural networks to predict action label. Experiments on NTU RGB+D, UTD-MHAD and PennAction datasets verify the effectiveness of our method, which outperforms most state-of-the-art methods. Ministry of Education (MOE) Accepted version This work is supported in part by Singapore Ministry of Education Academic Research Fund Tier 2 MOE2015-T2-2-114 and start-up funds of University at Buffalo. 2020-08-12T07:07:25Z 2020-08-12T07:07:25Z 2018 Conference Paper Liu, M., & Yuan, J. (2018). Recognizing human actions as the evolution of pose estimation maps. Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1159-1168. doi:10.1109/cvpr.2018.00127 978-1-5386-6421-6 https://hdl.handle.net/10356/143208 10.1109/cvpr.2018.00127 2-s2.0-85062833686 1159 1168 en © 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/cvpr.2018.00127 application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering Pose Estimation Shape |
spellingShingle |
Engineering::Electrical and electronic engineering Pose Estimation Shape Liu, Mengyuan Yuan, Junsong Recognizing human actions as the evolution of pose estimation maps |
description |
Most video-based action recognition approaches choose to extract features from the whole video to recognize actions. The cluttered background and non-action motions limit the performances of these methods, since they lack the explicit modeling of human body movements. With recent advances of human pose estimation, this work presents a novel method to recognize human action as the evolution of pose estimation maps. Instead of relying on the inaccurate human poses estimated from videos, we observe that pose estimation maps, the byproduct of pose estimation, preserve richer cues of human body to benefit action recognition. Specifically, the evolution of pose estimation maps can be decomposed as an evolution of heatmaps, e.g., probabilistic maps, and an evolution of estimated 2D human poses, which denote the changes of body shape and body pose, respectively. Considering the sparse property of heatmap, we develop spatial rank pooling to aggregate the evolution of heatmaps as a body shape evolution image. As body shape evolution image does not differentiate body parts, we design body guided sampling to aggregate the evolution of poses as a body pose evolution image. The complementary properties between both types of images are explored by deep convolutional neural networks to predict action label. Experiments on NTU RGB+D, UTD-MHAD and PennAction datasets verify the effectiveness of our method, which outperforms most state-of-the-art methods. |
author2 |
School of Electrical and Electronic Engineering |
author_facet |
School of Electrical and Electronic Engineering Liu, Mengyuan Yuan, Junsong |
format |
Conference or Workshop Item |
author |
Liu, Mengyuan Yuan, Junsong |
author_sort |
Liu, Mengyuan |
title |
Recognizing human actions as the evolution of pose estimation maps |
title_short |
Recognizing human actions as the evolution of pose estimation maps |
title_full |
Recognizing human actions as the evolution of pose estimation maps |
title_fullStr |
Recognizing human actions as the evolution of pose estimation maps |
title_full_unstemmed |
Recognizing human actions as the evolution of pose estimation maps |
title_sort |
recognizing human actions as the evolution of pose estimation maps |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/143208 |
_version_ |
1681059352526454784 |