Recognizing human actions as the evolution of pose estimation maps

Most video-based action recognition approaches choose to extract features from the whole video to recognize actions. The cluttered background and non-action motions limit the performances of these methods, since they lack the explicit modeling of human body movements. With recent advances of human p...

Full description

Saved in:
Bibliographic Details
Main Authors: Liu, Mengyuan, Yuan, Junsong
Other Authors: School of Electrical and Electronic Engineering
Format: Conference or Workshop Item
Language:English
Published: 2020
Subjects:
Online Access:https://hdl.handle.net/10356/143208
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-143208
record_format dspace
spelling sg-ntu-dr.10356-1432082020-08-12T07:07:25Z Recognizing human actions as the evolution of pose estimation maps Liu, Mengyuan Yuan, Junsong School of Electrical and Electronic Engineering 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Engineering::Electrical and electronic engineering Pose Estimation Shape Most video-based action recognition approaches choose to extract features from the whole video to recognize actions. The cluttered background and non-action motions limit the performances of these methods, since they lack the explicit modeling of human body movements. With recent advances of human pose estimation, this work presents a novel method to recognize human action as the evolution of pose estimation maps. Instead of relying on the inaccurate human poses estimated from videos, we observe that pose estimation maps, the byproduct of pose estimation, preserve richer cues of human body to benefit action recognition. Specifically, the evolution of pose estimation maps can be decomposed as an evolution of heatmaps, e.g., probabilistic maps, and an evolution of estimated 2D human poses, which denote the changes of body shape and body pose, respectively. Considering the sparse property of heatmap, we develop spatial rank pooling to aggregate the evolution of heatmaps as a body shape evolution image. As body shape evolution image does not differentiate body parts, we design body guided sampling to aggregate the evolution of poses as a body pose evolution image. The complementary properties between both types of images are explored by deep convolutional neural networks to predict action label. Experiments on NTU RGB+D, UTD-MHAD and PennAction datasets verify the effectiveness of our method, which outperforms most state-of-the-art methods. Ministry of Education (MOE) Accepted version This work is supported in part by Singapore Ministry of Education Academic Research Fund Tier 2 MOE2015-T2-2-114 and start-up funds of University at Buffalo. 2020-08-12T07:07:25Z 2020-08-12T07:07:25Z 2018 Conference Paper Liu, M., & Yuan, J. (2018). Recognizing human actions as the evolution of pose estimation maps. Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1159-1168. doi:10.1109/cvpr.2018.00127 978-1-5386-6421-6 https://hdl.handle.net/10356/143208 10.1109/cvpr.2018.00127 2-s2.0-85062833686 1159 1168 en © 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/cvpr.2018.00127 application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
Pose Estimation
Shape
spellingShingle Engineering::Electrical and electronic engineering
Pose Estimation
Shape
Liu, Mengyuan
Yuan, Junsong
Recognizing human actions as the evolution of pose estimation maps
description Most video-based action recognition approaches choose to extract features from the whole video to recognize actions. The cluttered background and non-action motions limit the performances of these methods, since they lack the explicit modeling of human body movements. With recent advances of human pose estimation, this work presents a novel method to recognize human action as the evolution of pose estimation maps. Instead of relying on the inaccurate human poses estimated from videos, we observe that pose estimation maps, the byproduct of pose estimation, preserve richer cues of human body to benefit action recognition. Specifically, the evolution of pose estimation maps can be decomposed as an evolution of heatmaps, e.g., probabilistic maps, and an evolution of estimated 2D human poses, which denote the changes of body shape and body pose, respectively. Considering the sparse property of heatmap, we develop spatial rank pooling to aggregate the evolution of heatmaps as a body shape evolution image. As body shape evolution image does not differentiate body parts, we design body guided sampling to aggregate the evolution of poses as a body pose evolution image. The complementary properties between both types of images are explored by deep convolutional neural networks to predict action label. Experiments on NTU RGB+D, UTD-MHAD and PennAction datasets verify the effectiveness of our method, which outperforms most state-of-the-art methods.
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
Liu, Mengyuan
Yuan, Junsong
format Conference or Workshop Item
author Liu, Mengyuan
Yuan, Junsong
author_sort Liu, Mengyuan
title Recognizing human actions as the evolution of pose estimation maps
title_short Recognizing human actions as the evolution of pose estimation maps
title_full Recognizing human actions as the evolution of pose estimation maps
title_fullStr Recognizing human actions as the evolution of pose estimation maps
title_full_unstemmed Recognizing human actions as the evolution of pose estimation maps
title_sort recognizing human actions as the evolution of pose estimation maps
publishDate 2020
url https://hdl.handle.net/10356/143208
_version_ 1681059352526454784