Deep learning on lie groups for skeleton-based action recognition
In recent years, skeleton-based action recognition has become a popular 3D classification problem. State-of-the-art methods typically first represent each motion sequence as a high-dimensional trajectory on a Lie group with an additional dynamic time warping, and then shallowly learn favorable Lie g...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2017
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/6390 https://ink.library.smu.edu.sg/context/sis_research/article/7393/viewcontent/Deep_learning.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-7393 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-73932021-11-23T02:36:08Z Deep learning on lie groups for skeleton-based action recognition HUANG, Zhiwu WAN, C. PROBST, T. VAN, Gool L. In recent years, skeleton-based action recognition has become a popular 3D classification problem. State-of-the-art methods typically first represent each motion sequence as a high-dimensional trajectory on a Lie group with an additional dynamic time warping, and then shallowly learn favorable Lie group features. In this paper we incorporate the Lie group structure into a deep network architecture to learn more appropriate Lie group features for 3D action recognition. Within the network structure, we design rotation mapping layers to transform the input Lie group features into desirable ones, which are aligned better in the temporal domain. To reduce the high feature dimensionality, the architecture is equipped with rotation pooling layers for the elements on the Lie group. Furthermore, we propose a logarithm mapping layer to map the resulting manifold data into a tangent space that facilitates the application of regular output layers for the final classification. Evaluations of the proposed network for standard 3D human action recognition datasets clearly demonstrate its superiority over existing shallow Lie group feature learning methods as well as most conventional deep learning methods. 2017-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6390 info:doi/10.1109/CVPR.2017.137 https://ink.library.smu.edu.sg/context/sis_research/article/7393/viewcontent/Deep_learning.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Manifolds Three-dimensional displays Neural networks Machine learning Computer architecture Skeleton Transforms Databases and Information Systems Graphics and Human Computer Interfaces |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Manifolds Three-dimensional displays Neural networks Machine learning Computer architecture Skeleton Transforms Databases and Information Systems Graphics and Human Computer Interfaces |
spellingShingle |
Manifolds Three-dimensional displays Neural networks Machine learning Computer architecture Skeleton Transforms Databases and Information Systems Graphics and Human Computer Interfaces HUANG, Zhiwu WAN, C. PROBST, T. VAN, Gool L. Deep learning on lie groups for skeleton-based action recognition |
description |
In recent years, skeleton-based action recognition has become a popular 3D classification problem. State-of-the-art methods typically first represent each motion sequence as a high-dimensional trajectory on a Lie group with an additional dynamic time warping, and then shallowly learn favorable Lie group features. In this paper we incorporate the Lie group structure into a deep network architecture to learn more appropriate Lie group features for 3D action recognition. Within the network structure, we design rotation mapping layers to transform the input Lie group features into desirable ones, which are aligned better in the temporal domain. To reduce the high feature dimensionality, the architecture is equipped with rotation pooling layers for the elements on the Lie group. Furthermore, we propose a logarithm mapping layer to map the resulting manifold data into a tangent space that facilitates the application of regular output layers for the final classification. Evaluations of the proposed network for standard 3D human action recognition datasets clearly demonstrate its superiority over existing shallow Lie group feature learning methods as well as most conventional deep learning methods. |
format |
text |
author |
HUANG, Zhiwu WAN, C. PROBST, T. VAN, Gool L. |
author_facet |
HUANG, Zhiwu WAN, C. PROBST, T. VAN, Gool L. |
author_sort |
HUANG, Zhiwu |
title |
Deep learning on lie groups for skeleton-based action recognition |
title_short |
Deep learning on lie groups for skeleton-based action recognition |
title_full |
Deep learning on lie groups for skeleton-based action recognition |
title_fullStr |
Deep learning on lie groups for skeleton-based action recognition |
title_full_unstemmed |
Deep learning on lie groups for skeleton-based action recognition |
title_sort |
deep learning on lie groups for skeleton-based action recognition |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2017 |
url |
https://ink.library.smu.edu.sg/sis_research/6390 https://ink.library.smu.edu.sg/context/sis_research/article/7393/viewcontent/Deep_learning.pdf |
_version_ |
1770575951464235008 |