Deep learning on lie groups for skeleton-based action recognition

In recent years, skeleton-based action recognition has become a popular 3D classification problem. State-of-the-art methods typically first represent each motion sequence as a high-dimensional trajectory on a Lie group with an additional dynamic time warping, and then shallowly learn favorable Lie g...

Full description

Saved in:
Bibliographic Details
Main Authors: HUANG, Zhiwu, WAN, C., PROBST, T., VAN, Gool L.
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2017
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6390
https://ink.library.smu.edu.sg/context/sis_research/article/7393/viewcontent/Deep_learning.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7393
record_format dspace
spelling sg-smu-ink.sis_research-73932021-11-23T02:36:08Z Deep learning on lie groups for skeleton-based action recognition HUANG, Zhiwu WAN, C. PROBST, T. VAN, Gool L. In recent years, skeleton-based action recognition has become a popular 3D classification problem. State-of-the-art methods typically first represent each motion sequence as a high-dimensional trajectory on a Lie group with an additional dynamic time warping, and then shallowly learn favorable Lie group features. In this paper we incorporate the Lie group structure into a deep network architecture to learn more appropriate Lie group features for 3D action recognition. Within the network structure, we design rotation mapping layers to transform the input Lie group features into desirable ones, which are aligned better in the temporal domain. To reduce the high feature dimensionality, the architecture is equipped with rotation pooling layers for the elements on the Lie group. Furthermore, we propose a logarithm mapping layer to map the resulting manifold data into a tangent space that facilitates the application of regular output layers for the final classification. Evaluations of the proposed network for standard 3D human action recognition datasets clearly demonstrate its superiority over existing shallow Lie group feature learning methods as well as most conventional deep learning methods. 2017-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6390 info:doi/10.1109/CVPR.2017.137 https://ink.library.smu.edu.sg/context/sis_research/article/7393/viewcontent/Deep_learning.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Manifolds Three-dimensional displays Neural networks Machine learning Computer architecture Skeleton Transforms Databases and Information Systems Graphics and Human Computer Interfaces
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Manifolds
Three-dimensional displays
Neural networks
Machine learning
Computer architecture
Skeleton
Transforms
Databases and Information Systems
Graphics and Human Computer Interfaces
spellingShingle Manifolds
Three-dimensional displays
Neural networks
Machine learning
Computer architecture
Skeleton
Transforms
Databases and Information Systems
Graphics and Human Computer Interfaces
HUANG, Zhiwu
WAN, C.
PROBST, T.
VAN, Gool L.
Deep learning on lie groups for skeleton-based action recognition
description In recent years, skeleton-based action recognition has become a popular 3D classification problem. State-of-the-art methods typically first represent each motion sequence as a high-dimensional trajectory on a Lie group with an additional dynamic time warping, and then shallowly learn favorable Lie group features. In this paper we incorporate the Lie group structure into a deep network architecture to learn more appropriate Lie group features for 3D action recognition. Within the network structure, we design rotation mapping layers to transform the input Lie group features into desirable ones, which are aligned better in the temporal domain. To reduce the high feature dimensionality, the architecture is equipped with rotation pooling layers for the elements on the Lie group. Furthermore, we propose a logarithm mapping layer to map the resulting manifold data into a tangent space that facilitates the application of regular output layers for the final classification. Evaluations of the proposed network for standard 3D human action recognition datasets clearly demonstrate its superiority over existing shallow Lie group feature learning methods as well as most conventional deep learning methods.
format text
author HUANG, Zhiwu
WAN, C.
PROBST, T.
VAN, Gool L.
author_facet HUANG, Zhiwu
WAN, C.
PROBST, T.
VAN, Gool L.
author_sort HUANG, Zhiwu
title Deep learning on lie groups for skeleton-based action recognition
title_short Deep learning on lie groups for skeleton-based action recognition
title_full Deep learning on lie groups for skeleton-based action recognition
title_fullStr Deep learning on lie groups for skeleton-based action recognition
title_full_unstemmed Deep learning on lie groups for skeleton-based action recognition
title_sort deep learning on lie groups for skeleton-based action recognition
publisher Institutional Knowledge at Singapore Management University
publishDate 2017
url https://ink.library.smu.edu.sg/sis_research/6390
https://ink.library.smu.edu.sg/context/sis_research/article/7393/viewcontent/Deep_learning.pdf
_version_ 1770575951464235008