Skeleton-based online action prediction using scale selection network

Action prediction is to recognize the class label of an ongoing activity when only a part of it is observed. In this paper, we focus on online action prediction in streaming 3D skeleton sequences. A dilated convolutional network is introduced to model the motion dynamics in temporal dimension via a...

Full description

Saved in:
Bibliographic Details
Main Authors: Liu, Jun, Shahroudy, Amir, Wang, Gang, Duan, Ling-Yu, Kot, Alex C.
Other Authors: School of Electrical and Electronic Engineering
Format: Article
Language:English
Published: 2022
Subjects:
Online Access:https://hdl.handle.net/10356/154882
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-154882
record_format dspace
spelling sg-ntu-dr.10356-1548822022-01-13T02:00:15Z Skeleton-based online action prediction using scale selection network Liu, Jun Shahroudy, Amir Wang, Gang Duan, Ling-Yu Kot, Alex C. School of Electrical and Electronic Engineering Engineering::Electrical and electronic engineering Action Prediction Scale Selection Action prediction is to recognize the class label of an ongoing activity when only a part of it is observed. In this paper, we focus on online action prediction in streaming 3D skeleton sequences. A dilated convolutional network is introduced to model the motion dynamics in temporal dimension via a sliding window over the temporal axis. Since there are significant temporal scale variations in the observed part of the ongoing action at different time steps, a novel window scale selection method is proposed to make our network focus on the performed part of the ongoing action and try to suppress the possible incoming interference from the previous actions at each step. An activation sharing scheme is also proposed to handle the overlapping computations among the adjacent time steps, which enables our framework to run more efficiently. Moreover, to enhance the performance of our framework for action prediction with the skeletal input data, a hierarchy of dilated tree convolutions are also designed to learn the multi-level structured semantic representations over the skeleton joints at each frame. Our proposed approach is evaluated on four challenging datasets. The extensive experiments demonstrate the effectiveness of our method for skeleton-based online action prediction. National Research Foundation (NRF) ROSE Lab is supported by the National Research Foundation, Singapore, and the Infocomm Media Development Authority, Singapore. This work was supported in part by the National Basic Research Program of China under Grant 2015CB351806, and the National Natural Science Foundation of China under Grant 61661146005 and Grant U1611461. We acknowledge the NVIDIA AI Technology Centre (NVAITC) for the GPU donation. 2022-01-13T02:00:14Z 2022-01-13T02:00:14Z 2020 Journal Article Liu, J., Shahroudy, A., Wang, G., Duan, L. & Kot, A. C. (2020). Skeleton-based online action prediction using scale selection network. IEEE Transactions On Pattern Analysis and Machine Intelligence, 42(6), 1453-1467. https://dx.doi.org/10.1109/TPAMI.2019.2898954 0162-8828 https://hdl.handle.net/10356/154882 10.1109/TPAMI.2019.2898954 30762531 2-s2.0-85083666288 6 42 1453 1467 en IEEE Transactions on Pattern Analysis and Machine Intelligence © 2019 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
Action Prediction
Scale Selection
spellingShingle Engineering::Electrical and electronic engineering
Action Prediction
Scale Selection
Liu, Jun
Shahroudy, Amir
Wang, Gang
Duan, Ling-Yu
Kot, Alex C.
Skeleton-based online action prediction using scale selection network
description Action prediction is to recognize the class label of an ongoing activity when only a part of it is observed. In this paper, we focus on online action prediction in streaming 3D skeleton sequences. A dilated convolutional network is introduced to model the motion dynamics in temporal dimension via a sliding window over the temporal axis. Since there are significant temporal scale variations in the observed part of the ongoing action at different time steps, a novel window scale selection method is proposed to make our network focus on the performed part of the ongoing action and try to suppress the possible incoming interference from the previous actions at each step. An activation sharing scheme is also proposed to handle the overlapping computations among the adjacent time steps, which enables our framework to run more efficiently. Moreover, to enhance the performance of our framework for action prediction with the skeletal input data, a hierarchy of dilated tree convolutions are also designed to learn the multi-level structured semantic representations over the skeleton joints at each frame. Our proposed approach is evaluated on four challenging datasets. The extensive experiments demonstrate the effectiveness of our method for skeleton-based online action prediction.
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
Liu, Jun
Shahroudy, Amir
Wang, Gang
Duan, Ling-Yu
Kot, Alex C.
format Article
author Liu, Jun
Shahroudy, Amir
Wang, Gang
Duan, Ling-Yu
Kot, Alex C.
author_sort Liu, Jun
title Skeleton-based online action prediction using scale selection network
title_short Skeleton-based online action prediction using scale selection network
title_full Skeleton-based online action prediction using scale selection network
title_fullStr Skeleton-based online action prediction using scale selection network
title_full_unstemmed Skeleton-based online action prediction using scale selection network
title_sort skeleton-based online action prediction using scale selection network
publishDate 2022
url https://hdl.handle.net/10356/154882
_version_ 1722355395183247360