Skeleton-based human action recognition with graph neural networks

Skeleton-based action recognition is a long-standing task in computer vision which aims to distinguish different human actions by identifying their unique characteristic patterns in the input data. Most of the existing GCN-based models developed for this task primarily model the skeleton graph as ei...

Full description

Saved in:

Bibliographic Details
Main Author:	U S Vaitesswar
Other Authors:	Yeo Chai Kiat
Format:	Thesis-Master by Research
Language:	English
Published:	Nanyang Technological University 2022
Subjects:	Engineering::Computer science and engineering
Online Access:	https://hdl.handle.net/10356/156866
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-156866
record_format	dspace
spelling	sg-ntu-dr.10356-1568662022-05-04T10:23:16Z Skeleton-based human action recognition with graph neural networks U S Vaitesswar Yeo Chai Kiat School of Computer Science and Engineering ASCKYEO@ntu.edu.sg Engineering::Computer science and engineering Skeleton-based action recognition is a long-standing task in computer vision which aims to distinguish different human actions by identifying their unique characteristic patterns in the input data. Most of the existing GCN-based models developed for this task primarily model the skeleton graph as either directed or undirected. Furthermore, these models also restrict the receptive field in the temporal domain to a fixed range which significantly inhibits their expressibility. Therefore, a mixed graph network comprising both directed and undirected graph networks with a multi-range temporal module called MMGCN is proposed. In this way, the model can benefit from the different interpretations of the same action by the different graphs. Adding on, the multi-range temporal module enhances the model’s expressibility as it can choose the appropriate receptive field for each layer, thus allowing the model to dynamically adapt to the input data. With this lightweight MMGCN model, it is shown that deep learning models can learn the underlying patterns in the data and model large receptive fields without additional semantics or high model complexity. Finally, this model achieved state-of-the-art results on benchmark datasets: NTU-RGB+D, NTU-RGB+D 120, Skeleton-Kinetics and Northwestern-UCLA despite its low model complexity thus proving its effectiveness. An additional study was conducted to weigh the importance of model complexity (i.e. more nuanced architecture) against ensemble model learning (i.e. multiple input streams). The insights derived from this study will be useful for future models developed for skeleton-based action recognition task. Master of Engineering 2022-04-26T06:25:10Z 2022-04-26T06:25:10Z 2022 Thesis-Master by Research U S Vaitesswar (2022). Skeleton-based human action recognition with graph neural networks. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156866 https://hdl.handle.net/10356/156866 10.32657/10356/156866 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering
spellingShingle	Engineering::Computer science and engineering U S Vaitesswar Skeleton-based human action recognition with graph neural networks
description	Skeleton-based action recognition is a long-standing task in computer vision which aims to distinguish different human actions by identifying their unique characteristic patterns in the input data. Most of the existing GCN-based models developed for this task primarily model the skeleton graph as either directed or undirected. Furthermore, these models also restrict the receptive field in the temporal domain to a fixed range which significantly inhibits their expressibility. Therefore, a mixed graph network comprising both directed and undirected graph networks with a multi-range temporal module called MMGCN is proposed. In this way, the model can benefit from the different interpretations of the same action by the different graphs. Adding on, the multi-range temporal module enhances the model’s expressibility as it can choose the appropriate receptive field for each layer, thus allowing the model to dynamically adapt to the input data. With this lightweight MMGCN model, it is shown that deep learning models can learn the underlying patterns in the data and model large receptive fields without additional semantics or high model complexity. Finally, this model achieved state-of-the-art results on benchmark datasets: NTU-RGB+D, NTU-RGB+D 120, Skeleton-Kinetics and Northwestern-UCLA despite its low model complexity thus proving its effectiveness. An additional study was conducted to weigh the importance of model complexity (i.e. more nuanced architecture) against ensemble model learning (i.e. multiple input streams). The insights derived from this study will be useful for future models developed for skeleton-based action recognition task.
author2	Yeo Chai Kiat
author_facet	Yeo Chai Kiat U S Vaitesswar
format	Thesis-Master by Research
author	U S Vaitesswar
author_sort	U S Vaitesswar
title	Skeleton-based human action recognition with graph neural networks
title_short	Skeleton-based human action recognition with graph neural networks
title_full	Skeleton-based human action recognition with graph neural networks
title_fullStr	Skeleton-based human action recognition with graph neural networks
title_full_unstemmed	Skeleton-based human action recognition with graph neural networks
title_sort	skeleton-based human action recognition with graph neural networks
publisher	Nanyang Technological University
publishDate	2022
url	https://hdl.handle.net/10356/156866
_version_	1734310299403550720

Skeleton-based human action recognition with graph neural networks

Similar Items