Skeleton based action recognition with graph convolutional networks

Human Action Recognition (HAR) has become more popular in the research field of computer vision in recent years. It has the goal of understanding human actions and motion from captured data, using deep learning methods, to be able to classify each action or motion with a specific label. It can be us...

Full description

Saved in:
Bibliographic Details
Main Author: Han, Jia Yi
Other Authors: Alex Chichung Kot
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/153996
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:Human Action Recognition (HAR) has become more popular in the research field of computer vision in recent years. It has the goal of understanding human actions and motion from captured data, using deep learning methods, to be able to classify each action or motion with a specific label. It can be used in a broad range application of computer vision, such as security surveillance, autonomous navigation systems and for human safety operations. Different data modalities exist that are available to process for human action recognition, such as skeleton, depth, infrared, radar. The use of skeleton data modality has also become more popular. Following the recent advancements in methods of information capture, and increased number of data sensors, the vast amount of data available leads to more data capacity required to process it. The increased size of data to process leads to a much higher computational cost to evaluate classifications of actions. To combat this, many different deep learning methods were developed to reduce the amount of computational cost while not sacrificing performance and accuracy. With recent advancements in modelling techniques, newer methods of graph convolutional networks (GCNs) are used to model and classify human actions from skeleton data. In this project, Shift-GCN and MS-G3D are the main models are used to classify human actions.