Skeleton based action recognition with graph convolutional networks
Human Action Recognition (HAR) has become more popular in the research field of computer vision in recent years. It has the goal of understanding human actions and motion from captured data, using deep learning methods, to be able to classify each action or motion with a specific label. It can be us...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/153996 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Summary: | Human Action Recognition (HAR) has become more popular in the research field of computer vision in recent years. It has the goal of understanding human actions and motion from captured data, using deep learning methods, to be able to classify each action or motion with a specific label. It can be used in a broad range application of computer vision, such as security surveillance, autonomous navigation systems and for human safety operations. Different data modalities exist that are available to process for human action recognition, such as skeleton, depth, infrared, radar. The use of skeleton data modality has also become more popular. Following the recent advancements in methods of information capture, and increased number of data sensors, the vast amount of data available leads to more data capacity required to process it. The increased size of data to process leads to a much higher computational cost to evaluate classifications of actions. To combat this, many different deep learning methods were developed to reduce the amount of computational cost while not sacrificing performance and accuracy.
With recent advancements in modelling techniques, newer methods of graph convolutional networks (GCNs) are used to model and classify human actions from skeleton data.
In this project, Shift-GCN and MS-G3D are the main models are used to classify human actions. |
---|