Point cloud-based action recognition

Bibliographic Details
Main Author: Zhou, Chenhang
Other Authors: Alex Chichung Kot
Format: Final Year Project
Language: English
Published: Nanyang Technological University 2022
Subjects:
Online Access: https://hdl.handle.net/10356/157837
Institution: Nanyang Technological University
Description
Summary: Action recognition has received considerable attention in computer vision. It aims to capture and classify the action in a given input, such as a video. In this project, we first review two major approaches to action recognition: the skeleton-based method and the point cloud-based method. Second, to improve video understanding, we adopt the LSTM architecture, which has proved effective at modeling sequential data. We design sampling mechanisms for video frames and point clouds so that the raw point cloud data can be represented efficiently for training an LSTM network on the NTU RGB+D 60 dataset, contributed by the NTU ROSE Lab. To investigate the relationships among training parameters, we run multiple experiments based on the PointLSTM architecture and assess model performance as the batch size, frame rate, and number of points vary. Using the confusion matrix, we also find that model performance differs across action classes. Further, we compare the effect of different LSTM stages on model accuracy. Finally, considering the nature of point cloud data, we conclude the project and make recommendations for further work.
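
The summary mentions sampling both video frames and the points within each frame but does not describe the mechanism. The following is only a minimal sketch of one common way to do this, assuming uniform temporal sampling of frames and random sampling of a fixed number of points per frame. All function names, the padding strategy, and the parameter values are hypothetical and are not taken from the project.

```python
import random


def sample_frame_indices(num_frames, num_samples):
    """Pick num_samples frame indices spread evenly over a clip.

    Assumption: uniform temporal sampling. Each index is the center of one
    of num_samples equal segments, so short clips repeat frames.
    """
    if num_frames <= 0 or num_samples <= 0:
        raise ValueError("num_frames and num_samples must be positive")
    return [
        min(num_frames - 1, int((i + 0.5) * num_frames / num_samples))
        for i in range(num_samples)
    ]


def sample_points(points, num_points, rng):
    """Return exactly num_points points from one frame's point cloud.

    Assumption: random sampling without replacement when the frame has
    enough points; otherwise keep every point and pad with random repeats.
    """
    if not points:
        raise ValueError("cannot sample from an empty point cloud")
    if len(points) >= num_points:
        return rng.sample(points, num_points)
    return list(points) + rng.choices(points, k=num_points - len(points))


def sample_clip(clip, num_frames, num_points, seed=0):
    """Reduce a clip (list of frames, each a list of (x, y, z) tuples)
    to a fixed-size sequence of num_frames frames of num_points points."""
    rng = random.Random(seed)
    indices = sample_frame_indices(len(clip), num_frames)
    return [sample_points(clip[i], num_points, rng) for i in indices]


if __name__ == "__main__":
    # Synthetic clip: 10 frames with a varying number of points per frame.
    clip = [
        [(float(f), float(p), 0.0) for p in range(50 + f)]
        for f in range(10)
    ]
    fixed = sample_clip(clip, num_frames=4, num_points=64)
    print(len(fixed), [len(frame) for frame in fixed])
```

A fixed number of frames and points per clip gives every training sample the same shape, which is what makes it possible to vary the frame rate and number of points as experimental parameters in the way the summary describes.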