Point cloud-based action recognition

Action recognition has received a lot of attention in computer vision tasks. It aims to capture and classify the action in a certain input like a video. In this project, we firstly thoroughly review two major methods for action recognition tasks, i.e., the skeleton-based method and the point cloud-b...

全面介紹

Saved in:
書目詳細資料
主要作者: Zhou, Chenhang
其他作者: Alex Chichung Kot
格式: Final Year Project
語言:English
出版: Nanyang Technological University 2022
主題:
在線閱讀:https://hdl.handle.net/10356/157837
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
實物特徵
總結:Action recognition has received a lot of attention in computer vision tasks. It aims to capture and classify the action in a certain input like a video. In this project, we firstly thoroughly review two major methods for action recognition tasks, i.e., the skeleton-based method and the point cloud-based method. Second, to enhance video understanding, we select LSTM architecture which has proved to be competent in sequential understanding. We design sampling mechanisms for a video frame and point cloud to efficiently express the point cloud raw data for training on the LSTM network with NTU RGB+D 60 Dataset, contributed by NTU ROSE Lab. To investigate the relation within training parameters, multiple experiments are implemented based on PointLSTM architecture and we conclude model performance by evaluating the batch size, frame rate, and points number accordingly. We also find model discrepancies within different action classes based on the confusion matrix. Further, we compare different LSTM stages’ effects on model accuracy. Last, considering the nature of point cloud data, we conclude this project and make recommendations for further work.