Point cloud-based action recognition

Action recognition has received a lot of attention in computer vision tasks. It aims to capture and classify the action in a certain input like a video. In this project, we firstly thoroughly review two major methods for action recognition tasks, i.e., the skeleton-based method and the point cloud-b...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلف الرئيسي: Zhou, Chenhang
مؤلفون آخرون: Alex Chichung Kot
التنسيق: Final Year Project
اللغة:English
منشور في: Nanyang Technological University 2022
الموضوعات:
الوصول للمادة أونلاين:https://hdl.handle.net/10356/157837
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
المؤسسة: Nanyang Technological University
اللغة: English
الوصف
الملخص:Action recognition has received a lot of attention in computer vision tasks. It aims to capture and classify the action in a certain input like a video. In this project, we firstly thoroughly review two major methods for action recognition tasks, i.e., the skeleton-based method and the point cloud-based method. Second, to enhance video understanding, we select LSTM architecture which has proved to be competent in sequential understanding. We design sampling mechanisms for a video frame and point cloud to efficiently express the point cloud raw data for training on the LSTM network with NTU RGB+D 60 Dataset, contributed by NTU ROSE Lab. To investigate the relation within training parameters, multiple experiments are implemented based on PointLSTM architecture and we conclude model performance by evaluating the batch size, frame rate, and points number accordingly. We also find model discrepancies within different action classes based on the confusion matrix. Further, we compare different LSTM stages’ effects on model accuracy. Last, considering the nature of point cloud data, we conclude this project and make recommendations for further work.