Deep CNN-LSTM supervised model and CNN self-supervised model for human activity recognition

Human Activity Recognition (HAR) has garnered significant interest from researchers over the past decades. With the rapid development of wearable sensor technology and the wide availability of smart devices with embedded sensors, e.g., the accelerometers and gyroscopes in smartphones, HAR has recently become a popular field of res...

Bibliographic Details
Main Author: Liao, Zixin
Other Authors: Kwoh Chee Keong
Format: Final Year Project
Language: English
Published: Nanyang Technological University 2023
Subjects:
Online Access: https://hdl.handle.net/10356/166096
Institution: Nanyang Technological University
id sg-ntu-dr.10356-166096
record_format dspace
spelling sg-ntu-dr.10356-166096 2023-04-21T15:38:36Z
Liao, Zixin
Kwoh Chee Keong
School of Computer Science and Engineering
Biomedical Informatics Lab
ASCKKWOH@ntu.edu.sg
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Bachelor of Engineering (Computer Science)
2023-04-21T06:30:30Z
2023
Final Year Project (FYP)
Liao, Z. (2023). Deep CNN-LSTM supervised model and CNN self-supervised model for human activity recognition. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/166096
en
PSCSE21-0033
doi:10.21979/N9/0SYHTZ
doi:10.21979/N9/OWDFXO
application/pdf
Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
description Human Activity Recognition (HAR) has garnered significant interest from researchers over the past decades. With the rapid development of wearable sensor technology and the wide availability of smart devices with embedded sensors, e.g., the accelerometers and gyroscopes in smartphones, HAR has recently become a popular field of research. In this paper, we propose a framework for HAR data classification that automatically extracts spatial and temporal features from smart-device sensor data. This is achieved via a hybrid supervised learning architecture that consists of a Convolutional Neural Network (CNN) and a Long Short-Term Memory (LSTM) network. However, supervised learning typically requires a large amount of labeled data, which is often unavailable in real-world scenarios because of data privacy concerns and the high cost of manual labeling. Learning from the large amounts of unlabeled data therefore becomes crucial. To this end, we propose a self-supervised learning (SSL) framework that learns useful representations from unlabeled HAR sensor data. Our framework consists of two stages: 1) self-supervised pre-training, where we propose a set of pretext tasks to help the model learn from unlabeled data, and 2) fine-tuning the pre-trained model on the original HAR task with the few available labeled samples. The results demonstrate that our SSL approach significantly improves model performance compared to supervised training when labeled samples are limited. In addition, by fine-tuning the simple 1-D CNN self-supervised pre-trained model using only 5% of the labeled data, we attain performance comparable to that of the more complex CNN-LSTM model trained with supervision on the full set of labels. Lastly, we observe that self-supervised pre-training helps the models develop robustness to data imbalance. The source code is available at https://github.com/LizLicense/HAR-CNN-LSTM-ATT-pyTorch.git.
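The two-stage data flow described in the abstract can be illustrated with a minimal, standard-library-only sketch. The record does not name the pretext tasks, so this example assumes a common choice for sensor data, transformation recognition, in which the model is trained to predict which signal transformation was applied to an unlabeled window. The four transformations, the function names `make_pretext_dataset` and `stratified_subset`, and the window layout (a list of time steps, each a list of channel values) are all hypothetical and are not taken from the thesis or its repository.

```python
import random
from collections import defaultdict

# Hypothetical pretext transformations (an assumption; the record does not list them).
# A window is a list of time steps, each a list of channel values (e.g. x, y, z).
def identity(window, rng):
    return [row[:] for row in window]

def negate(window, rng):
    return [[-v for v in row] for row in window]

def time_reverse(window, rng):
    return [row[:] for row in window[::-1]]

def scale(window, rng):
    factor = rng.uniform(1.5, 2.0)  # random amplitude scaling
    return [[v * factor for v in row] for row in window]

TRANSFORMS = [identity, negate, time_reverse, scale]

def make_pretext_dataset(unlabeled_windows, seed=0):
    """Stage 1 data: each window under each transform, labeled by transform index."""
    rng = random.Random(seed)
    samples = []
    for window in unlabeled_windows:
        for label, transform in enumerate(TRANSFORMS):
            samples.append((transform(window, rng), label))
    rng.shuffle(samples)
    return samples

def stratified_subset(labels, fraction, seed=0):
    """Stage 2 data: indices of a per-class random subset, at least one per class."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, y in enumerate(labels):
        by_class[y].append(idx)
    chosen = []
    for y in sorted(by_class):
        idxs = by_class[y]
        k = max(1, round(len(idxs) * fraction))
        chosen.extend(rng.sample(idxs, k))
    return sorted(chosen)
```

Under these assumptions, an encoder would first be trained on the output of `make_pretext_dataset`, which needs no activity labels, and then fine-tuned on the windows selected by `stratified_subset(labels, 0.05)`, mirroring the 5% labeled-data setting mentioned in the abstract. Sampling per class is a design choice made here so that rare activities are not dropped from a very small labeled subset.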
author2 Kwoh Chee Keong
author_facet Kwoh Chee Keong
Liao, Zixin
format Final Year Project
author Liao, Zixin
author_sort Liao, Zixin
title Deep CNN-LSTM supervised model and CNN self-supervised model for human activity recognition
publisher Nanyang Technological University
publishDate 2023
url https://hdl.handle.net/10356/166096
_version_ 1764208075753914368