Gesture recognition in car cabin environment

As a research hotspot in the field of computer vision, gesture recognition has received extensive attention in recent years. With the rapid development of intelligent driving technology, in-car human-computer interaction has gradually become an important research direction. Gesture recognition shows...

Full description

Saved in:
Bibliographic Details
Main Author: He, Siqi
Other Authors: Yap Kim Hui
Format: Thesis-Master by Coursework
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/181406
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-181406
record_format dspace
spelling sg-ntu-dr.10356-1814062024-12-06T15:49:18Z Gesture recognition in car cabin environment He, Siqi Yap Kim Hui School of Electrical and Electronic Engineering EKHYap@ntu.edu.sg Computer and Information Science Gesture recognition 3D convolutional neural networks Res-C3D network LSTM Attention mechanisms As a research hotspot in the field of computer vision, gesture recognition has received extensive attention in recent years. With the rapid development of intelligent driving technology, in-car human-computer interaction has gradually become an important research direction. Gesture recognition shows a wide range of application prospects because of its natural and intuitive interaction mode. This dissertation aims to implement and optimize in-car gesture recognition technology and compare the performance of different deep learning models based on the NVIDIA Gesture Dynamic Hand Gesture (NVGesture) Dataset. After preprocessing the video dataset, I implemented and evaluated the basic 3DCNN model, ResNet model, ResNet model with attention, CNN-LSTM model, and CNN-LSTM model with attention for in-depth analysis of their performance in gesture recognition tasks. The experimental results show that the 3DCNN model achieved a lower accuracy of approximately 78.97% during the gesture recognition tasks. Then, the structure of ResNet models, particularly with attention mechanisms, showed slight improvements in performance. Ultimately, the CNN-LSTM model, especially the version enhanced with attention mechanisms, demonstrated the highest accuracy, reaching 95.14%. The research shows that gesture recognition technology based on the NVGesture dataset has great application potential in the intelligent interaction system in the car, and can be further applied in the driver assistance system and intelligent cockpit in the future. Master's degree 2024-12-02T02:17:52Z 2024-12-02T02:17:52Z 2024 Thesis-Master by Coursework He, S. (2024). Gesture recognition in car cabin environment. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181406 https://hdl.handle.net/10356/181406 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Gesture recognition
3D convolutional neural networks
Res-C3D network
LSTM
Attention mechanisms
spellingShingle Computer and Information Science
Gesture recognition
3D convolutional neural networks
Res-C3D network
LSTM
Attention mechanisms
He, Siqi
Gesture recognition in car cabin environment
description As a research hotspot in the field of computer vision, gesture recognition has received extensive attention in recent years. With the rapid development of intelligent driving technology, in-car human-computer interaction has gradually become an important research direction. Gesture recognition shows a wide range of application prospects because of its natural and intuitive interaction mode. This dissertation aims to implement and optimize in-car gesture recognition technology and compare the performance of different deep learning models based on the NVIDIA Gesture Dynamic Hand Gesture (NVGesture) Dataset. After preprocessing the video dataset, I implemented and evaluated the basic 3DCNN model, ResNet model, ResNet model with attention, CNN-LSTM model, and CNN-LSTM model with attention for in-depth analysis of their performance in gesture recognition tasks. The experimental results show that the 3DCNN model achieved a lower accuracy of approximately 78.97% during the gesture recognition tasks. Then, the structure of ResNet models, particularly with attention mechanisms, showed slight improvements in performance. Ultimately, the CNN-LSTM model, especially the version enhanced with attention mechanisms, demonstrated the highest accuracy, reaching 95.14%. The research shows that gesture recognition technology based on the NVGesture dataset has great application potential in the intelligent interaction system in the car, and can be further applied in the driver assistance system and intelligent cockpit in the future.
author2 Yap Kim Hui
author_facet Yap Kim Hui
He, Siqi
format Thesis-Master by Coursework
author He, Siqi
author_sort He, Siqi
title Gesture recognition in car cabin environment
title_short Gesture recognition in car cabin environment
title_full Gesture recognition in car cabin environment
title_fullStr Gesture recognition in car cabin environment
title_full_unstemmed Gesture recognition in car cabin environment
title_sort gesture recognition in car cabin environment
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/181406
_version_ 1819113060445454336