ADAPTATION OF MOBILENETV2-TSM FOR GESTURE BASED NATURAL USER INTERFACE

Bibliographic Details
Main Author: Saputra, Dion
Format: Final Project
Language: Indonesia
Online Access: https://digilib.itb.ac.id/gdl/view/50589
Institution: Institut Teknologi Bandung
id id-itb.:50589
spelling id-itb.:50589 2020-09-24T18:45:13Z ADAPTATION OF MOBILENETV2-TSM FOR GESTURE BASED NATURAL USER INTERFACE Saputra, Dion Indonesia Final Project user interface, hand gesture, temporal shift module. INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/50589 text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description Hand gestures can complement natural user interfaces and address some of their current shortcomings. Kopuklu et al. (2019) studied hand gesture recognition for interfaces using a 3D-CNN. That approach provides good accuracy but has a high computation time. In a separate study, Lin et al. (2019) proposed the Temporal Shift Module (TSM), which lets a 2D-CNN learn spatiotemporal features. TSM has a lower computation time than a 3D-CNN. However, it cannot directly replace the 3D-CNN model in the hand gesture interface, because the two approaches have different characteristics. In this final project, an adaptation was made so that TSM can be applied to the hand gesture interface. MobileNetV2 is used as the TSM backbone because of its low computation time. The adaptations adjust the hand gesture detection and activation mechanisms to the input and output characteristics of TSM. In the detection mechanism, the detector uses a motion detection algorithm so that hand gesture input can be processed frame by frame. In the activation mechanism, the weighted accuracy method of Kopuklu et al. (2019) is combined with a frame abandonment mechanism that is applied after activation. With these adaptations, the resulting hand gesture interface performs well in terms of both accuracy and computation time: it achieves 96.54% accuracy at a computation speed of 67 fps. This result is also better than that of the 3D-CNN-based approach.
format Final Project
author Saputra, Dion
spellingShingle Saputra, Dion
ADAPTATION OF MOBILENETV2-TSM FOR GESTURE BASED NATURAL USER INTERFACE
author_facet Saputra, Dion
author_sort Saputra, Dion
title ADAPTATION OF MOBILENETV2-TSM FOR GESTURE BASED NATURAL USER INTERFACE
title_sort adaptation of mobilenetv2-tsm for gesture based natural user interface
url https://digilib.itb.ac.id/gdl/view/50589
_version_ 1822928494615592960
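
The detection and activation mechanisms summarized in the description can be illustrated with a rough sketch. This is not the thesis implementation, and everything specific in it is an assumption made for illustration: frame differencing stands in for the unspecified motion detection algorithm, a linear weight ramp stands in for the weighted method of Kopuklu et al. (2019), and the names `motion_detected`, `WeightedActivator`, `run_interface`, the thresholds, and the skip length are all hypothetical. The `classify` argument is a placeholder for a per-frame MobileNetV2-TSM classifier.

```python
from typing import Callable, List, Optional, Sequence, Tuple

Frame = Sequence[float]  # hypothetical: a flattened grayscale frame


def motion_detected(prev: Frame, curr: Frame, threshold: float = 10.0) -> bool:
    """Frame differencing (assumed detector): mean absolute pixel change."""
    diff = sum(abs(a - b) for a, b in zip(prev, curr)) / len(curr)
    return diff > threshold


class WeightedActivator:
    """Accumulates per-frame class scores with a rising weight and fires
    once one class total reaches `fire_at` (all values are illustrative)."""

    def __init__(self, num_classes: int, fire_at: float = 2.0, ramp: int = 4):
        self.num_classes = num_classes
        self.fire_at = fire_at
        self.ramp = ramp
        self.reset()

    def reset(self) -> None:
        self.totals = [0.0] * self.num_classes
        self.step = 0

    def update(self, probs: Sequence[float]) -> Optional[int]:
        self.step += 1
        # Early frames of a gesture are ambiguous, so they count for less.
        weight = min(1.0, self.step / self.ramp)
        for c, p in enumerate(probs):
            self.totals[c] += weight * p
        best = max(range(self.num_classes), key=self.totals.__getitem__)
        if self.totals[best] >= self.fire_at:
            self.reset()
            return best
        return None


def run_interface(
    frames: Sequence[Frame],
    classify: Callable[[Frame], Sequence[float]],
    activator: WeightedActivator,
    skip_after_fire: int = 5,
) -> List[Tuple[int, int]]:
    """Returns (frame_index, class_id) for every activated gesture."""
    events: List[Tuple[int, int]] = []
    skip = 0
    prev = frames[0]
    for i in range(1, len(frames)):
        curr = frames[i]
        moving = motion_detected(prev, curr)
        prev = curr
        if skip > 0:
            # Frame abandonment: ignore frames right after an activation.
            skip -= 1
            continue
        if not moving:
            # Detector gate: idle frames never reach the classifier.
            activator.reset()
            continue
        label = activator.update(classify(curr))
        if label is not None:
            events.append((i, label))
            skip = skip_after_fire
    return events
```

In this sketch the classifier runs only on frames that pass the motion gate and are not being abandoned, which is one plausible way a per-frame model could keep computation low; the actual design choices in the thesis may differ.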