Digital Theremin with computer vision

This study presents the methodology and development of a digital Theremin using MediaPipe to enable gesture-based controls for pitch, volume, and vibrato. Through a comprehensive user study, the project evaluated the intuitiveness of the gesture-based controls and the produced sound quality with...

Full description

Saved in:
Bibliographic Details
Main Author: Chua, Ryuichi
Other Authors: Goh Wooi Boon
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/175253
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-175253
record_format dspace
spelling sg-ntu-dr.10356-1752532024-04-26T15:41:17Z Digital Theremin with computer vision Chua, Ryuichi Goh Wooi Boon School of Computer Science and Engineering ASWBGOH@ntu.edu.sg Computer and Information Science Computer vision Human computer interaction Mediapipe Theremin This study presents the methodology and development of a digital Theremin using MediaPipe to enable gesture-based controls for pitch, volume, and vibrato. Through a comprehensive user study, the project evaluated the intuitiveness of the gesture-based controls and the produced sound quality with PyAudio. The digital Theremin was generally well-received, with participants demonstrating high controllability and showing a strong preference for a particular sound version noted for its warmth and expressiveness. Challenges identified include weak detection at screen edges and precise note-playing. Future work includes enhancing responsiveness and gesture recognition accuracy to expand the digital Theremin's appeal. Additionally, future work can investigate leveraging hand gestures in Unity development Bachelor's degree 2024-04-23T01:44:47Z 2024-04-23T01:44:47Z 2024 Final Year Project (FYP) Chua, R. (2024). Digital Theremin with computer vision. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175253 https://hdl.handle.net/10356/175253 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Computer vision
Human computer interaction
Mediapipe
Theremin
spellingShingle Computer and Information Science
Computer vision
Human computer interaction
Mediapipe
Theremin
Chua, Ryuichi
Digital Theremin with computer vision
description This study presents the methodology and development of a digital Theremin using MediaPipe to enable gesture-based controls for pitch, volume, and vibrato. Through a comprehensive user study, the project evaluated the intuitiveness of the gesture-based controls and the produced sound quality with PyAudio. The digital Theremin was generally well-received, with participants demonstrating high controllability and showing a strong preference for a particular sound version noted for its warmth and expressiveness. Challenges identified include weak detection at screen edges and precise note-playing. Future work includes enhancing responsiveness and gesture recognition accuracy to expand the digital Theremin's appeal. Additionally, future work can investigate leveraging hand gestures in Unity development
author2 Goh Wooi Boon
author_facet Goh Wooi Boon
Chua, Ryuichi
format Final Year Project
author Chua, Ryuichi
author_sort Chua, Ryuichi
title Digital Theremin with computer vision
title_short Digital Theremin with computer vision
title_full Digital Theremin with computer vision
title_fullStr Digital Theremin with computer vision
title_full_unstemmed Digital Theremin with computer vision
title_sort digital theremin with computer vision
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/175253
_version_ 1814047089500356608