Digital Theremin with computer vision
This study presents the methodology and development of a digital Theremin using MediaPipe to enable gesture-based controls for pitch, volume, and vibrato. Through a comprehensive user study, the project evaluated the intuitiveness of the gesture-based controls and the produced sound quality with...
Main Author: | Chua, Ryuichi |
---|---|
Other Authors: | Goh Wooi Boon |
Format: | Final Year Project |
Language: | English |
Published: | Nanyang Technological University, 2024 |
Subjects: | Computer and Information Science; Computer vision; Human computer interaction; Mediapipe; Theremin |
Online Access: | https://hdl.handle.net/10356/175253 |
Institution: | Nanyang Technological University |
id |
sg-ntu-dr.10356-175253 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-175253; 2024-04-26T15:41:17Z; Digital Theremin with computer vision; Chua, Ryuichi; Goh Wooi Boon; School of Computer Science and Engineering; ASWBGOH@ntu.edu.sg; Computer and Information Science; Computer vision; Human computer interaction; Mediapipe; Theremin; This study presents the methodology and development of a digital Theremin using MediaPipe to enable gesture-based controls for pitch, volume, and vibrato. Through a comprehensive user study, the project evaluated the intuitiveness of the gesture-based controls and the produced sound quality with PyAudio. The digital Theremin was generally well-received, with participants demonstrating high controllability and showing a strong preference for a particular sound version noted for its warmth and expressiveness. Challenges identified include weak detection at screen edges and precise note-playing. Future work includes enhancing responsiveness and gesture recognition accuracy to expand the digital Theremin's appeal. Additionally, future work can investigate leveraging hand gestures in Unity development; Bachelor's degree; 2024-04-23T01:44:47Z; 2024-04-23T01:44:47Z; 2024; Final Year Project (FYP); Chua, R. (2024). Digital Theremin with computer vision. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175253; https://hdl.handle.net/10356/175253; en; application/pdf; Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Computer and Information Science; Computer vision; Human computer interaction; Mediapipe; Theremin |
spellingShingle |
Computer and Information Science; Computer vision; Human computer interaction; Mediapipe; Theremin; Chua, Ryuichi; Digital Theremin with computer vision |
description |
This study presents the methodology and development of a digital Theremin that uses
MediaPipe to enable gesture-based control of pitch, volume, and vibrato. Through a
comprehensive user study, the project evaluated the intuitiveness of the gesture-based
controls and the quality of the sound produced with PyAudio. The digital Theremin was generally
well received, with participants demonstrating high controllability and showing a strong
preference for one sound version noted for its warmth and expressiveness. Challenges
identified include weak detection at screen edges and difficulty playing precise notes. Future work
includes improving responsiveness and gesture recognition accuracy to broaden the digital
Theremin's appeal, and investigating the use of hand gestures in
Unity development. |
author2 |
Goh Wooi Boon |
author_facet |
Goh Wooi Boon; Chua, Ryuichi |
format |
Final Year Project |
author |
Chua, Ryuichi |
author_sort |
Chua, Ryuichi |
title |
Digital Theremin with computer vision |
title_sort |
digital theremin with computer vision |
publisher |
Nanyang Technological University |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/175253 |
_version_ |
1814047089500356608 |
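
The description above mentions gesture-based control of pitch, volume, and vibrato. The record does not say how the project maps hand positions to sound, so the following is only an illustrative sketch under stated assumptions: hand coordinates are taken to be normalized to the range 0 to 1 (as MediaPipe hand landmarks are), horizontal position is assumed to set pitch, vertical position is assumed to set volume, and the 220 to 880 Hz range, the function names `hand_to_params` and `render`, and the vibrato parameters are all hypothetical.

```python
import math

SAMPLE_RATE = 44100  # samples per second (assumed value)


def hand_to_params(x, y, f_min=220.0, f_max=880.0):
    """Map a normalized hand position to (frequency in Hz, volume in 0..1).

    Assumption: x controls pitch and y controls volume. The project's
    actual mapping is not given in this record.
    """
    x = min(max(x, 0.0), 1.0)  # clamp in case detection drifts past the frame
    y = min(max(y, 0.0), 1.0)
    # Exponential mapping so equal hand distances give equal musical intervals.
    freq = f_min * (f_max / f_min) ** x
    # Image y grows downward, so a raised hand (small y) is louder.
    volume = 1.0 - y
    return freq, volume


def render(freq, volume, duration=0.1, vibrato_rate=5.0,
           vibrato_depth=0.0, phase=0.0):
    """Generate one block of sine samples with optional vibrato.

    vibrato_depth is a fraction of freq (0.02 means +/- 2 percent).
    Returns (samples, end_phase); passing end_phase into the next call
    keeps the waveform continuous between blocks.
    """
    samples = []
    for n in range(int(round(duration * SAMPLE_RATE))):
        t = n / SAMPLE_RATE
        wobble = math.sin(2.0 * math.pi * vibrato_rate * t)
        inst_freq = freq * (1.0 + vibrato_depth * wobble)
        # Accumulate phase instead of computing sin(2*pi*f*t) directly,
        # which avoids clicks when the frequency changes.
        phase += 2.0 * math.pi * inst_freq / SAMPLE_RATE
        samples.append(volume * math.sin(phase))
    return samples, phase


freq, volume = hand_to_params(0.5, 0.25)
block, next_phase = render(freq, volume, vibrato_depth=0.02)
```

In a live setup, each block would typically be converted to bytes and written to an audio output stream (for example one opened with PyAudio), with `hand_to_params` called once per video frame. That wiring is omitted here so the sketch runs without a camera or sound device.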