Real-Time Voice-Controlled Game Interaction using Convolutional Neural Networks

Speech recognition has gained growing popularity due to its wide applications in almost every field, ranging from wake-word recognition, emotion recognition, command recognition, and interactive game. Recently, there is a growing interest in using voice in the gaming industry. Voice-controlled inter...

Full description

Saved in:
Bibliographic Details
Main Authors: Waqar, Dania Maryam, Gunawan, Teddy Surya, Kartiwi, Mira, Ahmad, Robiah
Format: Conference or Workshop Item
Language:English
English
Published: IEEE 2021
Subjects:
Online Access:http://irep.iium.edu.my/92226/1/92226_Real-Time%20Voice-Controlled%20Game%20Interaction.pdf
http://irep.iium.edu.my/92226/7/92226_Real-Time%20Voice-Controlled%20Game%20Interaction%20using%20Convolutional%20Neural%20Networks_Scopus.pdf
http://irep.iium.edu.my/92226/
https://ieeexplore.ieee.org/abstract/document/9526318?casa_token=t5uDZk4Z8R0AAAAA:q27yMik06rVC85e9bVz14MRETLZ4O9kiw8BFZf4sw5S60yGuEuSTv9pPCotbnkkuqnqCurKYHQ
https://doi.org/10.1109/ICSIMA50015.2021.9526318
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Islam Antarabangsa Malaysia
Language: English
English
Description
Summary:Speech recognition has gained growing popularity due to its wide applications in almost every field, ranging from wake-word recognition, emotion recognition, command recognition, and interactive game. Recently, there is a growing interest in using voice in the gaming industry. Voice-controlled interaction made gaming much more accessible to a wider audience. However, the use of voice to control games requires real-time processing to avoid unwanted delay. This paper proposes speech command recognition using Convolutional Neural Networks (CNN) to control the popular snake game. First, the limited dataset for Up, Down, Left, Right speech commands was prepared for training, validation, and testing. Second, an optimum MFCC and CNN-based speech command recognition were proposed to recognize the four speech command. Results showed that our proposed algorithm could achieve high recognition accuracy of 96.5% and was able to detect all four commands. Finally, the proposed algorithm is integrated with a Python-based snake game.