Energy efficient voice detection with spiking neural network for smart sensor applications
This project aims to design an algorithm for classifying voice commands for smart sensor devices, allowing them to be deployed in environments with poor network connectivity. The goal of the algorithm is to be energy efficient, so that the neural network that does the voice command classification ca...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/140766 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-140766 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1407662023-07-07T18:53:07Z Energy efficient voice detection with spiking neural network for smart sensor applications Tan, Chiah Ying Goh Wang Ling School of Electrical and Electronic Engineering Agency for Science, Technology and Research A*STAR Gao Yuan ewlgoh@ntu.edu.sg, gaoy@ime.a-star.edu.sg Engineering::Electrical and electronic engineering::Applications of electronics This project aims to design an algorithm for classifying voice commands for smart sensor devices, allowing them to be deployed in environments with poor network connectivity. The goal of the algorithm is to be energy efficient, so that the neural network that does the voice command classification can be executed locally on the device, without the need for cloud computing. This would be done by utilising a new generation of neural networks – spiking neural networks. The spiking neuron’s unique characteristic of transmitting information with the use of electrical spikes will be used to convert the voice samples into a sparse form, whereby the spikes represent segments of the voice sample with critical information, while the rest of the sample will be converted to zeros. This allows the hardware device to conserve a significant amount of energy, since it is able to maintain a state of rest in tandem during the rest periods of the spiking neuron. The spiking neural network was created with the Leaky integrate and fire model, and two classification tasks were undertaken. One was to classify voices based on gender, while the other was to classify the voice commands based on the command issued. The algorithm was trained using the Google Commands dataset, achieving 91% and 98% on the gender and command word classification task, respectively. Bachelor of Engineering (Electrical and Electronic Engineering) 2020-06-02T02:00:25Z 2020-06-02T02:00:25Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/140766 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering::Applications of electronics |
spellingShingle |
Engineering::Electrical and electronic engineering::Applications of electronics Tan, Chiah Ying Energy efficient voice detection with spiking neural network for smart sensor applications |
description |
This project aims to design an algorithm for classifying voice commands for smart sensor devices, allowing them to be deployed in environments with poor network connectivity. The goal of the algorithm is to be energy efficient, so that the neural network that does the voice command classification can be executed locally on the device, without the need for cloud computing. This would be done by utilising a new generation of neural networks – spiking neural networks. The spiking neuron’s unique characteristic of transmitting information with the use of electrical spikes will be used to convert the voice samples into a sparse form, whereby the spikes represent segments of the voice sample with critical information, while the rest of the sample will be converted to zeros. This allows the hardware device to conserve a significant amount of energy, since it is able to maintain a state of rest in tandem during the rest periods of the spiking neuron. The spiking neural network was created with the Leaky integrate and fire model, and two classification tasks were undertaken. One was to classify voices based on gender, while the other was to classify the voice commands based on the command issued. The algorithm was trained using the Google Commands dataset, achieving 91% and 98% on the gender and command word classification task, respectively. |
author2 |
Goh Wang Ling |
author_facet |
Goh Wang Ling Tan, Chiah Ying |
format |
Final Year Project |
author |
Tan, Chiah Ying |
author_sort |
Tan, Chiah Ying |
title |
Energy efficient voice detection with spiking neural network for smart sensor applications |
title_short |
Energy efficient voice detection with spiking neural network for smart sensor applications |
title_full |
Energy efficient voice detection with spiking neural network for smart sensor applications |
title_fullStr |
Energy efficient voice detection with spiking neural network for smart sensor applications |
title_full_unstemmed |
Energy efficient voice detection with spiking neural network for smart sensor applications |
title_sort |
energy efficient voice detection with spiking neural network for smart sensor applications |
publisher |
Nanyang Technological University |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/140766 |
_version_ |
1772827613701603328 |