A lightweight convolutional neural network for KWS

Keyword spotting has been widely used in smart homes and mobile devices, where the goal is to achieve high accuracy with low latency and small computational effort. Convolutional neural networks are not only performed well in extracting speech features, but also show good robustness in the face of n...

Full description

Saved in:
Bibliographic Details
Main Author: Wang, Yumengmeng
Other Authors: Goh Wang Ling
Format: Thesis-Master by Coursework
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/158686
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:Keyword spotting has been widely used in smart homes and mobile devices, where the goal is to achieve high accuracy with low latency and small computational effort. Convolutional neural networks are not only performed well in extracting speech features, but also show good robustness in the face of noise interference. This dissertation proposes a convolutional neural network using Google's speech command as the dataset, with a trade-off between high accuracy and small memory. Existing techniques were applied to modify the network to obtain the results of the best performing model. Further, the training set is noise-loaded to show the effect of noise on the audio files, and the performance of the KWS-Net under three levels of noise is demonstrated. Finally, some advanced deep learning networks are used for speech recognition experiments on the same dataset, including ResNet, DenseNet, MobileNet. The results section compares the test results of the proposed network with these advanced networks in speech recognition, highlighting the good performance achieved by the KWS-Net in terms of high accuracy and low number of parameters.