Comparative study on Malay children vowel recognition using multi-layer perceptron and recurrent neural networks / Afshan Kordi

Bibliographic Details
Main Author: Afshan, Kordi
Format: Thesis
Published: 2012
Subjects:
Online Access: http://studentsrepo.um.edu.my/7715/5/afshan.pdf
http://studentsrepo.um.edu.my/7715/
Institution: Universiti Malaya
id my.um.stud.7715
record_format eprints
spelling my.um.stud.7715 2019-07-18T00:55:22Z Afshan, Kordi (2012) Comparative study on Malay children vowel recognition using multi-layer perceptron and recurrent neural networks / Afshan Kordi. Masters thesis, University of Malaya. NonPeerReviewed application/pdf http://studentsrepo.um.edu.my/7715/5/afshan.pdf http://studentsrepo.um.edu.my/7715/
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Student Repository
url_provider http://studentsrepo.um.edu.my/
topic T Technology (General)
TA Engineering (General). Civil engineering (General)
description Speech recognition has become popular in recent decades because of its widespread applications, such as telephone systems, health care, data entry, speech-to-text processing, biometric systems, and the training of air traffic controllers. Among the technologies investigated for acoustic modeling of speech, Artificial Neural Networks (ANN) have attracted many researchers because they have shown good results in pattern recognition, especially in classification. Despite noteworthy progress in speech classification using neural networks, some issues in applying them remain unresolved. In particular, little work has been done on the speech of children, which is more dynamic. Of the many neural network architectures that have been proposed, among the most common for speech recognition are the Multi-Layer Perceptron (MLP) and the Recurrent Neural Network (RNN). The purpose of this study is to compare the performance and recognition rate of these two types of neural networks, in terms of signal length and number of hidden neurons, for sustained Malay vowels spoken by Malay children. Linear Predictive Coding (LPC) is used as a feature extractor to convert the speech signal into parametric coefficients. The Neural Network Toolbox™ (nntool) in Matlab® is used to classify the six Malay vowels (/a/, /e/, /ə/, /i/, /o/ and /u/) using 3-fold cross-validation, across different signal lengths and different numbers of hidden neurons. Experiments were also carried out to compare the performance of the neural networks using single-frame and multiple-frame approaches. The results show that longer signals yield better recognition than shorter ones. The findings indicate that MLP and RNN reached recognition rates of 83.79% and 83.10%, respectively. Vowel /i/ had the highest recognition rate with both methods.
format Thesis
author Afshan, Kordi
title Comparative study on Malay children vowel recognition using multi-layer perceptron and recurrent neural networks / Afshan Kordi
publishDate 2012
url http://studentsrepo.um.edu.my/7715/5/afshan.pdf
http://studentsrepo.um.edu.my/7715/