Speech recognition system using MATLAB : design, implementation, and samples codes

Research in automatic speech recognition has been done for almost four decades. Over the past decades, the development of speech recognition applications gives invaluable contributions. Speech has the potential to be a better interface than other computing devices used such as keyboard or mouse. Thi...

Full description

Saved in:
Bibliographic Details
Main Authors: Abushariah, Ahmad A. M., Gunawan, Teddy Surya
Format: Book
Language:English
Published: Lambert Academic Publishing 2011
Subjects:
Online Access:http://irep.iium.edu.my/27200/1/Speech_Recognition.pdf
http://irep.iium.edu.my/27200/
https://www.morebooks.de/store/gb/book/speech-recognition-system-using-matlab/isbn/978-3-8465-0376-8
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Islam Antarabangsa Malaysia
Language: English
id my.iium.irep.27200
record_format dspace
spelling my.iium.irep.272002013-02-06T02:29:36Z http://irep.iium.edu.my/27200/ Speech recognition system using MATLAB : design, implementation, and samples codes Abushariah, Ahmad A. M. Gunawan, Teddy Surya TK5101 Telecommunication. Including telegraphy, radio, radar, television Research in automatic speech recognition has been done for almost four decades. Over the past decades, the development of speech recognition applications gives invaluable contributions. Speech has the potential to be a better interface than other computing devices used such as keyboard or mouse. This project aims to develop automated English digits speech recognition system. The project relies heavily on the well known and widely used statistical method in characterizing the speech pattern, the Hidden Markov Model (HMM), which provides a highly reliable way for recognizing speech. This project discusses the theory of HMM and then extends the ideas to the development and implementation by applying this method in computational speech recognition. Basically, the system is able to recognize the spoken utterances by translating the speech waveform into a set of feature vectors using Mel Frequency Cepstral Coefficients (MFCC) technique, which then estimates the observation likelihood by using the Forward algorithm. The HMM parameters are estimated by applying the Baum Welch algorithm on previously trained samples. The most likely sequence is then decoded using Viterbi algorithm, thus producing the recognized word. This project focuses on all English digits from (Zero through Nine), which is based on isolated words structure. Two modules were developed, namely the isolated words speech recognition and the continuous speech recognition. Both modules were tested in both clean and noisy environments and showed relatively successful recognition rates. In clean environment and isolated words speech recognition module, the multi-speaker mode achieved 99.5% whereas the speaker-independent mode achieved 79.5%. In clean environment and continuous speech recognition module, the multi-speaker mode achieved 70% whereas the speaker-independent mode achieved 55%. However in noisy environment and isolated words speech recognition module, the multi-speaker mode achieved 88% whereas the speaker-independent mode achieved 67%. In noisy environment and continuous speech recognition module, the multi-speaker mode achieved 92.5% whereas the speaker-independent mode achieved 75%. These recognition rates are relatively successful if compared to similar systems. Lambert Academic Publishing 2011 Book REM application/pdf en http://irep.iium.edu.my/27200/1/Speech_Recognition.pdf Abushariah, Ahmad A. M. and Gunawan, Teddy Surya (2011) Speech recognition system using MATLAB : design, implementation, and samples codes. Lambert Academic Publishing, Saarbrucken, Germany. ISBN 978-3-8465-0376-8 https://www.morebooks.de/store/gb/book/speech-recognition-system-using-matlab/isbn/978-3-8465-0376-8
institution Universiti Islam Antarabangsa Malaysia
building IIUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider International Islamic University Malaysia
content_source IIUM Repository (IREP)
url_provider http://irep.iium.edu.my/
language English
topic TK5101 Telecommunication. Including telegraphy, radio, radar, television
spellingShingle TK5101 Telecommunication. Including telegraphy, radio, radar, television
Abushariah, Ahmad A. M.
Gunawan, Teddy Surya
Speech recognition system using MATLAB : design, implementation, and samples codes
description Research in automatic speech recognition has been done for almost four decades. Over the past decades, the development of speech recognition applications gives invaluable contributions. Speech has the potential to be a better interface than other computing devices used such as keyboard or mouse. This project aims to develop automated English digits speech recognition system. The project relies heavily on the well known and widely used statistical method in characterizing the speech pattern, the Hidden Markov Model (HMM), which provides a highly reliable way for recognizing speech. This project discusses the theory of HMM and then extends the ideas to the development and implementation by applying this method in computational speech recognition. Basically, the system is able to recognize the spoken utterances by translating the speech waveform into a set of feature vectors using Mel Frequency Cepstral Coefficients (MFCC) technique, which then estimates the observation likelihood by using the Forward algorithm. The HMM parameters are estimated by applying the Baum Welch algorithm on previously trained samples. The most likely sequence is then decoded using Viterbi algorithm, thus producing the recognized word. This project focuses on all English digits from (Zero through Nine), which is based on isolated words structure. Two modules were developed, namely the isolated words speech recognition and the continuous speech recognition. Both modules were tested in both clean and noisy environments and showed relatively successful recognition rates. In clean environment and isolated words speech recognition module, the multi-speaker mode achieved 99.5% whereas the speaker-independent mode achieved 79.5%. In clean environment and continuous speech recognition module, the multi-speaker mode achieved 70% whereas the speaker-independent mode achieved 55%. However in noisy environment and isolated words speech recognition module, the multi-speaker mode achieved 88% whereas the speaker-independent mode achieved 67%. In noisy environment and continuous speech recognition module, the multi-speaker mode achieved 92.5% whereas the speaker-independent mode achieved 75%. These recognition rates are relatively successful if compared to similar systems.
format Book
author Abushariah, Ahmad A. M.
Gunawan, Teddy Surya
author_facet Abushariah, Ahmad A. M.
Gunawan, Teddy Surya
author_sort Abushariah, Ahmad A. M.
title Speech recognition system using MATLAB : design, implementation, and samples codes
title_short Speech recognition system using MATLAB : design, implementation, and samples codes
title_full Speech recognition system using MATLAB : design, implementation, and samples codes
title_fullStr Speech recognition system using MATLAB : design, implementation, and samples codes
title_full_unstemmed Speech recognition system using MATLAB : design, implementation, and samples codes
title_sort speech recognition system using matlab : design, implementation, and samples codes
publisher Lambert Academic Publishing
publishDate 2011
url http://irep.iium.edu.my/27200/1/Speech_Recognition.pdf
http://irep.iium.edu.my/27200/
https://www.morebooks.de/store/gb/book/speech-recognition-system-using-matlab/isbn/978-3-8465-0376-8
_version_ 1643609290639933440