Robust isolated word speech recognition system
The aim of this study is to develop a Robust Isolated Word Speech Recognition System. A Dynamic Time Warping based Automatic Speech Recognition system is implemented which can recognize digits zero through nine, pronounced in English by different speakers under different conditions. Pattern matching...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Theses and Dissertations |
Language: | English |
Published: |
2009
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/18798 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-18798 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-187982023-07-04T15:21:21Z Robust isolated word speech recognition system Rajan Sobhana Rashobh Soon Ing Yann School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing The aim of this study is to develop a Robust Isolated Word Speech Recognition System. A Dynamic Time Warping based Automatic Speech Recognition system is implemented which can recognize digits zero through nine, pronounced in English by different speakers under different conditions. Pattern matching method is adopted here. This system can be used for applications such as phone dialing. Mel Frequency Cepstral Coefficients (MFCCs) are used as the feature vectors in this project. The templates are created by using the combination of Dynamic Time Warping and Vector Quantization techniques. Templates are created with different sizes and the results are evaluated. The recognizer is evaluated against different noise conditions and speakers. The evaluated results are discussed and finally, this study also put forward the recommendation and future works to extend this project. Master of Science (Signal Processing) 2009-07-20T02:03:14Z 2009-07-20T02:03:14Z 2008 2008 Thesis http://hdl.handle.net/10356/18798 en 108 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing Rajan Sobhana Rashobh Robust isolated word speech recognition system |
description |
The aim of this study is to develop a Robust Isolated Word Speech Recognition System. A Dynamic Time Warping based Automatic Speech Recognition system is implemented which can recognize digits zero through nine, pronounced in English by different speakers under different conditions. Pattern matching method is adopted here. This system can be used for applications such as phone dialing. Mel Frequency Cepstral Coefficients (MFCCs) are used as the feature vectors in this project. The templates are created by using the combination of Dynamic Time Warping and Vector Quantization techniques. Templates are created with different sizes and the results are evaluated. The recognizer is evaluated against different noise conditions and speakers. The evaluated results are discussed and finally, this study also put forward the recommendation and future works to extend this project. |
author2 |
Soon Ing Yann |
author_facet |
Soon Ing Yann Rajan Sobhana Rashobh |
format |
Theses and Dissertations |
author |
Rajan Sobhana Rashobh |
author_sort |
Rajan Sobhana Rashobh |
title |
Robust isolated word speech recognition system |
title_short |
Robust isolated word speech recognition system |
title_full |
Robust isolated word speech recognition system |
title_fullStr |
Robust isolated word speech recognition system |
title_full_unstemmed |
Robust isolated word speech recognition system |
title_sort |
robust isolated word speech recognition system |
publishDate |
2009 |
url |
http://hdl.handle.net/10356/18798 |
_version_ |
1772826862389559296 |