Feature enhancement for robust speech recognition
The results of investigations into some aspects of robust speech recognition are reported in this thesis. Included in the topics that have been studied are feature extraction, training and decoding procedures, speech feature enhancement and model adaptation. In an automatic speech recognition (ASR)...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Theses and Dissertations |
Language: | English |
Published: |
2009
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/20668 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-20668 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-206682023-07-04T17:03:38Z Feature enhancement for robust speech recognition Zhang, Yi Koh Soo Ngee School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering The results of investigations into some aspects of robust speech recognition are reported in this thesis. Included in the topics that have been studied are feature extraction, training and decoding procedures, speech feature enhancement and model adaptation. In an automatic speech recognition (ASR) system, feature extraction is critical to determining system performance. The most commonly used feature vectors for ASR are those based on the Mel Frequency Cepstral Coefficients (MFCC). However, it is well known that under noisy conditions, the performance of MFCC-based speech feature vectors degrades significantly. There have been many other robust features proposed in recent years and one that is derived from phase autocorrelation (PAC) was investigated. MASTER OF ENGINEERING (EEE) 2009-12-22T06:09:25Z 2009-12-22T06:09:25Z 2008 2008 Thesis Zhang, Y. (2008). Feature enhancement for robust speech recognition. Master’s thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/20668 10.32657/10356/20668 en 147 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering Zhang, Yi Feature enhancement for robust speech recognition |
description |
The results of investigations into some aspects of robust speech recognition are
reported in this thesis. Included in the topics that have been studied are feature extraction, training and decoding procedures, speech feature enhancement and model adaptation. In an automatic speech recognition (ASR) system, feature extraction is critical to determining system performance. The most commonly used feature vectors for ASR are those based on the Mel Frequency Cepstral Coefficients (MFCC). However, it is well known that under noisy conditions, the performance of MFCC-based speech feature vectors degrades significantly. There have been many other robust features proposed in recent years and one that is derived from phase autocorrelation (PAC) was investigated. |
author2 |
Koh Soo Ngee |
author_facet |
Koh Soo Ngee Zhang, Yi |
format |
Theses and Dissertations |
author |
Zhang, Yi |
author_sort |
Zhang, Yi |
title |
Feature enhancement for robust speech recognition |
title_short |
Feature enhancement for robust speech recognition |
title_full |
Feature enhancement for robust speech recognition |
title_fullStr |
Feature enhancement for robust speech recognition |
title_full_unstemmed |
Feature enhancement for robust speech recognition |
title_sort |
feature enhancement for robust speech recognition |
publishDate |
2009 |
url |
https://hdl.handle.net/10356/20668 |
_version_ |
1772828022606397440 |