Feature enhancement for robust speech recognition

The results of investigations into some aspects of robust speech recognition are reported in this thesis. Included in the topics that have been studied are feature extraction, training and decoding procedures, speech feature enhancement and model adaptation. In an automatic speech recognition (ASR)...

Full description

Saved in:
Bibliographic Details
Main Author: Zhang, Yi
Other Authors: Koh Soo Ngee
Format: Theses and Dissertations
Language:English
Published: 2009
Subjects:
Online Access:https://hdl.handle.net/10356/20668
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:The results of investigations into some aspects of robust speech recognition are reported in this thesis. Included in the topics that have been studied are feature extraction, training and decoding procedures, speech feature enhancement and model adaptation. In an automatic speech recognition (ASR) system, feature extraction is critical to determining system performance. The most commonly used feature vectors for ASR are those based on the Mel Frequency Cepstral Coefficients (MFCC). However, it is well known that under noisy conditions, the performance of MFCC-based speech feature vectors degrades significantly. There have been many other robust features proposed in recent years and one that is derived from phase autocorrelation (PAC) was investigated.