An improved model of masking effects for robust speech recognition system

Performance of an automatic speech recognition system drops dramatically in the presence of background noise unlike the human auditory system which is more adept at noisy speech recognition. This paper proposes a novel auditory modeling algorithm which is integrated into the feature extraction front...

Full description

Saved in:

Bibliographic Details
Main Authors:	Dai, Peng, Soon, Ing Yann
Other Authors:	School of Electrical and Electronic Engineering
Format:	Article
Language:	English
Published:	2013
Subjects:	DRNTU::Engineering::Electrical and electronic engineering
Online Access:	https://hdl.handle.net/10356/99555 http://hdl.handle.net/10220/17437
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-99555
record_format	dspace
spelling	sg-ntu-dr.10356-995552020-03-07T14:00:31Z An improved model of masking effects for robust speech recognition system Dai, Peng Soon, Ing Yann School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering Performance of an automatic speech recognition system drops dramatically in the presence of background noise unlike the human auditory system which is more adept at noisy speech recognition. This paper proposes a novel auditory modeling algorithm which is integrated into the feature extraction front-end for Hidden Markov Model (HMM). The proposed algorithm is named LTFC which simulates properties of the human auditory system and applies it to the speech recognition system to enhance its robustness. It integrates simultaneous masking, temporal masking and cepstral mean and variance normalization into ordinary mel-frequency cepstral coefficients (MFCC) feature extraction algorithm for robust speech recognition. The proposed method sharpens the power spectrum of the signal in both the frequency domain and the time domain. Evaluation tests are carried out on the AURORA2 database. Experimental results show that the word recognition rate using our proposed feature extraction method has been effectively increased. 2013-11-08T03:54:17Z 2019-12-06T20:08:45Z 2013-11-08T03:54:17Z 2019-12-06T20:08:45Z 2013 2013 Journal Article Dai, P., & Soon, I. Y. (2013). An improved model of masking effects for robust speech recognition system. Speech communication, 55(3), 387-396. 0167-6393 https://hdl.handle.net/10356/99555 http://hdl.handle.net/10220/17437 10.1016/j.specom.2012.12.005 en Speech communication
institution	Nanyang Technological University
building	NTU Library
country	Singapore
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Electrical and electronic engineering
spellingShingle	DRNTU::Engineering::Electrical and electronic engineering Dai, Peng Soon, Ing Yann An improved model of masking effects for robust speech recognition system
description	Performance of an automatic speech recognition system drops dramatically in the presence of background noise unlike the human auditory system which is more adept at noisy speech recognition. This paper proposes a novel auditory modeling algorithm which is integrated into the feature extraction front-end for Hidden Markov Model (HMM). The proposed algorithm is named LTFC which simulates properties of the human auditory system and applies it to the speech recognition system to enhance its robustness. It integrates simultaneous masking, temporal masking and cepstral mean and variance normalization into ordinary mel-frequency cepstral coefficients (MFCC) feature extraction algorithm for robust speech recognition. The proposed method sharpens the power spectrum of the signal in both the frequency domain and the time domain. Evaluation tests are carried out on the AURORA2 database. Experimental results show that the word recognition rate using our proposed feature extraction method has been effectively increased.
author2	School of Electrical and Electronic Engineering
author_facet	School of Electrical and Electronic Engineering Dai, Peng Soon, Ing Yann
format	Article
author	Dai, Peng Soon, Ing Yann
author_sort	Dai, Peng
title	An improved model of masking effects for robust speech recognition system
title_short	An improved model of masking effects for robust speech recognition system
title_full	An improved model of masking effects for robust speech recognition system
title_fullStr	An improved model of masking effects for robust speech recognition system
title_full_unstemmed	An improved model of masking effects for robust speech recognition system
title_sort	improved model of masking effects for robust speech recognition system
publishDate	2013
url	https://hdl.handle.net/10356/99555 http://hdl.handle.net/10220/17437
_version_	1681036370818105344

An improved model of masking effects for robust speech recognition system

Similar Items