Discriminative feature extraction for speech recognition using continuous output codes

Feature transformation techniques have been widely investigated to reduce feature redundancy and to introduce additional discriminative information with the aim to improve the performance of automatic speech recognition (ASR). In this paper, we propose a novel method to obtain discriminative feature...

Full description

Saved in:

Bibliographic Details
Main Authors:	Dehzangi, Omid, Ma, Bin, Chng, Eng Siong, Li, Haizhou
Other Authors:	School of Computer Engineering
Format:	Article
Language:	English
Published:	2013
Subjects:	DRNTU::Engineering::Computer science and engineering
Online Access:	https://hdl.handle.net/10356/105600 http://hdl.handle.net/10220/17340
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-105600
record_format	dspace
spelling	sg-ntu-dr.10356-1056002020-05-28T07:17:32Z Discriminative feature extraction for speech recognition using continuous output codes Dehzangi, Omid Ma, Bin Chng, Eng Siong Li, Haizhou School of Computer Engineering DRNTU::Engineering::Computer science and engineering Feature transformation techniques have been widely investigated to reduce feature redundancy and to introduce additional discriminative information with the aim to improve the performance of automatic speech recognition (ASR). In this paper, we propose a novel method to obtain discriminative feature transformation based on output coding technique for speech recognition. The output coding transformation projects the speech features from their original space to a new one where each dimension of the features captures information to distinguish different phones. Using polynomial expansion, the short-time spectral features are first expanded to a high-dimensional space where the generalized linear discriminant sequence kernel is applied on the sequences of input feature vectors. Then, the output coding transformation formulated via a set of linear SVMs projects the sequences of high dimensional vectors into a tractable low-dimensional feature space where the resultant features are well-separated continuous output codes for the subsequent multi-class classification problem. Our experimental results on the TIMIT corpus show that the proposed features achieve 10.5% ASR error rate reduction over the conventional spectral features. 2013-11-06T05:39:05Z 2019-12-06T21:54:17Z 2013-11-06T05:39:05Z 2019-12-06T21:54:17Z 2012 2012 Journal Article Dehzangi, O., Ma, B., Chng, E. S., & Li, H. (2012). Discriminative feature extraction for speech recognition using continuous output codes. Pattern Recognition Letters, 33(13), 1703-1709. 0167-8655 https://hdl.handle.net/10356/105600 http://hdl.handle.net/10220/17340 10.1016/j.patrec.2012.05.012 en Pattern recognition letters
institution	Nanyang Technological University
building	NTU Library
country	Singapore
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Computer science and engineering
spellingShingle	DRNTU::Engineering::Computer science and engineering Dehzangi, Omid Ma, Bin Chng, Eng Siong Li, Haizhou Discriminative feature extraction for speech recognition using continuous output codes
description	Feature transformation techniques have been widely investigated to reduce feature redundancy and to introduce additional discriminative information with the aim to improve the performance of automatic speech recognition (ASR). In this paper, we propose a novel method to obtain discriminative feature transformation based on output coding technique for speech recognition. The output coding transformation projects the speech features from their original space to a new one where each dimension of the features captures information to distinguish different phones. Using polynomial expansion, the short-time spectral features are first expanded to a high-dimensional space where the generalized linear discriminant sequence kernel is applied on the sequences of input feature vectors. Then, the output coding transformation formulated via a set of linear SVMs projects the sequences of high dimensional vectors into a tractable low-dimensional feature space where the resultant features are well-separated continuous output codes for the subsequent multi-class classification problem. Our experimental results on the TIMIT corpus show that the proposed features achieve 10.5% ASR error rate reduction over the conventional spectral features.
author2	School of Computer Engineering
author_facet	School of Computer Engineering Dehzangi, Omid Ma, Bin Chng, Eng Siong Li, Haizhou
format	Article
author	Dehzangi, Omid Ma, Bin Chng, Eng Siong Li, Haizhou
author_sort	Dehzangi, Omid
title	Discriminative feature extraction for speech recognition using continuous output codes
title_short	Discriminative feature extraction for speech recognition using continuous output codes
title_full	Discriminative feature extraction for speech recognition using continuous output codes
title_fullStr	Discriminative feature extraction for speech recognition using continuous output codes
title_full_unstemmed	Discriminative feature extraction for speech recognition using continuous output codes
title_sort	discriminative feature extraction for speech recognition using continuous output codes
publishDate	2013
url	https://hdl.handle.net/10356/105600 http://hdl.handle.net/10220/17340
_version_	1681059447422582784

Discriminative feature extraction for speech recognition using continuous output codes

Similar Items