Nonintrusive quality assessment of noise suppressed speech with Mel-Filtered energies and support vector regression

Objective speech quality assessment is a challenging task which aims to emulate human judgment in the complex and time consuming task of subjective assessment. It is difficult to perform in line with the human perception due the complex and nonlinear nature of the human auditory system. The challeng...

Full description

Saved in:
Bibliographic Details
Main Authors: Narwaria, Manish, Lin, Weisi, McLoughlin, Ian Vince, Emmanuel, Sabu, Chia, Clement Liang-Tien
Other Authors: School of Computer Engineering
Format: Article
Language:English
Published: 2013
Subjects:
Online Access:https://hdl.handle.net/10356/102711
http://hdl.handle.net/10220/16453
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-102711
record_format dspace
spelling sg-ntu-dr.10356-1027112020-05-28T07:18:19Z Nonintrusive quality assessment of noise suppressed speech with Mel-Filtered energies and support vector regression Narwaria, Manish Lin, Weisi McLoughlin, Ian Vince Emmanuel, Sabu Chia, Clement Liang-Tien School of Computer Engineering DRNTU::Humanities::Language::Linguistics Objective speech quality assessment is a challenging task which aims to emulate human judgment in the complex and time consuming task of subjective assessment. It is difficult to perform in line with the human perception due the complex and nonlinear nature of the human auditory system. The challenge lies in representing speech signals using appropriate features and subsequently mapping these features into a quality score. This paper proposes a nonintrusive metric for the quality assessment of noise-suppressed speech. The originality of the proposed approach lies primarily in the use of Mel filter bank energies (FBEs) as features and the use of support vector regression (SVR) for feature mapping. We utilize the sensitivity of FBEs to noise in order to obtain an effective representation of speech towards quality assessment. In addition, the use of SVR exploits the advantages of kernels which allow the regression algorithm to learn complex data patterns via nonlinear transformation for an effective and generalized mapping of features into the quality score. Extensive experiments conducted using two third party databases with different noise-suppressed speech signals show the effectiveness of the proposed approach. 2013-10-11T01:42:27Z 2019-12-06T20:59:28Z 2013-10-11T01:42:27Z 2019-12-06T20:59:28Z 2011 2011 Journal Article Narwaria, M., Lin, W., McLoughlin, I. V., Emmanuel, S., & Chia, C. L. T. (2012). Nonintrusive quality assessment of noise suppressed speech with Mel-Filtered energies and support vector regression. IEEE transactions on audio, speech, and language processing, 20(4), 1217-1232. https://hdl.handle.net/10356/102711 http://hdl.handle.net/10220/16453 10.1109/TASL.2011.2174223 en IEEE transactions on audio, speech, and language processing
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic DRNTU::Humanities::Language::Linguistics
spellingShingle DRNTU::Humanities::Language::Linguistics
Narwaria, Manish
Lin, Weisi
McLoughlin, Ian Vince
Emmanuel, Sabu
Chia, Clement Liang-Tien
Nonintrusive quality assessment of noise suppressed speech with Mel-Filtered energies and support vector regression
description Objective speech quality assessment is a challenging task which aims to emulate human judgment in the complex and time consuming task of subjective assessment. It is difficult to perform in line with the human perception due the complex and nonlinear nature of the human auditory system. The challenge lies in representing speech signals using appropriate features and subsequently mapping these features into a quality score. This paper proposes a nonintrusive metric for the quality assessment of noise-suppressed speech. The originality of the proposed approach lies primarily in the use of Mel filter bank energies (FBEs) as features and the use of support vector regression (SVR) for feature mapping. We utilize the sensitivity of FBEs to noise in order to obtain an effective representation of speech towards quality assessment. In addition, the use of SVR exploits the advantages of kernels which allow the regression algorithm to learn complex data patterns via nonlinear transformation for an effective and generalized mapping of features into the quality score. Extensive experiments conducted using two third party databases with different noise-suppressed speech signals show the effectiveness of the proposed approach.
author2 School of Computer Engineering
author_facet School of Computer Engineering
Narwaria, Manish
Lin, Weisi
McLoughlin, Ian Vince
Emmanuel, Sabu
Chia, Clement Liang-Tien
format Article
author Narwaria, Manish
Lin, Weisi
McLoughlin, Ian Vince
Emmanuel, Sabu
Chia, Clement Liang-Tien
author_sort Narwaria, Manish
title Nonintrusive quality assessment of noise suppressed speech with Mel-Filtered energies and support vector regression
title_short Nonintrusive quality assessment of noise suppressed speech with Mel-Filtered energies and support vector regression
title_full Nonintrusive quality assessment of noise suppressed speech with Mel-Filtered energies and support vector regression
title_fullStr Nonintrusive quality assessment of noise suppressed speech with Mel-Filtered energies and support vector regression
title_full_unstemmed Nonintrusive quality assessment of noise suppressed speech with Mel-Filtered energies and support vector regression
title_sort nonintrusive quality assessment of noise suppressed speech with mel-filtered energies and support vector regression
publishDate 2013
url https://hdl.handle.net/10356/102711
http://hdl.handle.net/10220/16453
_version_ 1681058876807446528