A generalized time–frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system

We present a new speech enhancement scheme for a single-microphone system to meet the demand for quality noise reduction algorithms capable of operating at a very low signal-to-noise ratio. A psychoacoustic model is incorporated into the generalized perceptual wavelet denoising method to reduce the r...

Bibliographic Details
Main Authors:	Shao, Yu; Chang, Chip Hong
Other Authors:	School of Electrical and Electronic Engineering
Format:	Article
Language:	English
Published:	2009
Subjects:	DRNTU::Engineering::Electrical and electronic engineering
Online Access:	https://hdl.handle.net/10356/91337 http://hdl.handle.net/10220/6027
Institution:	Nanyang Technological University

id	sg-ntu-dr.10356-91337
record_format	dspace
spelling	sg-ntu-dr.10356-913372020-03-07T14:02:39Z A generalized time–frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system Shao, Yu Chang, Chip Hong School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering We present a new speech enhancement scheme for a single-microphone system to meet the demand for quality noise reduction algorithms capable of operating at a very low signal-to-noise ratio. A psychoacoustic model is incorporated into the generalized perceptual wavelet denoising method to reduce the residual noise and improve the intelligibility of speech. The proposed method is a generalized time–frequency subtraction algorithm, which advantageously exploits the wavelet multirate signal representation to preserve the critical transient information. Simultaneous masking and temporal masking of the human auditory system are modeled by the perceptual wavelet packet transform via the frequency and temporal localization of speech components. The wavelet coefficients are used to calculate the Bark spreading energy and temporal spreading energy, from which a time–frequency masking threshold is deduced to adaptively adjust the subtraction parameters of the proposed method. An unvoiced speech enhancement algorithm is also integrated into the system to improve the intelligibility of speech. Through rigorous objective and subjective evaluations, it is shown that the proposed speech enhancement system is capable of reducing noise with little speech degradation in adverse noise environments and the overall performance is superior to several competitive methods. Published version 2009-08-05T01:01:02Z 2019-12-06T18:03:54Z 2009-08-05T01:01:02Z 2019-12-06T18:03:54Z 2007 2007 Journal Article Shao, Y., & Chang, C. H. (2007). A generalized time–frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system. 
IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics, 37(4), 877-889. 1083-4419 https://hdl.handle.net/10356/91337 http://hdl.handle.net/10220/6027 10.1109/TSMCB.2007.895365 en IEEE transactions on systems, man, and cybernetics-part B : cybernetics IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics © copyright 2007 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder. http://www.ieee.org/portal/site. 13 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
country	Singapore
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Electrical and electronic engineering
spellingShingle	DRNTU::Engineering::Electrical and electronic engineering Shao, Yu Chang, Chip Hong A generalized time–frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system
description	We present a new speech enhancement scheme for a single-microphone system to meet the demand for quality noise reduction algorithms capable of operating at a very low signal-to-noise ratio. A psychoacoustic model is incorporated into the generalized perceptual wavelet denoising method to reduce the residual noise and improve the intelligibility of speech. The proposed method is a generalized time–frequency subtraction algorithm, which advantageously exploits the wavelet multirate signal representation to preserve the critical transient information. Simultaneous masking and temporal masking of the human auditory system are modeled by the perceptual wavelet packet transform via the frequency and temporal localization of speech components. The wavelet coefficients are used to calculate the Bark spreading energy and temporal spreading energy, from which a time–frequency masking threshold is deduced to adaptively adjust the subtraction parameters of the proposed method. An unvoiced speech enhancement algorithm is also integrated into the system to improve the intelligibility of speech. Through rigorous objective and subjective evaluations, it is shown that the proposed speech enhancement system is capable of reducing noise with little speech degradation in adverse noise environments and the overall performance is superior to several competitive methods.
author2	School of Electrical and Electronic Engineering
author_facet	School of Electrical and Electronic Engineering Shao, Yu Chang, Chip Hong
format	Article
author	Shao, Yu Chang, Chip Hong
author_sort	Shao, Yu
title	A generalized time–frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system
title_short	A generalized time–frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system
title_full	A generalized time–frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system
title_fullStr	A generalized time–frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system
title_full_unstemmed	A generalized time–frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system
title_sort	generalized time–frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system
publishDate	2009
url	https://hdl.handle.net/10356/91337 http://hdl.handle.net/10220/6027
_version_	1681040192380600320

Similar Items