Two-channel noise reduction and post-processing for speech enhancement

This thesis is focused on speech enhancement techniques based on short-time spectral amplitude (STSA) estimation. In particular, it addresses the weakness of the ?-order minimum mean-square error (MMSE) estimation method that incorporates auditory masking properties (?-masking in short). Two post-pr...

Full description

Saved in:
Bibliographic Details
Main Author: Zhang, Xinxin
Other Authors: Koh Soo Ngee
Format: Theses and Dissertations
Published: 2008
Subjects:
Online Access:https://hdl.handle.net/10356/3522
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
id sg-ntu-dr.10356-3522
record_format dspace
spelling sg-ntu-dr.10356-35222023-07-04T16:41:57Z Two-channel noise reduction and post-processing for speech enhancement Zhang, Xinxin Koh Soo Ngee School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing This thesis is focused on speech enhancement techniques based on short-time spectral amplitude (STSA) estimation. In particular, it addresses the weakness of the ?-order minimum mean-square error (MMSE) estimation method that incorporates auditory masking properties (?-masking in short). Two post-processing techniques are proposed to improve the quality of the ?-masking enhanced speech signals. One technique involves non-linear high-frequency regeneration, which uses the lower-band spectral information to re-synthesize the upper-band spectral structure. The other technique involves re-synthesis of the weak spectral components using the autocorrelations of the strong spectral components. In addition to the single-channel speech enhancement methods, a two-channel speech enhancement method for communication in a car environment is also studied. To achieve a better performance, the single-channel ?-masking speech enhancement technique is incorporated within the two-channel enhancement system. The resulting output speech signals have low background noise and the distortion to the speech components is also very low, thus achieving an overall very satisfactory speech enhancement performance. MASTER OF ENGINEERING (EEE) 2008-09-17T09:31:33Z 2008-09-17T09:31:33Z 2008 2008 Thesis Zhang, X. (2008). Two-channel noise reduction and post-processing for speech enhancement. Master’s thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/3522 10.32657/10356/3522 Nanyang Technological University application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
topic DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
spellingShingle DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
Zhang, Xinxin
Two-channel noise reduction and post-processing for speech enhancement
description This thesis is focused on speech enhancement techniques based on short-time spectral amplitude (STSA) estimation. In particular, it addresses the weakness of the ?-order minimum mean-square error (MMSE) estimation method that incorporates auditory masking properties (?-masking in short). Two post-processing techniques are proposed to improve the quality of the ?-masking enhanced speech signals. One technique involves non-linear high-frequency regeneration, which uses the lower-band spectral information to re-synthesize the upper-band spectral structure. The other technique involves re-synthesis of the weak spectral components using the autocorrelations of the strong spectral components. In addition to the single-channel speech enhancement methods, a two-channel speech enhancement method for communication in a car environment is also studied. To achieve a better performance, the single-channel ?-masking speech enhancement technique is incorporated within the two-channel enhancement system. The resulting output speech signals have low background noise and the distortion to the speech components is also very low, thus achieving an overall very satisfactory speech enhancement performance.
author2 Koh Soo Ngee
author_facet Koh Soo Ngee
Zhang, Xinxin
format Theses and Dissertations
author Zhang, Xinxin
author_sort Zhang, Xinxin
title Two-channel noise reduction and post-processing for speech enhancement
title_short Two-channel noise reduction and post-processing for speech enhancement
title_full Two-channel noise reduction and post-processing for speech enhancement
title_fullStr Two-channel noise reduction and post-processing for speech enhancement
title_full_unstemmed Two-channel noise reduction and post-processing for speech enhancement
title_sort two-channel noise reduction and post-processing for speech enhancement
publishDate 2008
url https://hdl.handle.net/10356/3522
_version_ 1772827406771421184