Two-channel noise reduction and post-processing for speech enhancement
This thesis is focused on speech enhancement techniques based on short-time spectral amplitude (STSA) estimation. In particular, it addresses the weakness of the β-order minimum mean-square error (MMSE) estimation method that incorporates auditory masking properties (β-masking in short). Two post-pr...
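Main Author: | Zhang, Xinxin |
---|---|
Other Authors: | Koh Soo Ngee |
Format: | Theses and Dissertations |
Published: | 2008 |
Subjects: | DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing |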
Online Access: | https://hdl.handle.net/10356/3522 |
Institution: | Nanyang Technological University |
id |
sg-ntu-dr.10356-3522 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-3522 2023-07-04T16:41:57Z Two-channel noise reduction and post-processing for speech enhancement Zhang, Xinxin Koh Soo Ngee School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing This thesis is focused on speech enhancement techniques based on short-time spectral amplitude (STSA) estimation. In particular, it addresses the weakness of the β-order minimum mean-square error (MMSE) estimation method that incorporates auditory masking properties (β-masking in short). Two post-processing techniques are proposed to improve the quality of the β-masking enhanced speech signals. One technique involves non-linear high-frequency regeneration, which uses the lower-band spectral information to re-synthesize the upper-band spectral structure. The other technique involves re-synthesis of the weak spectral components using the autocorrelations of the strong spectral components. In addition to the single-channel speech enhancement methods, a two-channel speech enhancement method for communication in a car environment is also studied. To achieve a better performance, the single-channel β-masking speech enhancement technique is incorporated within the two-channel enhancement system. The resulting output speech signals have low background noise and the distortion to the speech components is also very low, thus achieving an overall very satisfactory speech enhancement performance. MASTER OF ENGINEERING (EEE) 2008-09-17T09:31:33Z 2008-09-17T09:31:33Z 2008 2008 Thesis Zhang, X. (2008). Two-channel noise reduction and post-processing for speech enhancement. Master’s thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/3522 10.32657/10356/3522 Nanyang Technological University application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
topic |
DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing |
description |
This thesis is focused on speech enhancement techniques based on short-time spectral amplitude (STSA) estimation. In particular, it addresses the weakness of the β-order minimum mean-square error (MMSE) estimation method that incorporates auditory masking properties (β-masking in short). Two post-processing techniques are proposed to improve the quality of the β-masking enhanced speech signals. One technique involves non-linear high-frequency regeneration, which uses the lower-band spectral information to re-synthesize the upper-band spectral structure. The other technique involves re-synthesis of the weak spectral components using the autocorrelations of the strong spectral components. In addition to the single-channel speech enhancement methods, a two-channel speech enhancement method for communication in a car environment is also studied. To achieve a better performance, the single-channel β-masking speech enhancement technique is incorporated within the two-channel enhancement system. The resulting output speech signals have low background noise and the distortion to the speech components is also very low, thus achieving an overall very satisfactory speech enhancement performance. |
author2 |
Koh Soo Ngee |
author_facet |
Koh Soo Ngee Zhang, Xinxin |
format |
Theses and Dissertations |
author |
Zhang, Xinxin |
title |
Two-channel noise reduction and post-processing for speech enhancement |
publishDate |
2008 |
url |
https://hdl.handle.net/10356/3522 |
_version_ |
1772827406771421184 |