Single channel speech enhancement

In this report, we present our research work on single channel speech enhancement. First, current major speech enhancement algorithms are reviewed and we conclude that the S-MMSE-LSAE algorithm is the best in overall performance among the well known algorithms reported in the existing literature and...

Full description

Saved in:
Bibliographic Details
Main Authors: Chen, Guo., Koh, Soo Ngee., Soon, Inn Yann.
Other Authors: School of Electrical and Electronic Engineering
Format: Research Report
Published: 2008
Subjects:
Online Access:http://hdl.handle.net/10356/2845
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
id sg-ntu-dr.10356-2845
record_format dspace
spelling sg-ntu-dr.10356-28452023-03-04T03:25:22Z Single channel speech enhancement Chen, Guo. Koh, Soo Ngee. Soon, Inn Yann. School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems In this report, we present our research work on single channel speech enhancement. First, current major speech enhancement algorithms are reviewed and we conclude that the S-MMSE-LSAE algorithm is the best in overall performance among the well known algorithms reported in the existing literature and examined by the authors under most signal-to-noise ratios conditions. Secondly, an enhanced Itakura speech distortion measure is examined in our study. The proposed measure incorporates masking properties of the human auditory system into the original Itakura measure and substantially improves the correlation degree of objective measure with subjective evaluation. Next, an F-norm constrained SVD enhancement algorithm based on the conventional SVD-based algorithms is proposed. The traditional SVD algorithms are usually limited by the use of the fixed order of retained singular values, which are difficult to match diverse corrupted speech signals embedded in various noise environments. 2008-09-17T09:15:29Z 2008-09-17T09:15:29Z 2003 2003 Research Report http://hdl.handle.net/10356/2845 Nanyang Technological University application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
topic DRNTU::Engineering::Electrical and electronic engineering::Electronic systems
spellingShingle DRNTU::Engineering::Electrical and electronic engineering::Electronic systems
Chen, Guo.
Koh, Soo Ngee.
Soon, Inn Yann.
Single channel speech enhancement
description In this report, we present our research work on single channel speech enhancement. First, current major speech enhancement algorithms are reviewed and we conclude that the S-MMSE-LSAE algorithm is the best in overall performance among the well known algorithms reported in the existing literature and examined by the authors under most signal-to-noise ratios conditions. Secondly, an enhanced Itakura speech distortion measure is examined in our study. The proposed measure incorporates masking properties of the human auditory system into the original Itakura measure and substantially improves the correlation degree of objective measure with subjective evaluation. Next, an F-norm constrained SVD enhancement algorithm based on the conventional SVD-based algorithms is proposed. The traditional SVD algorithms are usually limited by the use of the fixed order of retained singular values, which are difficult to match diverse corrupted speech signals embedded in various noise environments.
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
Chen, Guo.
Koh, Soo Ngee.
Soon, Inn Yann.
format Research Report
author Chen, Guo.
Koh, Soo Ngee.
Soon, Inn Yann.
author_sort Chen, Guo.
title Single channel speech enhancement
title_short Single channel speech enhancement
title_full Single channel speech enhancement
title_fullStr Single channel speech enhancement
title_full_unstemmed Single channel speech enhancement
title_sort single channel speech enhancement
publishDate 2008
url http://hdl.handle.net/10356/2845
_version_ 1759858405614288896