Speech dereverberation

The phenomenon of reverberations has always and will continue to be a potential debating issue in terms of speech processing in an enclosed environment. For environments such as concert theatres and indoor stadiums, reverberations of the original source often tend to increase the “liveness” of the o...

Full description

Saved in:
Bibliographic Details
Main Author: Chua, Benedict Chi En.
Other Authors: School of Electrical and Electronic Engineering
Format: Final Year Project
Language:English
Published: 2010
Subjects:
Online Access:http://hdl.handle.net/10356/40818
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-40818
record_format dspace
spelling sg-ntu-dr.10356-408182023-07-07T17:48:18Z Speech dereverberation Chua, Benedict Chi En. School of Electrical and Electronic Engineering Khong Wai Hoong, Andy DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing The phenomenon of reverberations has always and will continue to be a potential debating issue in terms of speech processing in an enclosed environment. For environments such as concert theatres and indoor stadiums, reverberations of the original source often tend to increase the “liveness” of the overall sound, as a result, improve the quality of the signal we receive in our ears. However, on flip side, when we discuss about reverberations occurring in a meeting room where a conference call is currently being made, these reverberations usually distort the original signal, and as a result, the listener on the other end of the phone line will experience an “echoy” and degraded speech signal instead of the ideal clean signal. This might lead to misinterpretations during communication through such conference calls, which in turn, might end up causing more serious implications. The interesting aspect of this problem is that we can use various methods to retrieve a signal as similar to the original signal as possible. In the case of this project and report, we will make use of calculated data using one of the simplest and most basic methods of delay-and-sum beamforming to introduce delays to signals received at every microphone in an array to retrieve the desired output. Our main goal of this project is to achieve maximum dereverberation using the delay-and-sum beamforming technique. Through this technique, we can estimate the Direction of Arrival (DOA) of the source signal, and using the appropriate delays calculated at this DOA, attempt to retrieve the source signal. The delay-and-sum beamformer poses several areas which can be potentially improved through further research. As the number of microphones in an array is directly proportional to the accuracy of the algorithm, more microphones in an array is preferable. However, increasing the amount of microphones also requires more computing capability. Improvements can be made in this aspect to improve the efficiency of the algorithm. Bachelor of Engineering 2010-06-22T06:25:17Z 2010-06-22T06:25:17Z 2010 2010 Final Year Project (FYP) http://hdl.handle.net/10356/40818 en Nanyang Technological University 57 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
spellingShingle DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
Chua, Benedict Chi En.
Speech dereverberation
description The phenomenon of reverberations has always and will continue to be a potential debating issue in terms of speech processing in an enclosed environment. For environments such as concert theatres and indoor stadiums, reverberations of the original source often tend to increase the “liveness” of the overall sound, as a result, improve the quality of the signal we receive in our ears. However, on flip side, when we discuss about reverberations occurring in a meeting room where a conference call is currently being made, these reverberations usually distort the original signal, and as a result, the listener on the other end of the phone line will experience an “echoy” and degraded speech signal instead of the ideal clean signal. This might lead to misinterpretations during communication through such conference calls, which in turn, might end up causing more serious implications. The interesting aspect of this problem is that we can use various methods to retrieve a signal as similar to the original signal as possible. In the case of this project and report, we will make use of calculated data using one of the simplest and most basic methods of delay-and-sum beamforming to introduce delays to signals received at every microphone in an array to retrieve the desired output. Our main goal of this project is to achieve maximum dereverberation using the delay-and-sum beamforming technique. Through this technique, we can estimate the Direction of Arrival (DOA) of the source signal, and using the appropriate delays calculated at this DOA, attempt to retrieve the source signal. The delay-and-sum beamformer poses several areas which can be potentially improved through further research. As the number of microphones in an array is directly proportional to the accuracy of the algorithm, more microphones in an array is preferable. However, increasing the amount of microphones also requires more computing capability. Improvements can be made in this aspect to improve the efficiency of the algorithm.
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
Chua, Benedict Chi En.
format Final Year Project
author Chua, Benedict Chi En.
author_sort Chua, Benedict Chi En.
title Speech dereverberation
title_short Speech dereverberation
title_full Speech dereverberation
title_fullStr Speech dereverberation
title_full_unstemmed Speech dereverberation
title_sort speech dereverberation
publishDate 2010
url http://hdl.handle.net/10356/40818
_version_ 1772828241882513408