Time-domain speech enhancement using neural networks

The scope of this project covered the objective of filtering the background noise in speech signal using a Neural Network (NN) while reducing the loss of speech content. More appropriately, it should be called as an Artificial Neural Network (ANN) as computer simulation was done to the Neural Networ...

Full description

Saved in:
Bibliographic Details
Main Author: Lim, Cheris Jie Ying.
Other Authors: Soon Ing Yann
Format: Final Year Project
Language:English
Published: 2013
Subjects:
Online Access:http://hdl.handle.net/10356/54515
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-54515
record_format dspace
spelling sg-ntu-dr.10356-545152023-07-07T17:28:13Z Time-domain speech enhancement using neural networks Lim, Cheris Jie Ying. Soon Ing Yann School of Electrical and Electronic Engineering DRNTU::Engineering The scope of this project covered the objective of filtering the background noise in speech signal using a Neural Network (NN) while reducing the loss of speech content. More appropriately, it should be called as an Artificial Neural Network (ANN) as computer simulation was done to the Neural Network. This is similar to the Neural Network of the human brain. Neural Networks are made up of the biological neurons while Artificial Neural Networks are built based on artificial neurons which can be implemented on an electronic device like a computer. Both NN and ANN share the same concept of imputting information into the network so that they will able to train on their own to produce the desired results. In the Time-Domain NN, input corrupted noisy speech signals and target clean speech signals are fed into the NN for training. With sufficient training, the NN will able to remove background noise and thus improve the quality of speech during the testing stage. In this project, MATLAB software was used to implement the NN. The objective of this software has allowed the user to enhance noisy speech signals and measure both Signal-to-Noise (SNR) ratio and Segmental SNR (SEGSNR). Median filter (mfilter) was introduced to the NN after network training to increase the SNR and SEGSNR values by filtering off "shot" or impulse noise that existed in the speech. Experiments and anallysis were carried out to discover the ideal NN which produces the best speech enhancement results. Last but not least, problems faced and recommended further works was discussed to improve the current Time-Domain Neural Network. Bachelor of Engineering 2013-06-21T06:02:24Z 2013-06-21T06:02:24Z 2013 2013 Final Year Project (FYP) http://hdl.handle.net/10356/54515 en Nanyang Technological University 131 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering
spellingShingle DRNTU::Engineering
Lim, Cheris Jie Ying.
Time-domain speech enhancement using neural networks
description The scope of this project covered the objective of filtering the background noise in speech signal using a Neural Network (NN) while reducing the loss of speech content. More appropriately, it should be called as an Artificial Neural Network (ANN) as computer simulation was done to the Neural Network. This is similar to the Neural Network of the human brain. Neural Networks are made up of the biological neurons while Artificial Neural Networks are built based on artificial neurons which can be implemented on an electronic device like a computer. Both NN and ANN share the same concept of imputting information into the network so that they will able to train on their own to produce the desired results. In the Time-Domain NN, input corrupted noisy speech signals and target clean speech signals are fed into the NN for training. With sufficient training, the NN will able to remove background noise and thus improve the quality of speech during the testing stage. In this project, MATLAB software was used to implement the NN. The objective of this software has allowed the user to enhance noisy speech signals and measure both Signal-to-Noise (SNR) ratio and Segmental SNR (SEGSNR). Median filter (mfilter) was introduced to the NN after network training to increase the SNR and SEGSNR values by filtering off "shot" or impulse noise that existed in the speech. Experiments and anallysis were carried out to discover the ideal NN which produces the best speech enhancement results. Last but not least, problems faced and recommended further works was discussed to improve the current Time-Domain Neural Network.
author2 Soon Ing Yann
author_facet Soon Ing Yann
Lim, Cheris Jie Ying.
format Final Year Project
author Lim, Cheris Jie Ying.
author_sort Lim, Cheris Jie Ying.
title Time-domain speech enhancement using neural networks
title_short Time-domain speech enhancement using neural networks
title_full Time-domain speech enhancement using neural networks
title_fullStr Time-domain speech enhancement using neural networks
title_full_unstemmed Time-domain speech enhancement using neural networks
title_sort time-domain speech enhancement using neural networks
publishDate 2013
url http://hdl.handle.net/10356/54515
_version_ 1772825195567906816