Time-domain speech enhancement using neural networks

The scope of this project covered the objective of filtering the background noise in speech signal using a Neural Network (NN) while reducing the loss of speech content. More appropriately, it should be called as an Artificial Neural Network (ANN) as computer simulation was done to the Neural Networ...

Full description

Saved in:

Bibliographic Details
Main Author:	Lim, Cheris Jie Ying.
Other Authors:	Soon Ing Yann
Format:	Final Year Project
Language:	English
Published:	2013
Subjects:	DRNTU::Engineering
Online Access:	http://hdl.handle.net/10356/54515
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-54515
record_format	dspace
spelling	sg-ntu-dr.10356-545152023-07-07T17:28:13Z Time-domain speech enhancement using neural networks Lim, Cheris Jie Ying. Soon Ing Yann School of Electrical and Electronic Engineering DRNTU::Engineering The scope of this project covered the objective of filtering the background noise in speech signal using a Neural Network (NN) while reducing the loss of speech content. More appropriately, it should be called as an Artificial Neural Network (ANN) as computer simulation was done to the Neural Network. This is similar to the Neural Network of the human brain. Neural Networks are made up of the biological neurons while Artificial Neural Networks are built based on artificial neurons which can be implemented on an electronic device like a computer. Both NN and ANN share the same concept of imputting information into the network so that they will able to train on their own to produce the desired results. In the Time-Domain NN, input corrupted noisy speech signals and target clean speech signals are fed into the NN for training. With sufficient training, the NN will able to remove background noise and thus improve the quality of speech during the testing stage. In this project, MATLAB software was used to implement the NN. The objective of this software has allowed the user to enhance noisy speech signals and measure both Signal-to-Noise (SNR) ratio and Segmental SNR (SEGSNR). Median filter (mfilter) was introduced to the NN after network training to increase the SNR and SEGSNR values by filtering off "shot" or impulse noise that existed in the speech. Experiments and anallysis were carried out to discover the ideal NN which produces the best speech enhancement results. Last but not least, problems faced and recommended further works was discussed to improve the current Time-Domain Neural Network. Bachelor of Engineering 2013-06-21T06:02:24Z 2013-06-21T06:02:24Z 2013 2013 Final Year Project (FYP) http://hdl.handle.net/10356/54515 en Nanyang Technological University 131 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	DRNTU::Engineering
spellingShingle	DRNTU::Engineering Lim, Cheris Jie Ying. Time-domain speech enhancement using neural networks
description	The scope of this project covered the objective of filtering the background noise in speech signal using a Neural Network (NN) while reducing the loss of speech content. More appropriately, it should be called as an Artificial Neural Network (ANN) as computer simulation was done to the Neural Network. This is similar to the Neural Network of the human brain. Neural Networks are made up of the biological neurons while Artificial Neural Networks are built based on artificial neurons which can be implemented on an electronic device like a computer. Both NN and ANN share the same concept of imputting information into the network so that they will able to train on their own to produce the desired results. In the Time-Domain NN, input corrupted noisy speech signals and target clean speech signals are fed into the NN for training. With sufficient training, the NN will able to remove background noise and thus improve the quality of speech during the testing stage. In this project, MATLAB software was used to implement the NN. The objective of this software has allowed the user to enhance noisy speech signals and measure both Signal-to-Noise (SNR) ratio and Segmental SNR (SEGSNR). Median filter (mfilter) was introduced to the NN after network training to increase the SNR and SEGSNR values by filtering off "shot" or impulse noise that existed in the speech. Experiments and anallysis were carried out to discover the ideal NN which produces the best speech enhancement results. Last but not least, problems faced and recommended further works was discussed to improve the current Time-Domain Neural Network.
author2	Soon Ing Yann
author_facet	Soon Ing Yann Lim, Cheris Jie Ying.
format	Final Year Project
author	Lim, Cheris Jie Ying.
author_sort	Lim, Cheris Jie Ying.
title	Time-domain speech enhancement using neural networks
title_short	Time-domain speech enhancement using neural networks
title_full	Time-domain speech enhancement using neural networks
title_fullStr	Time-domain speech enhancement using neural networks
title_full_unstemmed	Time-domain speech enhancement using neural networks
title_sort	time-domain speech enhancement using neural networks
publishDate	2013
url	http://hdl.handle.net/10356/54515
_version_	1772825195567906816

Time-domain speech enhancement using neural networks

Similar Items