Time-domain speech enhancement using neural networks
The scope of this project covered the objective of filtering the background noise in speech signal using a Neural Network (NN) while reducing the loss of speech content. More appropriately, it should be called as an Artificial Neural Network (ANN) as computer simulation was done to the Neural Networ...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2013
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/54515 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-54515 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-545152023-07-07T17:28:13Z Time-domain speech enhancement using neural networks Lim, Cheris Jie Ying. Soon Ing Yann School of Electrical and Electronic Engineering DRNTU::Engineering The scope of this project covered the objective of filtering the background noise in speech signal using a Neural Network (NN) while reducing the loss of speech content. More appropriately, it should be called as an Artificial Neural Network (ANN) as computer simulation was done to the Neural Network. This is similar to the Neural Network of the human brain. Neural Networks are made up of the biological neurons while Artificial Neural Networks are built based on artificial neurons which can be implemented on an electronic device like a computer. Both NN and ANN share the same concept of imputting information into the network so that they will able to train on their own to produce the desired results. In the Time-Domain NN, input corrupted noisy speech signals and target clean speech signals are fed into the NN for training. With sufficient training, the NN will able to remove background noise and thus improve the quality of speech during the testing stage. In this project, MATLAB software was used to implement the NN. The objective of this software has allowed the user to enhance noisy speech signals and measure both Signal-to-Noise (SNR) ratio and Segmental SNR (SEGSNR). Median filter (mfilter) was introduced to the NN after network training to increase the SNR and SEGSNR values by filtering off "shot" or impulse noise that existed in the speech. Experiments and anallysis were carried out to discover the ideal NN which produces the best speech enhancement results. Last but not least, problems faced and recommended further works was discussed to improve the current Time-Domain Neural Network. Bachelor of Engineering 2013-06-21T06:02:24Z 2013-06-21T06:02:24Z 2013 2013 Final Year Project (FYP) http://hdl.handle.net/10356/54515 en Nanyang Technological University 131 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering |
spellingShingle |
DRNTU::Engineering Lim, Cheris Jie Ying. Time-domain speech enhancement using neural networks |
description |
The scope of this project covered the objective of filtering the background noise in speech signal using a Neural Network (NN) while reducing the loss of speech content. More appropriately, it should be called as an Artificial Neural Network (ANN) as computer simulation was done to the Neural Network. This is similar to the Neural Network of the human brain. Neural Networks are made up of the biological neurons while Artificial Neural Networks are built based on artificial neurons which can be implemented on an electronic device like a computer. Both NN and ANN share the same concept of imputting information into the network so that they will able to train on their own to produce the desired results. In the Time-Domain NN, input corrupted noisy speech signals and target clean speech signals are fed into the NN for training. With sufficient training, the NN will able to remove background noise and thus improve the quality of speech during the testing stage. In this project, MATLAB software was used to implement the NN. The objective of this software has allowed the user to enhance noisy speech signals and measure both Signal-to-Noise (SNR) ratio and Segmental SNR (SEGSNR). Median filter (mfilter) was introduced to the NN after network training to increase the SNR and SEGSNR values by filtering off "shot" or impulse noise that existed in the speech. Experiments and anallysis were carried out to discover the ideal NN which produces the best speech enhancement results. Last but not least, problems faced and recommended further works was discussed to improve the current Time-Domain Neural Network. |
author2 |
Soon Ing Yann |
author_facet |
Soon Ing Yann Lim, Cheris Jie Ying. |
format |
Final Year Project |
author |
Lim, Cheris Jie Ying. |
author_sort |
Lim, Cheris Jie Ying. |
title |
Time-domain speech enhancement using neural networks |
title_short |
Time-domain speech enhancement using neural networks |
title_full |
Time-domain speech enhancement using neural networks |
title_fullStr |
Time-domain speech enhancement using neural networks |
title_full_unstemmed |
Time-domain speech enhancement using neural networks |
title_sort |
time-domain speech enhancement using neural networks |
publishDate |
2013 |
url |
http://hdl.handle.net/10356/54515 |
_version_ |
1772825195567906816 |