Deep learning speech enhancement in satellite radio communication

Clarity and intelligibility are critical aspects of speech. Deep learning models for speech enhancement uses different algorithms to improve the speech quality significantly before reaching the listener. Machine learning knowledge is crucial in generating models to predict the outcome of the s...

Full description

Saved in:

Bibliographic Details
Main Author:	Low, Yuki Yu Jun
Other Authors:	Arokiaswami Alphones
Format:	Final Year Project
Language:	English
Published:	Nanyang Technological University 2022
Subjects:	Engineering::Electrical and electronic engineering
Online Access:	https://hdl.handle.net/10356/157874
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-157874
record_format	dspace
spelling	sg-ntu-dr.10356-1578742023-07-07T19:07:23Z Deep learning speech enhancement in satellite radio communication Low, Yuki Yu Jun Arokiaswami Alphones School of Electrical and Electronic Engineering EAlphones@ntu.edu.sg Engineering::Electrical and electronic engineering Clarity and intelligibility are critical aspects of speech. Deep learning models for speech enhancement uses different algorithms to improve the speech quality significantly before reaching the listener. Machine learning knowledge is crucial in generating models to predict the outcome of the speech enhancement model. In this report, we study about the source of noise in an audio when air pilot controllers communicate and methods used for speech enhancement, mainly Wave-U-Net and a hybrid Recurrent Neural Network-based model. Wave-U-Net is a multi-scale neural network that provides end-to-end audio source separation, which is a modification from U-Net. Wave-U-Net repeatedly resamples feature maps to calculate and integrate features at different time scales [1]. The RNN-based model uses a hybrid of deep learning in conjunction with the basics of audio signal processing. Our experiment shows that the proposed Wave-U-Net method improves the audio quality consistently with PESQ metric – a test methodology that automatically assess speech quality when compared to the hybrid RNN-based model. Bachelor of Engineering (Information Engineering and Media) 2022-05-24T12:42:43Z 2022-05-24T12:42:43Z 2022 Final Year Project (FYP) Low, Y. Y. J. (2022). Deep learning speech enhancement in satellite radio communication. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/157874 https://hdl.handle.net/10356/157874 en application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Electrical and electronic engineering
spellingShingle	Engineering::Electrical and electronic engineering Low, Yuki Yu Jun Deep learning speech enhancement in satellite radio communication
description	Clarity and intelligibility are critical aspects of speech. Deep learning models for speech enhancement uses different algorithms to improve the speech quality significantly before reaching the listener. Machine learning knowledge is crucial in generating models to predict the outcome of the speech enhancement model. In this report, we study about the source of noise in an audio when air pilot controllers communicate and methods used for speech enhancement, mainly Wave-U-Net and a hybrid Recurrent Neural Network-based model. Wave-U-Net is a multi-scale neural network that provides end-to-end audio source separation, which is a modification from U-Net. Wave-U-Net repeatedly resamples feature maps to calculate and integrate features at different time scales [1]. The RNN-based model uses a hybrid of deep learning in conjunction with the basics of audio signal processing. Our experiment shows that the proposed Wave-U-Net method improves the audio quality consistently with PESQ metric – a test methodology that automatically assess speech quality when compared to the hybrid RNN-based model.
author2	Arokiaswami Alphones
author_facet	Arokiaswami Alphones Low, Yuki Yu Jun
format	Final Year Project
author	Low, Yuki Yu Jun
author_sort	Low, Yuki Yu Jun
title	Deep learning speech enhancement in satellite radio communication
title_short	Deep learning speech enhancement in satellite radio communication
title_full	Deep learning speech enhancement in satellite radio communication
title_fullStr	Deep learning speech enhancement in satellite radio communication
title_full_unstemmed	Deep learning speech enhancement in satellite radio communication
title_sort	deep learning speech enhancement in satellite radio communication
publisher	Nanyang Technological University
publishDate	2022
url	https://hdl.handle.net/10356/157874
_version_	1772828965576114176

Deep learning speech enhancement in satellite radio communication

Similar Items