Deep learning speech enhancement in satellite radio communication

Clarity and intelligibility are critical aspects of speech. Deep learning models for speech enhancement uses different algorithms to improve the speech quality significantly before reaching the listener. Machine learning knowledge is crucial in generating models to predict the outcome of the s...

Full description

Saved in:
Bibliographic Details
Main Author: Low, Yuki Yu Jun
Other Authors: Arokiaswami Alphones
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/157874
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-157874
record_format dspace
spelling sg-ntu-dr.10356-1578742023-07-07T19:07:23Z Deep learning speech enhancement in satellite radio communication Low, Yuki Yu Jun Arokiaswami Alphones School of Electrical and Electronic Engineering EAlphones@ntu.edu.sg Engineering::Electrical and electronic engineering Clarity and intelligibility are critical aspects of speech. Deep learning models for speech enhancement uses different algorithms to improve the speech quality significantly before reaching the listener. Machine learning knowledge is crucial in generating models to predict the outcome of the speech enhancement model. In this report, we study about the source of noise in an audio when air pilot controllers communicate and methods used for speech enhancement, mainly Wave-U-Net and a hybrid Recurrent Neural Network-based model. Wave-U-Net is a multi-scale neural network that provides end-to-end audio source separation, which is a modification from U-Net. Wave-U-Net repeatedly resamples feature maps to calculate and integrate features at different time scales [1]. The RNN-based model uses a hybrid of deep learning in conjunction with the basics of audio signal processing. Our experiment shows that the proposed Wave-U-Net method improves the audio quality consistently with PESQ metric – a test methodology that automatically assess speech quality when compared to the hybrid RNN-based model. Bachelor of Engineering (Information Engineering and Media) 2022-05-24T12:42:43Z 2022-05-24T12:42:43Z 2022 Final Year Project (FYP) Low, Y. Y. J. (2022). Deep learning speech enhancement in satellite radio communication. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/157874 https://hdl.handle.net/10356/157874 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
spellingShingle Engineering::Electrical and electronic engineering
Low, Yuki Yu Jun
Deep learning speech enhancement in satellite radio communication
description Clarity and intelligibility are critical aspects of speech. Deep learning models for speech enhancement uses different algorithms to improve the speech quality significantly before reaching the listener. Machine learning knowledge is crucial in generating models to predict the outcome of the speech enhancement model. In this report, we study about the source of noise in an audio when air pilot controllers communicate and methods used for speech enhancement, mainly Wave-U-Net and a hybrid Recurrent Neural Network-based model. Wave-U-Net is a multi-scale neural network that provides end-to-end audio source separation, which is a modification from U-Net. Wave-U-Net repeatedly resamples feature maps to calculate and integrate features at different time scales [1]. The RNN-based model uses a hybrid of deep learning in conjunction with the basics of audio signal processing. Our experiment shows that the proposed Wave-U-Net method improves the audio quality consistently with PESQ metric – a test methodology that automatically assess speech quality when compared to the hybrid RNN-based model.
author2 Arokiaswami Alphones
author_facet Arokiaswami Alphones
Low, Yuki Yu Jun
format Final Year Project
author Low, Yuki Yu Jun
author_sort Low, Yuki Yu Jun
title Deep learning speech enhancement in satellite radio communication
title_short Deep learning speech enhancement in satellite radio communication
title_full Deep learning speech enhancement in satellite radio communication
title_fullStr Deep learning speech enhancement in satellite radio communication
title_full_unstemmed Deep learning speech enhancement in satellite radio communication
title_sort deep learning speech enhancement in satellite radio communication
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/157874
_version_ 1772828965576114176