CNN-based noise reduction for multi-channel speech enhancement system with discrete wavelet transform (DWT) preprocessing

Speech enhancement algorithms are applied in multiple levels of enhancement to improve the quality of speech signals under noisy environments known as multichannel speech enhancement (MCSE) systems. Numerous existing algorithms are used to filter noise in speech enhancement systems, which are typica...

Full description

Saved in:

Bibliographic Details
Main Authors:	Cherukuru, Pavani, Mustafa, Mumtaz Begum
Format:	Article
Published:	PeerJ 2024
Subjects:	QA75 Electronic computers. Computer science
Online Access:	http://eprints.um.edu.my/45548/ https://doi.org/10.7717/peerj-cs.1901
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Universiti Malaya

id	my.um.eprints.45548
record_format	eprints
spelling	my.um.eprints.455482024-10-28T08:29:07Z http://eprints.um.edu.my/45548/ CNN-based noise reduction for multi-channel speech enhancement system with discrete wavelet transform (DWT) preprocessing Cherukuru, Pavani Mustafa, Mumtaz Begum QA75 Electronic computers. Computer science Speech enhancement algorithms are applied in multiple levels of enhancement to improve the quality of speech signals under noisy environments known as multichannel speech enhancement (MCSE) systems. Numerous existing algorithms are used to filter noise in speech enhancement systems, which are typically employed as a preprocessor to reduce noise and improve speech quality. They may, however, be limited in performing well under low signal-to-noise ratio (SNR) situations. The speech devices are exposed to all kinds of environmental noises which may go up to a high-level frequency of noises. The objective of this research is to conduct a noise reduction experiment for a multi-channel speech enhancement (MCSE) system in stationary and non-stationary environmental noisy situations with varying speech signal SNR levels. The experiments examined the performance of the existing and the proposed MCSE systems for environmental noises in filtering low to high SNRs environmental noises (-10 dB to 20 dB). The experiments were conducted using the AURORA and LibriSpeech datasets, which consist of different types of environmental noises. The existing MCSE (BAV-MCSE) makes use of beamforming, adaptive noise reduction and voice activity detection algorithms (BAV) to filter the noises from speech signals. The proposed MCSE (DWT-CNN-MCSE) system was developed based on discrete wavelet transform (DWT) preprocessing and convolution neural network (CNN) for denoising the input noisy speech signals to improve the performance accuracy. The performance of the existing BAV-MCSE and the proposed DWT-CNN-MCSE were measured using spectrogram analysis and word recognition rate (WRR). It was identified that the existing BAV-MCSE reported the highest WRR at 93.77% for a high SNR (at 20 dB) and 5.64% on average for a low SNR (at -10 dB) for different noises. The proposed DWT-CNN-MCSE system has proven to perform well at a low SNR with WRR of 70.55% and the highest improvement (64.91% WRR) at -10 dB SNR. PeerJ 2024-02 Article PeerReviewed Cherukuru, Pavani and Mustafa, Mumtaz Begum (2024) CNN-based noise reduction for multi-channel speech enhancement system with discrete wavelet transform (DWT) preprocessing. PeerJ Computer Science, 10. e1901. ISSN 2376-5992, DOI https://doi.org/10.7717/peerj-cs.1901 <https://doi.org/10.7717/peerj-cs.1901>. https://doi.org/10.7717/peerj-cs.1901 10.7717/peerj-cs.1901
institution	Universiti Malaya
building	UM Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Malaya
content_source	UM Research Repository
url_provider	http://eprints.um.edu.my/
topic	QA75 Electronic computers. Computer science
spellingShingle	QA75 Electronic computers. Computer science Cherukuru, Pavani Mustafa, Mumtaz Begum CNN-based noise reduction for multi-channel speech enhancement system with discrete wavelet transform (DWT) preprocessing
description	Speech enhancement algorithms are applied in multiple levels of enhancement to improve the quality of speech signals under noisy environments known as multichannel speech enhancement (MCSE) systems. Numerous existing algorithms are used to filter noise in speech enhancement systems, which are typically employed as a preprocessor to reduce noise and improve speech quality. They may, however, be limited in performing well under low signal-to-noise ratio (SNR) situations. The speech devices are exposed to all kinds of environmental noises which may go up to a high-level frequency of noises. The objective of this research is to conduct a noise reduction experiment for a multi-channel speech enhancement (MCSE) system in stationary and non-stationary environmental noisy situations with varying speech signal SNR levels. The experiments examined the performance of the existing and the proposed MCSE systems for environmental noises in filtering low to high SNRs environmental noises (-10 dB to 20 dB). The experiments were conducted using the AURORA and LibriSpeech datasets, which consist of different types of environmental noises. The existing MCSE (BAV-MCSE) makes use of beamforming, adaptive noise reduction and voice activity detection algorithms (BAV) to filter the noises from speech signals. The proposed MCSE (DWT-CNN-MCSE) system was developed based on discrete wavelet transform (DWT) preprocessing and convolution neural network (CNN) for denoising the input noisy speech signals to improve the performance accuracy. The performance of the existing BAV-MCSE and the proposed DWT-CNN-MCSE were measured using spectrogram analysis and word recognition rate (WRR). It was identified that the existing BAV-MCSE reported the highest WRR at 93.77% for a high SNR (at 20 dB) and 5.64% on average for a low SNR (at -10 dB) for different noises. The proposed DWT-CNN-MCSE system has proven to perform well at a low SNR with WRR of 70.55% and the highest improvement (64.91% WRR) at -10 dB SNR.
format	Article
author	Cherukuru, Pavani Mustafa, Mumtaz Begum
author_facet	Cherukuru, Pavani Mustafa, Mumtaz Begum
author_sort	Cherukuru, Pavani
title	CNN-based noise reduction for multi-channel speech enhancement system with discrete wavelet transform (DWT) preprocessing
title_short	CNN-based noise reduction for multi-channel speech enhancement system with discrete wavelet transform (DWT) preprocessing
title_full	CNN-based noise reduction for multi-channel speech enhancement system with discrete wavelet transform (DWT) preprocessing
title_fullStr	CNN-based noise reduction for multi-channel speech enhancement system with discrete wavelet transform (DWT) preprocessing
title_full_unstemmed	CNN-based noise reduction for multi-channel speech enhancement system with discrete wavelet transform (DWT) preprocessing
title_sort	cnn-based noise reduction for multi-channel speech enhancement system with discrete wavelet transform (dwt) preprocessing
publisher	PeerJ
publishDate	2024
url	http://eprints.um.edu.my/45548/ https://doi.org/10.7717/peerj-cs.1901
_version_	1814933238368436224

CNN-based noise reduction for multi-channel speech enhancement system with discrete wavelet transform (DWT) preprocessing

Similar Items