An efficient dilated convolutional neural network for UAV noise reduction at low input SNR

Acoustic applications on a multi-rotor unmanned aerial vehicle (UAV) have been hindered by its low input signal-to-noise ratio (SNR). Such low SNR condition poses prominent challenges for beamforming algorithms, statistical methods, and existing mask-based deep learning algorithms. We propose the sm...

Full description

Saved in:
Bibliographic Details
Main Authors: Tan, Zhi-Wei, Nguyen, Anh Hai Trieu, Khong, Andy Wai Hoong
Other Authors: School of Electrical and Electronic Engineering
Format: Conference or Workshop Item
Language:English
Published: 2020
Subjects:
Online Access:https://hdl.handle.net/10356/141755
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-141755
record_format dspace
spelling sg-ntu-dr.10356-1417552020-06-10T07:46:35Z An efficient dilated convolutional neural network for UAV noise reduction at low input SNR Tan, Zhi-Wei Nguyen, Anh Hai Trieu Khong, Andy Wai Hoong School of Electrical and Electronic Engineering 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) ST Engineering-NTU Corporate Lab Engineering::Electrical and electronic engineering Noise Measurement Unmanned Aerial Vehicle Acoustic applications on a multi-rotor unmanned aerial vehicle (UAV) have been hindered by its low input signal-to-noise ratio (SNR). Such low SNR condition poses prominent challenges for beamforming algorithms, statistical methods, and existing mask-based deep learning algorithms. We propose the small model on low SNR (SMoLnet), a compact convolutional neural network (CNN) to suppress UAV noise in noisy speech signals recorded off a microphone array mounted on the UAV. The proposed SMoLnet employs a large analysis window to achieve high spectral resolution since the loud UAV noise exhibits a narrow-band harmonic pattern. In the proposed SMoLnet model, exponentially-increasing dilated convolution layers were adopted to capture the global relationship across the frequency dimension. Furthermore, we performed direct spectral mapping between noisy and clean complex spectrogram to cater to the low SNR scenario. Simulation results show that the proposed SMoLnet outperforms existing dilation-based models in terms of speech quality and objective speech intelligibility metrics for UAV noise reduction. In addition, the proposed SMoLnet requires fewer parameters and achieves lower latency than the compared models. NRF (Natl Research Foundation, S’pore) Accepted version 2020-06-10T07:46:35Z 2020-06-10T07:46:35Z 2020 Conference Paper Tan, Z.-W., Nguyen, A. H. T., & Khong, A. W. H. (2019). An efficient dilated convolutional neural network for UAV noise reduction at low input SNR. Proceedings of 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 1885-1892. doi:10.1109/APSIPAASC47483.2019.9023324 978-1-7281-3249-5 2640-009X https://hdl.handle.net/10356/141755 10.1109/APSIPAASC47483.2019.9023324 1885 1892 en MRP14 © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/APSIPAASC47483.2019.9023324 application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
Noise Measurement
Unmanned Aerial Vehicle
spellingShingle Engineering::Electrical and electronic engineering
Noise Measurement
Unmanned Aerial Vehicle
Tan, Zhi-Wei
Nguyen, Anh Hai Trieu
Khong, Andy Wai Hoong
An efficient dilated convolutional neural network for UAV noise reduction at low input SNR
description Acoustic applications on a multi-rotor unmanned aerial vehicle (UAV) have been hindered by its low input signal-to-noise ratio (SNR). Such low SNR condition poses prominent challenges for beamforming algorithms, statistical methods, and existing mask-based deep learning algorithms. We propose the small model on low SNR (SMoLnet), a compact convolutional neural network (CNN) to suppress UAV noise in noisy speech signals recorded off a microphone array mounted on the UAV. The proposed SMoLnet employs a large analysis window to achieve high spectral resolution since the loud UAV noise exhibits a narrow-band harmonic pattern. In the proposed SMoLnet model, exponentially-increasing dilated convolution layers were adopted to capture the global relationship across the frequency dimension. Furthermore, we performed direct spectral mapping between noisy and clean complex spectrogram to cater to the low SNR scenario. Simulation results show that the proposed SMoLnet outperforms existing dilation-based models in terms of speech quality and objective speech intelligibility metrics for UAV noise reduction. In addition, the proposed SMoLnet requires fewer parameters and achieves lower latency than the compared models.
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
Tan, Zhi-Wei
Nguyen, Anh Hai Trieu
Khong, Andy Wai Hoong
format Conference or Workshop Item
author Tan, Zhi-Wei
Nguyen, Anh Hai Trieu
Khong, Andy Wai Hoong
author_sort Tan, Zhi-Wei
title An efficient dilated convolutional neural network for UAV noise reduction at low input SNR
title_short An efficient dilated convolutional neural network for UAV noise reduction at low input SNR
title_full An efficient dilated convolutional neural network for UAV noise reduction at low input SNR
title_fullStr An efficient dilated convolutional neural network for UAV noise reduction at low input SNR
title_full_unstemmed An efficient dilated convolutional neural network for UAV noise reduction at low input SNR
title_sort efficient dilated convolutional neural network for uav noise reduction at low input snr
publishDate 2020
url https://hdl.handle.net/10356/141755
_version_ 1681058980410949632