An efficient dilated convolutional neural network for UAV noise reduction at low input SNR
Acoustic applications on a multi-rotor unmanned aerial vehicle (UAV) have been hindered by its low input signal-to-noise ratio (SNR). Such low SNR condition poses prominent challenges for beamforming algorithms, statistical methods, and existing mask-based deep learning algorithms. We propose the sm...
Saved in:
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/141755 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-141755 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1417552020-06-10T07:46:35Z An efficient dilated convolutional neural network for UAV noise reduction at low input SNR Tan, Zhi-Wei Nguyen, Anh Hai Trieu Khong, Andy Wai Hoong School of Electrical and Electronic Engineering 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) ST Engineering-NTU Corporate Lab Engineering::Electrical and electronic engineering Noise Measurement Unmanned Aerial Vehicle Acoustic applications on a multi-rotor unmanned aerial vehicle (UAV) have been hindered by its low input signal-to-noise ratio (SNR). Such low SNR condition poses prominent challenges for beamforming algorithms, statistical methods, and existing mask-based deep learning algorithms. We propose the small model on low SNR (SMoLnet), a compact convolutional neural network (CNN) to suppress UAV noise in noisy speech signals recorded off a microphone array mounted on the UAV. The proposed SMoLnet employs a large analysis window to achieve high spectral resolution since the loud UAV noise exhibits a narrow-band harmonic pattern. In the proposed SMoLnet model, exponentially-increasing dilated convolution layers were adopted to capture the global relationship across the frequency dimension. Furthermore, we performed direct spectral mapping between noisy and clean complex spectrogram to cater to the low SNR scenario. Simulation results show that the proposed SMoLnet outperforms existing dilation-based models in terms of speech quality and objective speech intelligibility metrics for UAV noise reduction. In addition, the proposed SMoLnet requires fewer parameters and achieves lower latency than the compared models. NRF (Natl Research Foundation, S’pore) Accepted version 2020-06-10T07:46:35Z 2020-06-10T07:46:35Z 2020 Conference Paper Tan, Z.-W., Nguyen, A. H. T., & Khong, A. W. H. (2019). An efficient dilated convolutional neural network for UAV noise reduction at low input SNR. Proceedings of 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 1885-1892. doi:10.1109/APSIPAASC47483.2019.9023324 978-1-7281-3249-5 2640-009X https://hdl.handle.net/10356/141755 10.1109/APSIPAASC47483.2019.9023324 1885 1892 en MRP14 © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/APSIPAASC47483.2019.9023324 application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering Noise Measurement Unmanned Aerial Vehicle |
spellingShingle |
Engineering::Electrical and electronic engineering Noise Measurement Unmanned Aerial Vehicle Tan, Zhi-Wei Nguyen, Anh Hai Trieu Khong, Andy Wai Hoong An efficient dilated convolutional neural network for UAV noise reduction at low input SNR |
description |
Acoustic applications on a multi-rotor unmanned aerial vehicle (UAV) have been hindered by its low input signal-to-noise ratio (SNR). Such low SNR condition poses prominent challenges for beamforming algorithms, statistical methods, and existing mask-based deep learning algorithms. We propose the small model on low SNR (SMoLnet), a compact convolutional neural network (CNN) to suppress UAV noise in noisy speech signals recorded off a microphone array mounted on the UAV. The proposed SMoLnet employs a large analysis window to achieve high spectral resolution since the loud UAV noise exhibits a narrow-band harmonic pattern. In the proposed SMoLnet model, exponentially-increasing dilated convolution layers were adopted to capture the global relationship across the frequency dimension. Furthermore, we performed direct spectral mapping between noisy and clean complex spectrogram to cater to the low SNR scenario. Simulation results show that the proposed SMoLnet outperforms existing dilation-based models in terms of speech quality and objective speech intelligibility metrics for UAV noise reduction. In addition, the proposed SMoLnet requires fewer parameters and achieves lower latency than the compared models. |
author2 |
School of Electrical and Electronic Engineering |
author_facet |
School of Electrical and Electronic Engineering Tan, Zhi-Wei Nguyen, Anh Hai Trieu Khong, Andy Wai Hoong |
format |
Conference or Workshop Item |
author |
Tan, Zhi-Wei Nguyen, Anh Hai Trieu Khong, Andy Wai Hoong |
author_sort |
Tan, Zhi-Wei |
title |
An efficient dilated convolutional neural network for UAV noise reduction at low input SNR |
title_short |
An efficient dilated convolutional neural network for UAV noise reduction at low input SNR |
title_full |
An efficient dilated convolutional neural network for UAV noise reduction at low input SNR |
title_fullStr |
An efficient dilated convolutional neural network for UAV noise reduction at low input SNR |
title_full_unstemmed |
An efficient dilated convolutional neural network for UAV noise reduction at low input SNR |
title_sort |
efficient dilated convolutional neural network for uav noise reduction at low input snr |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/141755 |
_version_ |
1681058980410949632 |