Audio intelligence & domain adaptation for deep learning models at the edge

Identifying urban noises and sounds is a challenging but essential problem in the field of machine listening. It enables and provides a realistic use case for detecting noises in residential areas - from noise complaints to detecting sounds or unusual noises that may indicate possible emergencies. T...

Full description

Saved in:
Bibliographic Details
Main Author: Ng, Linus JunJia
Other Authors: Gan Woon Seng
Format: Thesis-Master by Research
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/152683
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-152683
record_format dspace
spelling sg-ntu-dr.10356-1526832023-07-04T16:47:59Z Audio intelligence & domain adaptation for deep learning models at the edge Ng, Linus JunJia Gan Woon Seng School of Electrical and Electronic Engineering Centre for Infocomm Technology (INFINITUS) EWSGAN@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Identifying urban noises and sounds is a challenging but essential problem in the field of machine listening. It enables and provides a realistic use case for detecting noises in residential areas - from noise complaints to detecting sounds or unusual noises that may indicate possible emergencies. To mitigate noise issues in an estate is not an easy task using machine learning approach due to data scarcity and the lack of labeled data, where the acquisition of labeled data is often difficult, costly, and time-consuming. In this work, we leverage an end-to-end IoT system coupled with deep learning models to detect critical urban sound information at the edge. Wireless acoustic sensor nodes (WASN) are deployed in several residential areas to validate their feasibility in detecting noise events of interest, where real-time edge analytic is performed. We explore methods to address the domain shift caused by novel acoustic conditions that are introduced due to environmental influences in different deployed locations, evaluating the environmental sound classifiers in a WASN setup, and the extent it affects the performance of the sound classifiers in different locations with different microphones. We have collected and annotated audio data set in Singapore for training, validating, and testing purposes. Our experimental results show that the proposed method is able to address the mismatch introduced by the domain shift. The proposed method and future research in this work will enhance model robustness in adapting to new deployed environments and minimize the manpower time required to acquire and annotate audio data. Master of Engineering 2021-09-14T07:14:33Z 2021-09-14T07:14:33Z 2021 Thesis-Master by Research Ng, L. J. (2021). Audio intelligence & domain adaptation for deep learning models at the edge. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/152683 https://hdl.handle.net/10356/152683 10.32657/10356/152683 en MOE2017-T2-2-060 This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Ng, Linus JunJia
Audio intelligence & domain adaptation for deep learning models at the edge
description Identifying urban noises and sounds is a challenging but essential problem in the field of machine listening. It enables and provides a realistic use case for detecting noises in residential areas - from noise complaints to detecting sounds or unusual noises that may indicate possible emergencies. To mitigate noise issues in an estate is not an easy task using machine learning approach due to data scarcity and the lack of labeled data, where the acquisition of labeled data is often difficult, costly, and time-consuming. In this work, we leverage an end-to-end IoT system coupled with deep learning models to detect critical urban sound information at the edge. Wireless acoustic sensor nodes (WASN) are deployed in several residential areas to validate their feasibility in detecting noise events of interest, where real-time edge analytic is performed. We explore methods to address the domain shift caused by novel acoustic conditions that are introduced due to environmental influences in different deployed locations, evaluating the environmental sound classifiers in a WASN setup, and the extent it affects the performance of the sound classifiers in different locations with different microphones. We have collected and annotated audio data set in Singapore for training, validating, and testing purposes. Our experimental results show that the proposed method is able to address the mismatch introduced by the domain shift. The proposed method and future research in this work will enhance model robustness in adapting to new deployed environments and minimize the manpower time required to acquire and annotate audio data.
author2 Gan Woon Seng
author_facet Gan Woon Seng
Ng, Linus JunJia
format Thesis-Master by Research
author Ng, Linus JunJia
author_sort Ng, Linus JunJia
title Audio intelligence & domain adaptation for deep learning models at the edge
title_short Audio intelligence & domain adaptation for deep learning models at the edge
title_full Audio intelligence & domain adaptation for deep learning models at the edge
title_fullStr Audio intelligence & domain adaptation for deep learning models at the edge
title_full_unstemmed Audio intelligence & domain adaptation for deep learning models at the edge
title_sort audio intelligence & domain adaptation for deep learning models at the edge
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/152683
_version_ 1772826056892350464