Audio intelligence & domain adaptation for deep learning models at the edge
Identifying urban noises and sounds is a challenging but essential problem in the field of machine listening. It enables and provides a realistic use case for detecting noises in residential areas - from noise complaints to detecting sounds or unusual noises that may indicate possible emergencies. T...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Master by Research |
Language: | English |
Published: |
Nanyang Technological University
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/152683 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-152683 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1526832023-07-04T16:47:59Z Audio intelligence & domain adaptation for deep learning models at the edge Ng, Linus JunJia Gan Woon Seng School of Electrical and Electronic Engineering Centre for Infocomm Technology (INFINITUS) EWSGAN@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Identifying urban noises and sounds is a challenging but essential problem in the field of machine listening. It enables and provides a realistic use case for detecting noises in residential areas - from noise complaints to detecting sounds or unusual noises that may indicate possible emergencies. To mitigate noise issues in an estate is not an easy task using machine learning approach due to data scarcity and the lack of labeled data, where the acquisition of labeled data is often difficult, costly, and time-consuming. In this work, we leverage an end-to-end IoT system coupled with deep learning models to detect critical urban sound information at the edge. Wireless acoustic sensor nodes (WASN) are deployed in several residential areas to validate their feasibility in detecting noise events of interest, where real-time edge analytic is performed. We explore methods to address the domain shift caused by novel acoustic conditions that are introduced due to environmental influences in different deployed locations, evaluating the environmental sound classifiers in a WASN setup, and the extent it affects the performance of the sound classifiers in different locations with different microphones. We have collected and annotated audio data set in Singapore for training, validating, and testing purposes. Our experimental results show that the proposed method is able to address the mismatch introduced by the domain shift. The proposed method and future research in this work will enhance model robustness in adapting to new deployed environments and minimize the manpower time required to acquire and annotate audio data. Master of Engineering 2021-09-14T07:14:33Z 2021-09-14T07:14:33Z 2021 Thesis-Master by Research Ng, L. J. (2021). Audio intelligence & domain adaptation for deep learning models at the edge. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/152683 https://hdl.handle.net/10356/152683 10.32657/10356/152683 en MOE2017-T2-2-060 This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Ng, Linus JunJia Audio intelligence & domain adaptation for deep learning models at the edge |
description |
Identifying urban noises and sounds is a challenging but essential problem in the field of machine listening. It enables and provides a realistic use case for detecting noises in residential areas - from noise complaints to detecting sounds or unusual noises that may indicate possible emergencies. To mitigate noise issues in an estate is not an easy task using machine learning approach due to data scarcity and the lack of labeled data, where the acquisition of labeled data is often difficult, costly, and time-consuming. In this work, we leverage an end-to-end IoT system coupled with deep learning models to detect critical urban sound information at the edge. Wireless acoustic sensor nodes (WASN) are deployed in several residential areas to validate their feasibility in detecting noise events of interest, where real-time edge analytic is performed. We explore methods to address the domain shift caused by novel acoustic conditions that are introduced due to environmental influences in different deployed locations, evaluating the environmental sound classifiers in a WASN setup, and the extent it affects the performance of
the sound classifiers in different locations with different microphones. We have collected and annotated audio data set in Singapore for training, validating, and testing purposes. Our experimental results show that the proposed method is able to address the mismatch introduced by the domain shift. The proposed method and future research in this work will enhance model robustness in adapting to new deployed environments and minimize the manpower time required to acquire and annotate audio data. |
author2 |
Gan Woon Seng |
author_facet |
Gan Woon Seng Ng, Linus JunJia |
format |
Thesis-Master by Research |
author |
Ng, Linus JunJia |
author_sort |
Ng, Linus JunJia |
title |
Audio intelligence & domain adaptation for deep learning models at the edge |
title_short |
Audio intelligence & domain adaptation for deep learning models at the edge |
title_full |
Audio intelligence & domain adaptation for deep learning models at the edge |
title_fullStr |
Audio intelligence & domain adaptation for deep learning models at the edge |
title_full_unstemmed |
Audio intelligence & domain adaptation for deep learning models at the edge |
title_sort |
audio intelligence & domain adaptation for deep learning models at the edge |
publisher |
Nanyang Technological University |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/152683 |
_version_ |
1772826056892350464 |