Urban sound tagging with spatiotemporal context

As Singapore continues to develop by building more residential and commercial infrastructure, noise caused by construction or traffic is unavoidable. To better control the noise, city planners will need to perform analysis to monitor the city's noise so that the citizens’ quality of life w...

Full description

Saved in:
Bibliographic Details
Main Author: Kwok, Li Long
Other Authors: Gan Woon Seng
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/149470
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-149470
record_format dspace
spelling sg-ntu-dr.10356-1494702023-07-07T18:15:07Z Urban sound tagging with spatiotemporal context Kwok, Li Long Gan Woon Seng School of Electrical and Electronic Engineering Digital Signal Processing Laboratory EWSGAN@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Computer science and engineering::Computing methodologies::Pattern recognition As Singapore continues to develop by building more residential and commercial infrastructure, noise caused by construction or traffic is unavoidable. To better control the noise, city planners will need to perform analysis to monitor the city's noise so that the citizens’ quality of life would not be affected. For the city planner to better understand and mitigate the noise pollution, an Urban Sound Tagging (UST) system was developed. The UST can be used to tag audio recordings of the city automatically. With the UST system, city planner can easily monitor the noise pollution in the city. In Detection and Classification of Acoustic Scenes and Events (DCASE) 2019 Task 5, many participants have developed UST that can tag on 10-second-long audio tracks to understand the city's noise profile. To boost the performance of the UST, DCASE 2020 Task 5 proposed a challenge named 'Urban Sound Tagging with Spatiotemporal Context'. This task was a follow up work from the previous year with additional metadata such as Spatiotemporal (STC) metadata. The task aims to investigate whether the STC metadata would aid in the performance of the UST system. With the inspiration from DCASE 2020 Task 5, this project investigates the proposed models from DCASE 2020 Task 5 to observe whether there was a performance boost by using STC metadata. Therefore, an experiment will be conducted by training different metadata and the performance will be evaluated. In addition, the adaptability of the models will be evaluated as well by introducing a new dataset named the Sound of Singapore (SGS) dataset to the model for tagging of soundscape that was recorded in Singapore. Bachelor of Engineering (Electrical and Electronic Engineering) 2021-05-31T11:51:39Z 2021-05-31T11:51:39Z 2021 Final Year Project (FYP) Kwok, L. L. (2021). Urban sound tagging with spatiotemporal context. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/149470 https://hdl.handle.net/10356/149470 en A3089-201 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
Kwok, Li Long
Urban sound tagging with spatiotemporal context
description As Singapore continues to develop by building more residential and commercial infrastructure, noise caused by construction or traffic is unavoidable. To better control the noise, city planners will need to perform analysis to monitor the city's noise so that the citizens’ quality of life would not be affected. For the city planner to better understand and mitigate the noise pollution, an Urban Sound Tagging (UST) system was developed. The UST can be used to tag audio recordings of the city automatically. With the UST system, city planner can easily monitor the noise pollution in the city. In Detection and Classification of Acoustic Scenes and Events (DCASE) 2019 Task 5, many participants have developed UST that can tag on 10-second-long audio tracks to understand the city's noise profile. To boost the performance of the UST, DCASE 2020 Task 5 proposed a challenge named 'Urban Sound Tagging with Spatiotemporal Context'. This task was a follow up work from the previous year with additional metadata such as Spatiotemporal (STC) metadata. The task aims to investigate whether the STC metadata would aid in the performance of the UST system. With the inspiration from DCASE 2020 Task 5, this project investigates the proposed models from DCASE 2020 Task 5 to observe whether there was a performance boost by using STC metadata. Therefore, an experiment will be conducted by training different metadata and the performance will be evaluated. In addition, the adaptability of the models will be evaluated as well by introducing a new dataset named the Sound of Singapore (SGS) dataset to the model for tagging of soundscape that was recorded in Singapore.
author2 Gan Woon Seng
author_facet Gan Woon Seng
Kwok, Li Long
format Final Year Project
author Kwok, Li Long
author_sort Kwok, Li Long
title Urban sound tagging with spatiotemporal context
title_short Urban sound tagging with spatiotemporal context
title_full Urban sound tagging with spatiotemporal context
title_fullStr Urban sound tagging with spatiotemporal context
title_full_unstemmed Urban sound tagging with spatiotemporal context
title_sort urban sound tagging with spatiotemporal context
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/149470
_version_ 1772825261692157952