Urban sound tagging

Urban Sound Tagging (UST) seeks to determine whether each of 23 noise sources is present or absent in a 10-second noise by an acoustic sensor network. The 23 noise tags are a multi-label classification problem, and they are common noise complaints in the New York City. The main goal of the compet...

Full description

Saved in:
Bibliographic Details
Main Author: Lim, Cheng Wei
Other Authors: Gan Woon Seng
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/158226
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-158226
record_format dspace
spelling sg-ntu-dr.10356-1582262023-07-07T19:33:51Z Urban sound tagging Lim, Cheng Wei Gan Woon Seng School of Electrical and Electronic Engineering EWSGAN@ntu.edu.sg Engineering::Electrical and electronic engineering::Computer hardware, software and systems Urban Sound Tagging (UST) seeks to determine whether each of 23 noise sources is present or absent in a 10-second noise by an acoustic sensor network. The 23 noise tags are a multi-label classification problem, and they are common noise complaints in the New York City. The main goal of the competition is to write a computer program to determine whether each of the 23 noise tags is present or absent in the recording. The secondary goal is to classify the 23 fine-grained noise tags and 8 coarse-grained tags. It is sometimes difficult for human to differentiate the closely related noise tags without the use of computer program. For instance, small, medium, and large engines are three fine-grained tags from the coarse-grained engine tag. The absence of noise tag is encoded as 0, while the presence of noise tag is encoded as 1. This report will cover the extraction of baseline Python code using the Git Bash and the Anaconda Juypter Notebook. The interpretations of the Python code to determine the hyperparameters and the model structure of the baseline. The outputs produced by the baseline model code in terms of SoftMax values and loss values. Lastly, the future work and the learning outcome. All the Python codes and the Urban Sound Tagging descriptions in this report were taken from the DCASE community website. [1] Bachelor of Engineering (Electrical and Electronic Engineering) 2022-06-01T12:50:21Z 2022-06-01T12:50:21Z 2022 Final Year Project (FYP) Lim, C. W. (2022). Urban sound tagging. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/158226 https://hdl.handle.net/10356/158226 en A3080-211 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering::Computer hardware, software and systems
spellingShingle Engineering::Electrical and electronic engineering::Computer hardware, software and systems
Lim, Cheng Wei
Urban sound tagging
description Urban Sound Tagging (UST) seeks to determine whether each of 23 noise sources is present or absent in a 10-second noise by an acoustic sensor network. The 23 noise tags are a multi-label classification problem, and they are common noise complaints in the New York City. The main goal of the competition is to write a computer program to determine whether each of the 23 noise tags is present or absent in the recording. The secondary goal is to classify the 23 fine-grained noise tags and 8 coarse-grained tags. It is sometimes difficult for human to differentiate the closely related noise tags without the use of computer program. For instance, small, medium, and large engines are three fine-grained tags from the coarse-grained engine tag. The absence of noise tag is encoded as 0, while the presence of noise tag is encoded as 1. This report will cover the extraction of baseline Python code using the Git Bash and the Anaconda Juypter Notebook. The interpretations of the Python code to determine the hyperparameters and the model structure of the baseline. The outputs produced by the baseline model code in terms of SoftMax values and loss values. Lastly, the future work and the learning outcome. All the Python codes and the Urban Sound Tagging descriptions in this report were taken from the DCASE community website. [1]
author2 Gan Woon Seng
author_facet Gan Woon Seng
Lim, Cheng Wei
format Final Year Project
author Lim, Cheng Wei
author_sort Lim, Cheng Wei
title Urban sound tagging
title_short Urban sound tagging
title_full Urban sound tagging
title_fullStr Urban sound tagging
title_full_unstemmed Urban sound tagging
title_sort urban sound tagging
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/158226
_version_ 1772826358234218496