Urban sound tagging
Urban Sound Tagging (UST) seeks to determine whether each of 23 noise sources is present or absent in a 10-second noise by an acoustic sensor network. The 23 noise tags are a multi-label classification problem, and they are common noise complaints in the New York City. The main goal of the compet...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/158226 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-158226 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1582262023-07-07T19:33:51Z Urban sound tagging Lim, Cheng Wei Gan Woon Seng School of Electrical and Electronic Engineering EWSGAN@ntu.edu.sg Engineering::Electrical and electronic engineering::Computer hardware, software and systems Urban Sound Tagging (UST) seeks to determine whether each of 23 noise sources is present or absent in a 10-second noise by an acoustic sensor network. The 23 noise tags are a multi-label classification problem, and they are common noise complaints in the New York City. The main goal of the competition is to write a computer program to determine whether each of the 23 noise tags is present or absent in the recording. The secondary goal is to classify the 23 fine-grained noise tags and 8 coarse-grained tags. It is sometimes difficult for human to differentiate the closely related noise tags without the use of computer program. For instance, small, medium, and large engines are three fine-grained tags from the coarse-grained engine tag. The absence of noise tag is encoded as 0, while the presence of noise tag is encoded as 1. This report will cover the extraction of baseline Python code using the Git Bash and the Anaconda Juypter Notebook. The interpretations of the Python code to determine the hyperparameters and the model structure of the baseline. The outputs produced by the baseline model code in terms of SoftMax values and loss values. Lastly, the future work and the learning outcome. All the Python codes and the Urban Sound Tagging descriptions in this report were taken from the DCASE community website. [1] Bachelor of Engineering (Electrical and Electronic Engineering) 2022-06-01T12:50:21Z 2022-06-01T12:50:21Z 2022 Final Year Project (FYP) Lim, C. W. (2022). Urban sound tagging. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/158226 https://hdl.handle.net/10356/158226 en A3080-211 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering::Computer hardware, software and systems |
spellingShingle |
Engineering::Electrical and electronic engineering::Computer hardware, software and systems Lim, Cheng Wei Urban sound tagging |
description |
Urban Sound Tagging (UST) seeks to determine whether each of 23 noise sources is present or absent in a 10-second noise by an acoustic sensor network. The 23 noise tags are a multi-label classification problem, and they are common noise complaints in the New York City.
The main goal of the competition is to write a computer program to determine whether each of the 23 noise tags is present or absent in the recording. The secondary goal is to classify the 23 fine-grained noise tags and 8 coarse-grained tags. It is sometimes difficult for human to differentiate the closely related noise tags without the use of computer program. For instance, small, medium, and large engines are three fine-grained tags from the coarse-grained engine tag. The absence of noise tag is encoded as 0, while the presence of noise tag is encoded as 1.
This report will cover the extraction of baseline Python code using the Git Bash and the Anaconda Juypter Notebook. The interpretations of the Python code to determine the hyperparameters and the model structure of the baseline. The outputs produced by the baseline model code in terms of SoftMax values and loss values. Lastly, the future work and the learning outcome.
All the Python codes and the Urban Sound Tagging descriptions in this report were taken from the DCASE community website. [1] |
author2 |
Gan Woon Seng |
author_facet |
Gan Woon Seng Lim, Cheng Wei |
format |
Final Year Project |
author |
Lim, Cheng Wei |
author_sort |
Lim, Cheng Wei |
title |
Urban sound tagging |
title_short |
Urban sound tagging |
title_full |
Urban sound tagging |
title_fullStr |
Urban sound tagging |
title_full_unstemmed |
Urban sound tagging |
title_sort |
urban sound tagging |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/158226 |
_version_ |
1772826358234218496 |