Urban noise tagging and perceptually informed unsupervised clustering

The rapid urbanisation process has led to an increase in noise pollution in urban environments, impacting the well-being and quality of life of city residents. To address this issue, researchers have focused on understanding and categorising urban noise to mitigate its effects effectively. Convoluti...

Full description

Saved in:

Bibliographic Details
Main Author:	Quek, Gordon
Other Authors:	Gan Woon Seng
Format:	Final Year Project
Language:	English
Published:	Nanyang Technological University 2024
Subjects:	Engineering
Online Access:	https://hdl.handle.net/10356/176981
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-176981
record_format	dspace
spelling	sg-ntu-dr.10356-1769812024-05-24T15:45:37Z Urban noise tagging and perceptually informed unsupervised clustering Quek, Gordon Gan Woon Seng School of Electrical and Electronic Engineering EWSGAN@ntu.edu.sg Engineering The rapid urbanisation process has led to an increase in noise pollution in urban environments, impacting the well-being and quality of life of city residents. To address this issue, researchers have focused on understanding and categorising urban noise to mitigate its effects effectively. Convolutional Neural Networks (CNNs) have emerged as a popular choice for audio classification tasks due to their ability to learn structural patterns from audio data. Despite efforts in the field, there is a lack of research on urban audio data from specific locations, such as Singapore, which has unique acoustic features influenced by local factors. This project aims to enhance the performance of CNNs for audio classification using the SINGA:PURA dataset, which consists of audio recordings from the Singapore environment. Using an initial code for machine learning audio classification adapted from Kenneth's DCASE Task1B from 2020 and modified to adapt to the SINGA:PURA dataset. Experiments are conducted to modify model parameters, data preprocessing techniques, and CNN architectures. Results show that a learning rate of 0.001, a batch size of 60, and employing 2 to 4 convolutional layers with 160 mel filters yield optimal performance for classifying Singaporean acoustic environments. However, despite achieving a training accuracy of 80% and a testing accuracy of 46%, overfitting is observed, indicating the need for further research. Recommendations for future work include addressing dataset imbalance through additional data augmentation techniques, refining data preprocessing methods, exploring a broader range of hyperparameters, and leveraging pre-trained CNN models such as VGG16, ResNet, or InceptionV3. These approaches could provide valuable insights into optimising model performance and improving classification accuracy for urban noise tagging and mitigation efforts. Bachelor's degree 2024-05-23T23:05:47Z 2024-05-23T23:05:47Z 2024 Final Year Project (FYP) Quek, G. (2024). Urban noise tagging and perceptually informed unsupervised clustering. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/176981 https://hdl.handle.net/10356/176981 en A3063-231 10.21979/N9/Y8UQ6F application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering
spellingShingle	Engineering Quek, Gordon Urban noise tagging and perceptually informed unsupervised clustering
description	The rapid urbanisation process has led to an increase in noise pollution in urban environments, impacting the well-being and quality of life of city residents. To address this issue, researchers have focused on understanding and categorising urban noise to mitigate its effects effectively. Convolutional Neural Networks (CNNs) have emerged as a popular choice for audio classification tasks due to their ability to learn structural patterns from audio data. Despite efforts in the field, there is a lack of research on urban audio data from specific locations, such as Singapore, which has unique acoustic features influenced by local factors. This project aims to enhance the performance of CNNs for audio classification using the SINGA:PURA dataset, which consists of audio recordings from the Singapore environment. Using an initial code for machine learning audio classification adapted from Kenneth's DCASE Task1B from 2020 and modified to adapt to the SINGA:PURA dataset. Experiments are conducted to modify model parameters, data preprocessing techniques, and CNN architectures. Results show that a learning rate of 0.001, a batch size of 60, and employing 2 to 4 convolutional layers with 160 mel filters yield optimal performance for classifying Singaporean acoustic environments. However, despite achieving a training accuracy of 80% and a testing accuracy of 46%, overfitting is observed, indicating the need for further research. Recommendations for future work include addressing dataset imbalance through additional data augmentation techniques, refining data preprocessing methods, exploring a broader range of hyperparameters, and leveraging pre-trained CNN models such as VGG16, ResNet, or InceptionV3. These approaches could provide valuable insights into optimising model performance and improving classification accuracy for urban noise tagging and mitigation efforts.
author2	Gan Woon Seng
author_facet	Gan Woon Seng Quek, Gordon
format	Final Year Project
author	Quek, Gordon
author_sort	Quek, Gordon
title	Urban noise tagging and perceptually informed unsupervised clustering
title_short	Urban noise tagging and perceptually informed unsupervised clustering
title_full	Urban noise tagging and perceptually informed unsupervised clustering
title_fullStr	Urban noise tagging and perceptually informed unsupervised clustering
title_full_unstemmed	Urban noise tagging and perceptually informed unsupervised clustering
title_sort	urban noise tagging and perceptually informed unsupervised clustering
publisher	Nanyang Technological University
publishDate	2024
url	https://hdl.handle.net/10356/176981
_version_	1806059808162316288

Urban noise tagging and perceptually informed unsupervised clustering

Similar Items