Machine learning / deep learning approach to soundscape analysis

Visual understanding of the soundscape environment is an enabling factor for a wide range of applications in studying how humans perceive sounds. Audiovisual scene decomposition allows further understanding of soundscape. This project will be focusing on the decomposition of urban soundscapes such a...

Full description

Saved in:
Bibliographic Details
Main Author: Koh, Cheng Yong
Other Authors: Gan Woon Seng
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/158057
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-158057
record_format dspace
spelling sg-ntu-dr.10356-1580572023-07-07T19:21:44Z Machine learning / deep learning approach to soundscape analysis Koh, Cheng Yong Gan Woon Seng School of Electrical and Electronic Engineering EWSGAN@ntu.edu.sg Engineering::Electrical and electronic engineering Visual understanding of the soundscape environment is an enabling factor for a wide range of applications in studying how humans perceive sounds. Audiovisual scene decomposition allows further understanding of soundscape. This project will be focusing on the decomposition of urban soundscapes such as parks, plazas, streets, etc. As water sounds are a prominent sound source in urban landscapes, this project will add a new waterbody class to the segmentation model which do not currently exist in most multiclass urban semantic segmentation model. This project proposes the use of the DeepLabV3+ model, with a ResNet50 backbone, trained on an improved Cityscapes dataset to perform semantic segmentation for urban scene decomposition. The training dataset will include additional waterbody images on top of the original Cityscapes images. Bachelor of Engineering (Information Engineering and Media) 2022-05-26T05:49:40Z 2022-05-26T05:49:40Z 2022 Final Year Project (FYP) Koh, C. Y. (2022). Machine learning / deep learning approach to soundscape analysis. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/158057 https://hdl.handle.net/10356/158057 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
spellingShingle Engineering::Electrical and electronic engineering
Koh, Cheng Yong
Machine learning / deep learning approach to soundscape analysis
description Visual understanding of the soundscape environment is an enabling factor for a wide range of applications in studying how humans perceive sounds. Audiovisual scene decomposition allows further understanding of soundscape. This project will be focusing on the decomposition of urban soundscapes such as parks, plazas, streets, etc. As water sounds are a prominent sound source in urban landscapes, this project will add a new waterbody class to the segmentation model which do not currently exist in most multiclass urban semantic segmentation model. This project proposes the use of the DeepLabV3+ model, with a ResNet50 backbone, trained on an improved Cityscapes dataset to perform semantic segmentation for urban scene decomposition. The training dataset will include additional waterbody images on top of the original Cityscapes images.
author2 Gan Woon Seng
author_facet Gan Woon Seng
Koh, Cheng Yong
format Final Year Project
author Koh, Cheng Yong
author_sort Koh, Cheng Yong
title Machine learning / deep learning approach to soundscape analysis
title_short Machine learning / deep learning approach to soundscape analysis
title_full Machine learning / deep learning approach to soundscape analysis
title_fullStr Machine learning / deep learning approach to soundscape analysis
title_full_unstemmed Machine learning / deep learning approach to soundscape analysis
title_sort machine learning / deep learning approach to soundscape analysis
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/158057
_version_ 1772828235044749312