A benchmark comparison of perceptual models for soundscapes on a large-scale augmented soundscape dataset
Psychoacoustic indicators and spectrogram-based features are standard inputs to perceptual models for soundscape analysis. However, existing models in the soundscape literature are trained on different collections of input parameters and frequently mutually exclusive datasets, which complicates fair comparisons of model performance and generalizability to unseen data and soundscapes.
Main Authors: Ooi, Kenneth; Watcharasupat, Karn N.; Lam, Bhan; Ong, Zhen-Ting; Gan, Woon-Seng
Other Authors: School of Electrical and Electronic Engineering
Format: Conference or Workshop Item
Language: English
Published: 2023
Subjects: Engineering::Electrical and electronic engineering; Soundscape Augmentation; Dataset; Regression; Classification; Deep Neural Networks
Online Access: https://hdl.handle.net/10356/164983 ; https://www.ica2022korea.org/data/Proceedings_A14.pdf ; https://ica2022korea.org/
Institution: Nanyang Technological University
id: sg-ntu-dr.10356-164983
record_format: dspace
Record last modified: 2023-04-28T15:44:33Z
Conference: 24th International Congress on Acoustics (ICA 2022)
Research group: Digital Signal Processing Laboratory
Funders: Ministry of National Development (MND); National Research Foundation (NRF)
Version: Published version
Funding statement: This research is supported by the Singapore Ministry of National Development and the National Research Foundation, Prime Minister's Office under the Cities of Tomorrow Research Programme (Award No. COT-V4-2020-1). Any opinions, findings and conclusions or recommendations expressed in this material are those of the authors and do not reflect the views of the National Research Foundation, Singapore, or the Ministry of National Development, Singapore.
Date accessioned/available: 2023-04-24T05:10:05Z
Date issued: 2022
Type: Conference Paper
Citation: Ooi, K., Watcharasupat, K. N., Lam, B., Ong, Z. & Gan, W. (2022). A benchmark comparison of perceptual models for soundscapes on a large-scale augmented soundscape dataset. 24th International Congress on Acoustics (ICA 2022). https://hdl.handle.net/10356/164983
Grant number: COT-V4-2020-1
DOI: 10.21979/N9/9OTEVX
Rights: © 2022 The Author(s). Published by the Acoustical Society of Korea. All rights reserved. This paper was published in Proceedings of the 24th International Congress on Acoustics (ICA 2022) and is made available with permission of The Author(s).
File format: application/pdf
institution: Nanyang Technological University
building: NTU Library
continent: Asia
country: Singapore
content_provider: NTU Library
collection: DR-NTU
language: English
topic: Engineering::Electrical and electronic engineering; Soundscape Augmentation; Dataset; Regression; Classification; Deep Neural Networks
description: Psychoacoustic indicators and spectrogram-based features are standard inputs to perceptual models for soundscape analysis. However, existing models in the soundscape literature are trained on different collections of input parameters and frequently mutually exclusive datasets, which complicates fair comparisons of model performance and generalizability to unseen data and soundscapes. Hence, we use the ARAUS dataset, a large-scale, publicly-available dataset of 25,440 responses to unique augmented soundscapes, as a common benchmark for comparison of the relative performance of a selection of input and model types used in previous soundscape studies, as well as deep neural network architectures commonly used for other acoustic tasks. The different model types were used in a regression task to predict “Eventfulness” ratings (as defined in ISO/TS 12913-3) given by participants as responses to the augmented soundscapes. Subsequently, we compared their performance as classification models for the classification of soundscapes into the quadrants generated by the Pleasantness-Eventfulness axes of the ISO/TS 12913-3 circumplex model. The five-fold cross-validation set of 25,200 responses and an independent test set of 240 responses making up the ARAUS dataset were used to facilitate unbiased comparisons.
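The quadrant classification described in the abstract can be made concrete with a short sketch. The snippet below is not taken from the paper: it assumes that a regression model (or a pair of models) outputs Pleasantness and Eventfulness predictions centred on zero, as in the ISO/TS 12913-3 circumplex projection, and uses the commonly cited quadrant labels (vibrant, chaotic, calm, monotonous); the thresholds, labels, and evaluation pipeline used in the study itself may differ.

```python
# Illustrative sketch only (not code from the paper): map predicted
# Pleasantness/Eventfulness values to the four circumplex quadrants.
# Assumes predictions are centred on zero, as in the ISO/TS 12913-3
# projection; quadrant labels follow the commonly used circumplex terms.
import numpy as np

def pe_to_quadrant(pleasantness: np.ndarray, eventfulness: np.ndarray) -> np.ndarray:
    """Assign each (Pleasantness, Eventfulness) pair to a circumplex quadrant."""
    quadrants = np.empty(pleasantness.shape, dtype=object)
    quadrants[(pleasantness >= 0) & (eventfulness >= 0)] = "vibrant"     # P+, E+
    quadrants[(pleasantness < 0) & (eventfulness >= 0)] = "chaotic"      # P-, E+
    quadrants[(pleasantness >= 0) & (eventfulness < 0)] = "calm"         # P+, E-
    quadrants[(pleasantness < 0) & (eventfulness < 0)] = "monotonous"    # P-, E-
    return quadrants

# Hypothetical regression outputs for four augmented soundscapes
p_pred = np.array([0.4, -0.2, 0.1, -0.6])
e_pred = np.array([0.3, 0.5, -0.7, -0.1])
print(pe_to_quadrant(p_pred, e_pred))
# -> ['vibrant' 'chaotic' 'calm' 'monotonous']
```

Classification metrics can then be computed by comparing quadrant labels derived from model predictions against quadrant labels derived from participants' actual ratings.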
author2: School of Electrical and Electronic Engineering
format: Conference or Workshop Item
author: Ooi, Kenneth; Watcharasupat, Karn N.; Lam, Bhan; Ong, Zhen-Ting; Gan, Woon-Seng
author_sort: Ooi, Kenneth
title: A benchmark comparison of perceptual models for soundscapes on a large-scale augmented soundscape dataset
publishDate: 2023
url: https://hdl.handle.net/10356/164983 ; https://www.ica2022korea.org/data/Proceedings_A14.pdf ; https://ica2022korea.org/
_version_: 1765213830558777344