Classification of distressed sounds using CNN/C-RNN

Safety is always the utmost priority in this world where dangers are all around. There may be incidents of snakes, falling trees and even car crashing that may endanger one life. With improvements in quality of life in Singapore, the response from emergency personnel will arrive swiftly when contact...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلف الرئيسي:	Loh, Zhen Ann
مؤلفون آخرون:	Er Meng Hwa
التنسيق:	Final Year Project
اللغة:	English
منشور في:	Nanyang Technological University 2021
الموضوعات:	Engineering::Electrical and electronic engineering::Electronic systems::Signal processing Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
الوصول للمادة أونلاين:	https://hdl.handle.net/10356/149513
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
المؤسسة:	Nanyang Technological University
اللغة:	English

id	sg-ntu-dr.10356-149513
record_format	dspace
spelling	sg-ntu-dr.10356-1495132023-07-07T18:19:27Z Classification of distressed sounds using CNN/C-RNN Loh, Zhen Ann Er Meng Hwa Gan Woon Seng School of Electrical and Electronic Engineering EWSGAN@ntu.edu.sg, EMHER@ntu.edu.sg Engineering::Electrical and electronic engineering::Electronic systems::Signal processing Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Safety is always the utmost priority in this world where dangers are all around. There may be incidents of snakes, falling trees and even car crashing that may endanger one life. With improvements in quality of life in Singapore, the response from emergency personnel will arrive swiftly when contacted by the victim or other people. However, imagine if it were to occur in a deserted area or in a factory where no one else is present and the victim could not obtain help from any means of communication, this project will provide the solution. By having a trained distressed sounds classifier, distressed sounds can be detected so that the investigation team or emergency personnel can seek the victim. Integration of distressed sound detection in a sound-based surveillance system can thus be implemented at several places like factories and deserted areas to extend assistance to people who are distressed, in pain or danger [1]. Hence, this project discusses the development and usage of machine learning techniques, Convolutional Neural Network (CNN) and Convolutional-Recurrent Neural Network (CRNN) model to classify distressed sounds in Singapore’s soundscape. These distressed sounds are categorized into 4 classes: non-distressed sounds, ‘Crying’, ‘Help’, and ‘Screaming’. Furthermore, the models to be implemented are inspired by VGG [2] which is widely used in image and audio classification. In general, this report shows the process of transforming audio classification into an image classification problem where CNN and CRNN can be utilized efficiently. In the end, the performance of these networks was evaluated based on several metrics but unfortunately, they have not shown a feasible result that can be implemented in real-time. CNN and CRNN models have only scored F_β score of 0.3377 and 0.3225 respectively when beta is 2. Keyword: Audio classification, Distressed Sounds, Deep Neural Network Bachelor of Engineering (Electrical and Electronic Engineering) 2021-06-02T08:28:48Z 2021-06-02T08:28:48Z 2021 Final Year Project (FYP) Loh, Z. A. (2021). Classification of distressed sounds using CNN/C-RNN. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/149513 https://hdl.handle.net/10356/149513 en A3080-201 application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Electrical and electronic engineering::Electronic systems::Signal processing Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle	Engineering::Electrical and electronic engineering::Electronic systems::Signal processing Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Loh, Zhen Ann Classification of distressed sounds using CNN/C-RNN
description	Safety is always the utmost priority in this world where dangers are all around. There may be incidents of snakes, falling trees and even car crashing that may endanger one life. With improvements in quality of life in Singapore, the response from emergency personnel will arrive swiftly when contacted by the victim or other people. However, imagine if it were to occur in a deserted area or in a factory where no one else is present and the victim could not obtain help from any means of communication, this project will provide the solution. By having a trained distressed sounds classifier, distressed sounds can be detected so that the investigation team or emergency personnel can seek the victim. Integration of distressed sound detection in a sound-based surveillance system can thus be implemented at several places like factories and deserted areas to extend assistance to people who are distressed, in pain or danger [1]. Hence, this project discusses the development and usage of machine learning techniques, Convolutional Neural Network (CNN) and Convolutional-Recurrent Neural Network (CRNN) model to classify distressed sounds in Singapore’s soundscape. These distressed sounds are categorized into 4 classes: non-distressed sounds, ‘Crying’, ‘Help’, and ‘Screaming’. Furthermore, the models to be implemented are inspired by VGG [2] which is widely used in image and audio classification. In general, this report shows the process of transforming audio classification into an image classification problem where CNN and CRNN can be utilized efficiently. In the end, the performance of these networks was evaluated based on several metrics but unfortunately, they have not shown a feasible result that can be implemented in real-time. CNN and CRNN models have only scored F_β score of 0.3377 and 0.3225 respectively when beta is 2. Keyword: Audio classification, Distressed Sounds, Deep Neural Network
author2	Er Meng Hwa
author_facet	Er Meng Hwa Loh, Zhen Ann
format	Final Year Project
author	Loh, Zhen Ann
author_sort	Loh, Zhen Ann
title	Classification of distressed sounds using CNN/C-RNN
title_short	Classification of distressed sounds using CNN/C-RNN
title_full	Classification of distressed sounds using CNN/C-RNN
title_fullStr	Classification of distressed sounds using CNN/C-RNN
title_full_unstemmed	Classification of distressed sounds using CNN/C-RNN
title_sort	classification of distressed sounds using cnn/c-rnn
publisher	Nanyang Technological University
publishDate	2021
url	https://hdl.handle.net/10356/149513
_version_	1772825650164400128

Classification of distressed sounds using CNN/C-RNN

مواد مشابهة