Classification on distressed sounds with CNN/RNN
Nowadays, people pay more attention to their personal safety due to the improvements in their quality of life. Imagine if you called for a policeman for help, they would be able to arrive within minutes and that could reduce the chance of crime. This can be done by classifying the distressed sounds...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/139181 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Summary: | Nowadays, people pay more attention to their personal safety due to the improvements in their quality of life. Imagine if you called for a policeman for help, they would be able to arrive within minutes and that could reduce the chance of crime. This can be done by classifying the distressed sounds using Machine Learning. This project can be integrated with a sound-based security system to help those people who need urgent help or assistance. In this report, it focuses on how to classify a distressed sound using the Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN). In particular, the dataset was collected for 3 distressed sounds, “Help”, “Crying” and “Screaming”, and then built a model to determine which distressed sound among them. The model to be implemented is a VGG which is widely used in audio classification. The report shows how to convert an audio classification problem to image recognition, where the fully developed techniques of CNN and RNN can be applied better. In the end, the performance of these two networks was evaluated based on several properties. CNN model performs better in overall with 94% training accuracy and 85% testing accuracy. |
---|