What is that sound? An automated sound classification system using machine learning
In modern times nowadays, the need for automation is becoming more prevalent as companies in the Information technology sector seek to produce automated solutions to improve their customers’ performance and productivity. One of the demands of today’s is the automation of audio classification, which...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/150286 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-150286 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1502862023-07-07T18:01:10Z What is that sound? An automated sound classification system using machine learning Cher, Jensen Andy Khong W H School of Electrical and Electronic Engineering AndyKhong@ntu.edu.sg Engineering::Electrical and electronic engineering::Computer hardware, software and systems Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence In modern times nowadays, the need for automation is becoming more prevalent as companies in the Information technology sector seek to produce automated solutions to improve their customers’ performance and productivity. One of the demands of today’s is the automation of audio classification, which can differentiate different genres of sounds. The project aims to create and investigate machine learning models and algorithms with the capability to detect and classify different types of sound based on their audio features. In this report, machine learning models such as Convolutional Neural Network (CNN), Recurrent Neural Network (RNN), and a combination of both Convolutional Recurrent Neural Network (CRNN) were applied as researches has been done upon CNN and RNN individually and combination of both, the researchers found the combination of both produces better results. A combination of the MFCC feature set and the machine learning models mentioned was chosen to be the main methodology idea for this project. The overall outcome of this project was satisfactory with an accuracy achieving 72% classification results on the CRNN model. Improvements to the dataset and the models can still be made to enhance and upgrade the models’ accuracy, efficiency, and performance. Bachelor of Engineering (Electrical and Electronic Engineering) 2021-06-09T09:44:12Z 2021-06-09T09:44:12Z 2021 Final Year Project (FYP) Cher, J. (2021). What is that sound? An automated sound classification system using machine learning. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/150286 https://hdl.handle.net/10356/150286 en A3015-201 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering::Computer hardware, software and systems Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence |
spellingShingle |
Engineering::Electrical and electronic engineering::Computer hardware, software and systems Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Cher, Jensen What is that sound? An automated sound classification system using machine learning |
description |
In modern times nowadays, the need for automation is becoming more prevalent as companies in the Information technology sector seek to produce automated solutions to improve their customers’ performance and productivity. One of the demands of today’s is the automation of audio classification, which can differentiate different genres of sounds. The project aims to create and investigate machine learning models and algorithms with the capability to detect and classify different types of sound based on their audio features. In this report, machine learning models such as Convolutional Neural Network (CNN), Recurrent Neural Network (RNN), and a combination of both Convolutional Recurrent Neural Network (CRNN) were applied as researches has been done upon CNN and RNN individually and combination of both, the researchers found the combination of both produces better results. A combination of the MFCC feature set and the machine learning models mentioned was chosen to be the main methodology idea for this project. The overall outcome of this project was satisfactory with an accuracy achieving 72% classification results on the CRNN model. Improvements to the dataset and the models can still be made to enhance and upgrade the models’ accuracy, efficiency, and performance. |
author2 |
Andy Khong W H |
author_facet |
Andy Khong W H Cher, Jensen |
format |
Final Year Project |
author |
Cher, Jensen |
author_sort |
Cher, Jensen |
title |
What is that sound? An automated sound classification system using machine learning |
title_short |
What is that sound? An automated sound classification system using machine learning |
title_full |
What is that sound? An automated sound classification system using machine learning |
title_fullStr |
What is that sound? An automated sound classification system using machine learning |
title_full_unstemmed |
What is that sound? An automated sound classification system using machine learning |
title_sort |
what is that sound? an automated sound classification system using machine learning |
publisher |
Nanyang Technological University |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/150286 |
_version_ |
1772825108498350080 |