What is that sound? An automated sound classification system using machine learning

In modern times nowadays, the need for automation is becoming more prevalent as companies in the Information technology sector seek to produce automated solutions to improve their customers’ performance and productivity. One of the demands of today’s is the automation of audio classification, which...

Full description

Saved in:
Bibliographic Details
Main Author: Cher, Jensen
Other Authors: Andy Khong W H
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/150286
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-150286
record_format dspace
spelling sg-ntu-dr.10356-1502862023-07-07T18:01:10Z What is that sound? An automated sound classification system using machine learning Cher, Jensen Andy Khong W H School of Electrical and Electronic Engineering AndyKhong@ntu.edu.sg Engineering::Electrical and electronic engineering::Computer hardware, software and systems Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence In modern times nowadays, the need for automation is becoming more prevalent as companies in the Information technology sector seek to produce automated solutions to improve their customers’ performance and productivity. One of the demands of today’s is the automation of audio classification, which can differentiate different genres of sounds. The project aims to create and investigate machine learning models and algorithms with the capability to detect and classify different types of sound based on their audio features. In this report, machine learning models such as Convolutional Neural Network (CNN), Recurrent Neural Network (RNN), and a combination of both Convolutional Recurrent Neural Network (CRNN) were applied as researches has been done upon CNN and RNN individually and combination of both, the researchers found the combination of both produces better results. A combination of the MFCC feature set and the machine learning models mentioned was chosen to be the main methodology idea for this project. The overall outcome of this project was satisfactory with an accuracy achieving 72% classification results on the CRNN model. Improvements to the dataset and the models can still be made to enhance and upgrade the models’ accuracy, efficiency, and performance. Bachelor of Engineering (Electrical and Electronic Engineering) 2021-06-09T09:44:12Z 2021-06-09T09:44:12Z 2021 Final Year Project (FYP) Cher, J. (2021). What is that sound? An automated sound classification system using machine learning. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/150286 https://hdl.handle.net/10356/150286 en A3015-201 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering::Computer hardware, software and systems
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle Engineering::Electrical and electronic engineering::Computer hardware, software and systems
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Cher, Jensen
What is that sound? An automated sound classification system using machine learning
description In modern times nowadays, the need for automation is becoming more prevalent as companies in the Information technology sector seek to produce automated solutions to improve their customers’ performance and productivity. One of the demands of today’s is the automation of audio classification, which can differentiate different genres of sounds. The project aims to create and investigate machine learning models and algorithms with the capability to detect and classify different types of sound based on their audio features. In this report, machine learning models such as Convolutional Neural Network (CNN), Recurrent Neural Network (RNN), and a combination of both Convolutional Recurrent Neural Network (CRNN) were applied as researches has been done upon CNN and RNN individually and combination of both, the researchers found the combination of both produces better results. A combination of the MFCC feature set and the machine learning models mentioned was chosen to be the main methodology idea for this project. The overall outcome of this project was satisfactory with an accuracy achieving 72% classification results on the CRNN model. Improvements to the dataset and the models can still be made to enhance and upgrade the models’ accuracy, efficiency, and performance.
author2 Andy Khong W H
author_facet Andy Khong W H
Cher, Jensen
format Final Year Project
author Cher, Jensen
author_sort Cher, Jensen
title What is that sound? An automated sound classification system using machine learning
title_short What is that sound? An automated sound classification system using machine learning
title_full What is that sound? An automated sound classification system using machine learning
title_fullStr What is that sound? An automated sound classification system using machine learning
title_full_unstemmed What is that sound? An automated sound classification system using machine learning
title_sort what is that sound? an automated sound classification system using machine learning
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/150286
_version_ 1772825108498350080