The study of attention mechanism in sound classification
In modern society today, high rise building and motor vehicles exist all around us. The robust sound events detected within the environment can be heard clearly by us. However, this might not be the case for machines. I.e. Robots. Sounds captured by machines, are often filled with interference from...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/149298 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-149298 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1492982023-07-07T18:27:11Z The study of attention mechanism in sound classification Goh, Jonathan Jun Yu Andy Khong W H School of Electrical and Electronic Engineering AndyKhong@ntu.edu.sg Engineering::Electrical and electronic engineering In modern society today, high rise building and motor vehicles exist all around us. The robust sound events detected within the environment can be heard clearly by us. However, this might not be the case for machines. I.e. Robots. Sounds captured by machines, are often filled with interference from the environment. As a result, when specific sounds such as breaking of glass or even car honks are detected, machines are unable to identify the sound event accurately. This created the idea of developing an automated sound classifier using the deep-learning model. By leveraging on the attention mechanism used in transformers in combination with CNN, the new system allows multiple sound events to be attended at the same time which promotes a parallel computation approach as compared to other neural networks such as the Recurrent Neural Network (RNN) and Long-short term memory (LSTM) which suffers from a series computation approach. Overall, The performances of the network models are evaluated based on different parameters to get a comparison of which network models is best suited for time-series dataset. Bachelor of Engineering (Electrical and Electronic Engineering) 2021-05-29T12:10:10Z 2021-05-29T12:10:10Z 2021 Final Year Project (FYP) Goh, J. J. Y. (2021). The study of attention mechanism in sound classification. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/149298 https://hdl.handle.net/10356/149298 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering |
spellingShingle |
Engineering::Electrical and electronic engineering Goh, Jonathan Jun Yu The study of attention mechanism in sound classification |
description |
In modern society today, high rise building and motor vehicles exist all around us. The robust sound events detected within the environment can be heard clearly by us. However, this might not be the case for machines. I.e. Robots. Sounds captured by machines, are often filled with interference from the environment. As a result, when specific sounds such as breaking of glass or even car honks are detected, machines are unable to identify the sound event accurately.
This created the idea of developing an automated sound classifier using the deep-learning model. By leveraging on the attention mechanism used in transformers in combination with CNN, the new system allows multiple sound events to be attended at the same time which promotes a parallel computation approach as compared to other neural networks such as the Recurrent Neural Network (RNN) and Long-short term memory (LSTM) which suffers from a series computation approach. Overall, The performances of the network models are evaluated based on different parameters to get a comparison of which network models is best suited for time-series dataset. |
author2 |
Andy Khong W H |
author_facet |
Andy Khong W H Goh, Jonathan Jun Yu |
format |
Final Year Project |
author |
Goh, Jonathan Jun Yu |
author_sort |
Goh, Jonathan Jun Yu |
title |
The study of attention mechanism in sound classification |
title_short |
The study of attention mechanism in sound classification |
title_full |
The study of attention mechanism in sound classification |
title_fullStr |
The study of attention mechanism in sound classification |
title_full_unstemmed |
The study of attention mechanism in sound classification |
title_sort |
study of attention mechanism in sound classification |
publisher |
Nanyang Technological University |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/149298 |
_version_ |
1772828510263443456 |