IMPLEMENTATION OF TRANSFER LEARNING ON DEEP NEURAL NETWORK FOR IDENTIFICATION BIRDS SOUND IN SUMATERA ISLAND
The impact of deforestation has made it several animals difficult to communicate with each other. Communication is usually built through the ability to transmit acoustic signals that can be understood by fellow species. The activity of nature sounds produced by animal sounds (biophony) can be used a...
Saved in:
Main Author: | |
---|---|
Format: | Theses |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/68970 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:68970 |
---|---|
spelling |
id-itb.:689702022-09-19T18:56:16ZIMPLEMENTATION OF TRANSFER LEARNING ON DEEP NEURAL NETWORK FOR IDENTIFICATION BIRDS SOUND IN SUMATERA ISLAND Satria, Ardika Indonesia Theses CNN, Deep Learning, Transfer Learning, Mels Spectrogram, MFCC, Sumatera, Bird Classification INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/68970 The impact of deforestation has made it several animals difficult to communicate with each other. Communication is usually built through the ability to transmit acoustic signals that can be understood by fellow species. The activity of nature sounds produced by animal sounds (biophony) can be used as an indicator of changes in their habitat environment. Therefore, it is important for monitoring animal sounds to determine whether these species can adapt to new niches or migrate to other niches. In the past few years, the remaining conservation forest that protects bird species on Sumatera Island is only 5 million ha. In this study, modeling of bird sound classification on Sumatra Island was carried out using the Convolution Neural Network (CNN) method through the implementation of transfer learning on a deep neural network. The transfer learning models used were ResNet50, DenseNet169, and VGG19 which were tested on three optimizers such as Adam, RMSProp, and SGD. The results obtained are that VGG19 is the best model with an accuracy reached 91 percent for the mels spectrogram feature extraction, at a learning rate of 10-4, dropout of 50 percent, batch size 32 with Adam optimizer. In the Mels Frequency Cepstral Coefficient (MFCC) feature extraction, the accuracy obtained is 84 percent at a learning rate of 10-3, dropout of 20 percent, batch size 16 with Adam optimizer. The F1 score classification values obtained for both feature extractions reached 0.91 and 0.84. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
The impact of deforestation has made it several animals difficult to communicate with each other. Communication is usually built through the ability to transmit acoustic signals that can be understood by fellow species. The activity of nature sounds produced by animal sounds (biophony) can be used as an indicator of changes in their habitat environment. Therefore, it is important for monitoring animal sounds to determine whether these species can adapt to new niches or migrate to other niches. In the past few years, the remaining conservation forest that protects bird species on Sumatera Island is only 5 million ha. In this study, modeling of bird sound classification on Sumatra Island was carried out using the Convolution Neural Network (CNN) method through the implementation of transfer learning on a deep neural network. The transfer learning models used were ResNet50, DenseNet169, and VGG19 which were tested on three optimizers such as Adam, RMSProp, and SGD. The results obtained are that VGG19 is the best model with an accuracy reached 91 percent for the mels spectrogram feature extraction, at a learning rate of 10-4, dropout of 50 percent, batch size 32 with Adam optimizer. In the Mels Frequency Cepstral Coefficient (MFCC) feature extraction, the accuracy obtained is 84 percent at a learning rate of 10-3, dropout of 20 percent, batch size 16 with Adam optimizer. The F1 score classification values obtained for both feature extractions reached 0.91 and 0.84. |
format |
Theses |
author |
Satria, Ardika |
spellingShingle |
Satria, Ardika IMPLEMENTATION OF TRANSFER LEARNING ON DEEP NEURAL NETWORK FOR IDENTIFICATION BIRDS SOUND IN SUMATERA ISLAND |
author_facet |
Satria, Ardika |
author_sort |
Satria, Ardika |
title |
IMPLEMENTATION OF TRANSFER LEARNING ON DEEP NEURAL NETWORK FOR IDENTIFICATION BIRDS SOUND IN SUMATERA ISLAND |
title_short |
IMPLEMENTATION OF TRANSFER LEARNING ON DEEP NEURAL NETWORK FOR IDENTIFICATION BIRDS SOUND IN SUMATERA ISLAND |
title_full |
IMPLEMENTATION OF TRANSFER LEARNING ON DEEP NEURAL NETWORK FOR IDENTIFICATION BIRDS SOUND IN SUMATERA ISLAND |
title_fullStr |
IMPLEMENTATION OF TRANSFER LEARNING ON DEEP NEURAL NETWORK FOR IDENTIFICATION BIRDS SOUND IN SUMATERA ISLAND |
title_full_unstemmed |
IMPLEMENTATION OF TRANSFER LEARNING ON DEEP NEURAL NETWORK FOR IDENTIFICATION BIRDS SOUND IN SUMATERA ISLAND |
title_sort |
implementation of transfer learning on deep neural network for identification birds sound in sumatera island |
url |
https://digilib.itb.ac.id/gdl/view/68970 |
_version_ |
1822278363336671232 |