Deep transfer learning based and MFCC based acoustic detector of rice weevils, Sitophilus oryzae (L.) in stored grains

Rice postharvest losses affects the sustainability in human consumption of rice. About one quarter to one third of the world grain crop is lost every year during storage, and most of this is caused by insect infestation. Even though there are grains that remain after the feeding of these insects, gr...

Full description

Saved in:
Bibliographic Details
Main Author: Rabano, Stephenn L.
Format: text
Language:English
Published: Animo Repository 2018
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/etd_masteral/6345
https://animorepository.dlsu.edu.ph/context/etd_masteral/article/13418/viewcontent/Rabano_S_MS_ECE_Thesis2_Redacted.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
Language: English
id oai:animorepository.dlsu.edu.ph:etd_masteral-13418
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:etd_masteral-134182022-09-17T04:02:32Z Deep transfer learning based and MFCC based acoustic detector of rice weevils, Sitophilus oryzae (L.) in stored grains Rabano, Stephenn L. Rice postharvest losses affects the sustainability in human consumption of rice. About one quarter to one third of the world grain crop is lost every year during storage, and most of this is caused by insect infestation. Even though there are grains that remain after the feeding of these insects, grain quality is also reduced by damages from the insects. One of these rampant insects is the rice weevil. Rice fumigation and other measures are done to avoid further infestation once the presence of rice weevils has manifested. Immediate recognition of the presence of these insects will prompt for immediate fumigation so that losses in rice postharvest are reduced. This study compared the accuracy of using an MFCC-based model with the accuracy of using deep transfer learning in the detection of rice weevil. An STM32 NUCLEO-F401RE development board was used along with an X-NUCLEO-CCA02M1 expansion board for audio acquisition. The recording was done by the open source audio software Audacity 2.2.2. MFCCs were extracted from the dataset of audio recordings with two clusters, one with rice weevil sound (positive cluster) and another without rice weevil sound (negative cluster). Classification was done using logistic regression on the MFCCs of the audio files. For deep transfer learning, the same dataset was used to generate spectrogram images of the audio files. Some Keras pre-trained models (Xception, VGG16, VGG19, ResNet50, InceptionV3, InceptionResNetV2, and MobileNet), trained on ImageNet, were used on those spectrogram images for feature extraction and prediction. The programming platform that will be used in both methods is Python 3.6.5. Considering preprocessed, unprocessed, and attenuated data, InceptionResNetV2 and MobileNet had the fastest prediction time at 171.8 ms. VGG19 had the highest average prediction at 99%, the highest average recall at 99%, and the highest average F1 score also at 99%. Using the same audio data, the MFCC-based model got an average precision of 97%. The same percentage was also the value of its average recall and average F1 score. This model performed at par with the Keras pre-trained models. 2018-12-01T08:00:00Z text application/pdf https://animorepository.dlsu.edu.ph/etd_masteral/6345 https://animorepository.dlsu.edu.ph/context/etd_masteral/article/13418/viewcontent/Rabano_S_MS_ECE_Thesis2_Redacted.pdf Master's Theses English Animo Repository Acoustic imaging Deep learning (Machine learning) Rice weevil—Detection Electrical and Computer Engineering
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
language English
topic Acoustic imaging
Deep learning (Machine learning)
Rice weevil—Detection
Electrical and Computer Engineering
spellingShingle Acoustic imaging
Deep learning (Machine learning)
Rice weevil—Detection
Electrical and Computer Engineering
Rabano, Stephenn L.
Deep transfer learning based and MFCC based acoustic detector of rice weevils, Sitophilus oryzae (L.) in stored grains
description Rice postharvest losses affects the sustainability in human consumption of rice. About one quarter to one third of the world grain crop is lost every year during storage, and most of this is caused by insect infestation. Even though there are grains that remain after the feeding of these insects, grain quality is also reduced by damages from the insects. One of these rampant insects is the rice weevil. Rice fumigation and other measures are done to avoid further infestation once the presence of rice weevils has manifested. Immediate recognition of the presence of these insects will prompt for immediate fumigation so that losses in rice postharvest are reduced. This study compared the accuracy of using an MFCC-based model with the accuracy of using deep transfer learning in the detection of rice weevil. An STM32 NUCLEO-F401RE development board was used along with an X-NUCLEO-CCA02M1 expansion board for audio acquisition. The recording was done by the open source audio software Audacity 2.2.2. MFCCs were extracted from the dataset of audio recordings with two clusters, one with rice weevil sound (positive cluster) and another without rice weevil sound (negative cluster). Classification was done using logistic regression on the MFCCs of the audio files. For deep transfer learning, the same dataset was used to generate spectrogram images of the audio files. Some Keras pre-trained models (Xception, VGG16, VGG19, ResNet50, InceptionV3, InceptionResNetV2, and MobileNet), trained on ImageNet, were used on those spectrogram images for feature extraction and prediction. The programming platform that will be used in both methods is Python 3.6.5. Considering preprocessed, unprocessed, and attenuated data, InceptionResNetV2 and MobileNet had the fastest prediction time at 171.8 ms. VGG19 had the highest average prediction at 99%, the highest average recall at 99%, and the highest average F1 score also at 99%. Using the same audio data, the MFCC-based model got an average precision of 97%. The same percentage was also the value of its average recall and average F1 score. This model performed at par with the Keras pre-trained models.
format text
author Rabano, Stephenn L.
author_facet Rabano, Stephenn L.
author_sort Rabano, Stephenn L.
title Deep transfer learning based and MFCC based acoustic detector of rice weevils, Sitophilus oryzae (L.) in stored grains
title_short Deep transfer learning based and MFCC based acoustic detector of rice weevils, Sitophilus oryzae (L.) in stored grains
title_full Deep transfer learning based and MFCC based acoustic detector of rice weevils, Sitophilus oryzae (L.) in stored grains
title_fullStr Deep transfer learning based and MFCC based acoustic detector of rice weevils, Sitophilus oryzae (L.) in stored grains
title_full_unstemmed Deep transfer learning based and MFCC based acoustic detector of rice weevils, Sitophilus oryzae (L.) in stored grains
title_sort deep transfer learning based and mfcc based acoustic detector of rice weevils, sitophilus oryzae (l.) in stored grains
publisher Animo Repository
publishDate 2018
url https://animorepository.dlsu.edu.ph/etd_masteral/6345
https://animorepository.dlsu.edu.ph/context/etd_masteral/article/13418/viewcontent/Rabano_S_MS_ECE_Thesis2_Redacted.pdf
_version_ 1772835762918653952