Accurate and Efficient Mosquito Genus Classification Algorithm Using Candidate-Elimination and Nearest Centroid on Extracted Features of Wingbeat Acoustic Properties
The automatic identification of mosquito genus, if used together with effective strategies of suppression and control may help reduce the spread of mosquito-borne diseases. In this study, we explored and developed a simple and yet very effective algorithm for processing audio files to determine the...
Saved in:
Main Authors: | , |
---|---|
Format: | text |
Published: |
Archīum Ateneo
2021
|
Subjects: | |
Online Access: | https://archium.ateneo.edu/discs-faculty-pubs/230 https://www.sciencedirect.com/science/article/abs/pii/S0010482521007678#! |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Ateneo De Manila University |
id |
ph-ateneo-arc.discs-faculty-pubs-1234 |
---|---|
record_format |
eprints |
spelling |
ph-ateneo-arc.discs-faculty-pubs-12342022-01-31T07:16:48Z Accurate and Efficient Mosquito Genus Classification Algorithm Using Candidate-Elimination and Nearest Centroid on Extracted Features of Wingbeat Acoustic Properties Alar, Hernan S. Fernandez, Proceso L, Jr The automatic identification of mosquito genus, if used together with effective strategies of suppression and control may help reduce the spread of mosquito-borne diseases. In this study, we explored and developed a simple and yet very effective algorithm for processing audio files to determine the presence (or absence) of a mosquito and then identify the correct genus for those involving a mosquito. A dataset of sound recordings from the Humbug Project of Zooniverse, collected by researchers from Oxford University, and actual recordings of mosquitoes in the Philippines were used in this study. Our developed technique involves extracting filter bank values from corresponding spectrograms of the audio files, and we built a classification model based only on three simple statistics from said collected values -- maximum, first quartile and third quartile. Specifically, the maximum values were used in defining thresholds for the candidate-elimination phase of the algorithm, and then the first and third quartile values were used in the succeeding nearest centroid computation phase. The proposed algorithm yielded an impressive 97.2% average classification accuracy from a 5-fold stratified cross validation. This is competitive with the 75.55–97.65% accuracy results reported in literature for different mosquito classification tasks run on different datasets. Moreover, the achieved accuracy is significantly higher than the 86.6% that we gathered from applying a CNN architecture from literature to our same dataset. Aside from being more accurate, the proposed algorithm is also significantly more efficient than the CNN model, requiring much less time (in both training and predicting phases) and memory space. The results offer a promising technique that may also simplify the process of solving other sound-based classification problems. 2021-10-27T07:00:00Z text https://archium.ateneo.edu/discs-faculty-pubs/230 https://www.sciencedirect.com/science/article/abs/pii/S0010482521007678#! Department of Information Systems & Computer Science Faculty Publications Archīum Ateneo mosquito genus classification candidate-elimination nearest centroid Biodiversity Computer Sciences Diseases Genetics and Genomics |
institution |
Ateneo De Manila University |
building |
Ateneo De Manila University Library |
continent |
Asia |
country |
Philippines Philippines |
content_provider |
Ateneo De Manila University Library |
collection |
archium.Ateneo Institutional Repository |
topic |
mosquito genus classification candidate-elimination nearest centroid Biodiversity Computer Sciences Diseases Genetics and Genomics |
spellingShingle |
mosquito genus classification candidate-elimination nearest centroid Biodiversity Computer Sciences Diseases Genetics and Genomics Alar, Hernan S. Fernandez, Proceso L, Jr Accurate and Efficient Mosquito Genus Classification Algorithm Using Candidate-Elimination and Nearest Centroid on Extracted Features of Wingbeat Acoustic Properties |
description |
The automatic identification of mosquito genus, if used together with effective strategies of suppression and control may help reduce the spread of mosquito-borne diseases. In this study, we explored and developed a simple and yet very effective algorithm for processing audio files to determine the presence (or absence) of a mosquito and then identify the correct genus for those involving a mosquito. A dataset of sound recordings from the Humbug Project of Zooniverse, collected by researchers from Oxford University, and actual recordings of mosquitoes in the Philippines were used in this study. Our developed technique involves extracting filter bank values from corresponding spectrograms of the audio files, and we built a classification model based only on three simple statistics from said collected values -- maximum, first quartile and third quartile. Specifically, the maximum values were used in defining thresholds for the candidate-elimination phase of the algorithm, and then the first and third quartile values were used in the succeeding nearest centroid computation phase. The proposed algorithm yielded an impressive 97.2% average classification accuracy from a 5-fold stratified cross validation. This is competitive with the 75.55–97.65% accuracy results reported in literature for different mosquito classification tasks run on different datasets. Moreover, the achieved accuracy is significantly higher than the 86.6% that we gathered from applying a CNN architecture from literature to our same dataset. Aside from being more accurate, the proposed algorithm is also significantly more efficient than the CNN model, requiring much less time (in both training and predicting phases) and memory space. The results offer a promising technique that may also simplify the process of solving other sound-based classification problems. |
format |
text |
author |
Alar, Hernan S. Fernandez, Proceso L, Jr |
author_facet |
Alar, Hernan S. Fernandez, Proceso L, Jr |
author_sort |
Alar, Hernan S. |
title |
Accurate and Efficient Mosquito Genus Classification Algorithm Using Candidate-Elimination and Nearest Centroid on Extracted Features of Wingbeat Acoustic Properties |
title_short |
Accurate and Efficient Mosquito Genus Classification Algorithm Using Candidate-Elimination and Nearest Centroid on Extracted Features of Wingbeat Acoustic Properties |
title_full |
Accurate and Efficient Mosquito Genus Classification Algorithm Using Candidate-Elimination and Nearest Centroid on Extracted Features of Wingbeat Acoustic Properties |
title_fullStr |
Accurate and Efficient Mosquito Genus Classification Algorithm Using Candidate-Elimination and Nearest Centroid on Extracted Features of Wingbeat Acoustic Properties |
title_full_unstemmed |
Accurate and Efficient Mosquito Genus Classification Algorithm Using Candidate-Elimination and Nearest Centroid on Extracted Features of Wingbeat Acoustic Properties |
title_sort |
accurate and efficient mosquito genus classification algorithm using candidate-elimination and nearest centroid on extracted features of wingbeat acoustic properties |
publisher |
Archīum Ateneo |
publishDate |
2021 |
url |
https://archium.ateneo.edu/discs-faculty-pubs/230 https://www.sciencedirect.com/science/article/abs/pii/S0010482521007678#! |
_version_ |
1728621328221601792 |