Accurate and Efficient Mosquito Genus Classification Algorithm Using Candidate-Elimination and Nearest Centroid on Extracted Features of Wingbeat Acoustic Properties

The automatic identification of mosquito genus, if used together with effective strategies of suppression and control may help reduce the spread of mosquito-borne diseases. In this study, we explored and developed a simple and yet very effective algorithm for processing audio files to determine the...

Full description

Saved in:
Bibliographic Details
Main Authors: Alar, Hernan S., Fernandez, Proceso L, Jr
Format: text
Published: Archīum Ateneo 2021
Subjects:
Online Access:https://archium.ateneo.edu/discs-faculty-pubs/230
https://www.sciencedirect.com/science/article/abs/pii/S0010482521007678#!
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Ateneo De Manila University
id ph-ateneo-arc.discs-faculty-pubs-1234
record_format eprints
spelling ph-ateneo-arc.discs-faculty-pubs-12342022-01-31T07:16:48Z Accurate and Efficient Mosquito Genus Classification Algorithm Using Candidate-Elimination and Nearest Centroid on Extracted Features of Wingbeat Acoustic Properties Alar, Hernan S. Fernandez, Proceso L, Jr The automatic identification of mosquito genus, if used together with effective strategies of suppression and control may help reduce the spread of mosquito-borne diseases. In this study, we explored and developed a simple and yet very effective algorithm for processing audio files to determine the presence (or absence) of a mosquito and then identify the correct genus for those involving a mosquito. A dataset of sound recordings from the Humbug Project of Zooniverse, collected by researchers from Oxford University, and actual recordings of mosquitoes in the Philippines were used in this study. Our developed technique involves extracting filter bank values from corresponding spectrograms of the audio files, and we built a classification model based only on three simple statistics from said collected values -- maximum, first quartile and third quartile. Specifically, the maximum values were used in defining thresholds for the candidate-elimination phase of the algorithm, and then the first and third quartile values were used in the succeeding nearest centroid computation phase. The proposed algorithm yielded an impressive 97.2% average classification accuracy from a 5-fold stratified cross validation. This is competitive with the 75.55–97.65% accuracy results reported in literature for different mosquito classification tasks run on different datasets. Moreover, the achieved accuracy is significantly higher than the 86.6% that we gathered from applying a CNN architecture from literature to our same dataset. Aside from being more accurate, the proposed algorithm is also significantly more efficient than the CNN model, requiring much less time (in both training and predicting phases) and memory space. The results offer a promising technique that may also simplify the process of solving other sound-based classification problems. 2021-10-27T07:00:00Z text https://archium.ateneo.edu/discs-faculty-pubs/230 https://www.sciencedirect.com/science/article/abs/pii/S0010482521007678#! Department of Information Systems & Computer Science Faculty Publications Archīum Ateneo mosquito genus classification candidate-elimination nearest centroid Biodiversity Computer Sciences Diseases Genetics and Genomics
institution Ateneo De Manila University
building Ateneo De Manila University Library
continent Asia
country Philippines
Philippines
content_provider Ateneo De Manila University Library
collection archium.Ateneo Institutional Repository
topic mosquito
genus
classification
candidate-elimination
nearest centroid
Biodiversity
Computer Sciences
Diseases
Genetics and Genomics
spellingShingle mosquito
genus
classification
candidate-elimination
nearest centroid
Biodiversity
Computer Sciences
Diseases
Genetics and Genomics
Alar, Hernan S.
Fernandez, Proceso L, Jr
Accurate and Efficient Mosquito Genus Classification Algorithm Using Candidate-Elimination and Nearest Centroid on Extracted Features of Wingbeat Acoustic Properties
description The automatic identification of mosquito genus, if used together with effective strategies of suppression and control may help reduce the spread of mosquito-borne diseases. In this study, we explored and developed a simple and yet very effective algorithm for processing audio files to determine the presence (or absence) of a mosquito and then identify the correct genus for those involving a mosquito. A dataset of sound recordings from the Humbug Project of Zooniverse, collected by researchers from Oxford University, and actual recordings of mosquitoes in the Philippines were used in this study. Our developed technique involves extracting filter bank values from corresponding spectrograms of the audio files, and we built a classification model based only on three simple statistics from said collected values -- maximum, first quartile and third quartile. Specifically, the maximum values were used in defining thresholds for the candidate-elimination phase of the algorithm, and then the first and third quartile values were used in the succeeding nearest centroid computation phase. The proposed algorithm yielded an impressive 97.2% average classification accuracy from a 5-fold stratified cross validation. This is competitive with the 75.55–97.65% accuracy results reported in literature for different mosquito classification tasks run on different datasets. Moreover, the achieved accuracy is significantly higher than the 86.6% that we gathered from applying a CNN architecture from literature to our same dataset. Aside from being more accurate, the proposed algorithm is also significantly more efficient than the CNN model, requiring much less time (in both training and predicting phases) and memory space. The results offer a promising technique that may also simplify the process of solving other sound-based classification problems.
format text
author Alar, Hernan S.
Fernandez, Proceso L, Jr
author_facet Alar, Hernan S.
Fernandez, Proceso L, Jr
author_sort Alar, Hernan S.
title Accurate and Efficient Mosquito Genus Classification Algorithm Using Candidate-Elimination and Nearest Centroid on Extracted Features of Wingbeat Acoustic Properties
title_short Accurate and Efficient Mosquito Genus Classification Algorithm Using Candidate-Elimination and Nearest Centroid on Extracted Features of Wingbeat Acoustic Properties
title_full Accurate and Efficient Mosquito Genus Classification Algorithm Using Candidate-Elimination and Nearest Centroid on Extracted Features of Wingbeat Acoustic Properties
title_fullStr Accurate and Efficient Mosquito Genus Classification Algorithm Using Candidate-Elimination and Nearest Centroid on Extracted Features of Wingbeat Acoustic Properties
title_full_unstemmed Accurate and Efficient Mosquito Genus Classification Algorithm Using Candidate-Elimination and Nearest Centroid on Extracted Features of Wingbeat Acoustic Properties
title_sort accurate and efficient mosquito genus classification algorithm using candidate-elimination and nearest centroid on extracted features of wingbeat acoustic properties
publisher Archīum Ateneo
publishDate 2021
url https://archium.ateneo.edu/discs-faculty-pubs/230
https://www.sciencedirect.com/science/article/abs/pii/S0010482521007678#!
_version_ 1728621328221601792