Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data

Machine learning algorithms (ML) are receiving a lot of attention in the development of predictive models for monitoring dengue transmission rates. Previous work has focused only on specific weather variables and algorithms, and there is still a need for a model that uses more variables and algorith...

Full description

Saved in:
Bibliographic Details
Main Authors: Ong, Song Quan, Pradeep Isawasan, Ahmad Mohiddin Mohd Ngesom, Hanipah Shahar, As’malia Md Lasim, Gomesh Nair
Format: Article
Language:English
English
Published: Springer Nature 2023
Subjects:
Online Access:https://eprints.ums.edu.my/id/eprint/38089/1/ABSTRACT.pdf
https://eprints.ums.edu.my/id/eprint/38089/2/FULL%20TEXT.pdf
https://eprints.ums.edu.my/id/eprint/38089/
https://doi.org/10.1038/s41598-023-46342-2
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaysia Sabah
Language: English
English
id my.ums.eprints.38089
record_format eprints
spelling my.ums.eprints.380892024-01-31T07:24:53Z https://eprints.ums.edu.my/id/eprint/38089/ Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data Ong, Song Quan Pradeep Isawasan Ahmad Mohiddin Mohd Ngesom Hanipah Shahar As’malia Md Lasim Gomesh Nair QR355-502 Virology RC109-216 Infectious and parasitic diseases Machine learning algorithms (ML) are receiving a lot of attention in the development of predictive models for monitoring dengue transmission rates. Previous work has focused only on specific weather variables and algorithms, and there is still a need for a model that uses more variables and algorithms that have higher performance. In this study, we use vector indices and meteorological data as predictors to develop the ML models. We trained and validated seven ML algorithms, including an ensemble ML method, and compared their performance using the receiver operating characteristic (ROC) with the area under the curve (AUC), accuracy and F1 score. Our results show that an ensemble ML such as XG Boost, AdaBoost and Random Forest perform better than the logistics regression, Naïve Bayens, decision tree, and support vector machine (SVM), with XGBoost having the highest AUC, accuracy and F1 score. Analysis of the importance of the variables showed that the container index was the least important. By removing this variable, the ML models improved their performance by at least 6% in AUC and F1 score. Our result provides a framework for future studies on the use of predictive models in the development of an early warning system. Springer Nature 2023 Article NonPeerReviewed text en https://eprints.ums.edu.my/id/eprint/38089/1/ABSTRACT.pdf text en https://eprints.ums.edu.my/id/eprint/38089/2/FULL%20TEXT.pdf Ong, Song Quan and Pradeep Isawasan and Ahmad Mohiddin Mohd Ngesom and Hanipah Shahar and As’malia Md Lasim and Gomesh Nair (2023) Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data. Scientifc Reports, 13 (19129). pp. 1-11. https://doi.org/10.1038/s41598-023-46342-2
institution Universiti Malaysia Sabah
building UMS Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Sabah
content_source UMS Institutional Repository
url_provider http://eprints.ums.edu.my/
language English
English
topic QR355-502 Virology
RC109-216 Infectious and parasitic diseases
spellingShingle QR355-502 Virology
RC109-216 Infectious and parasitic diseases
Ong, Song Quan
Pradeep Isawasan
Ahmad Mohiddin Mohd Ngesom
Hanipah Shahar
As’malia Md Lasim
Gomesh Nair
Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data
description Machine learning algorithms (ML) are receiving a lot of attention in the development of predictive models for monitoring dengue transmission rates. Previous work has focused only on specific weather variables and algorithms, and there is still a need for a model that uses more variables and algorithms that have higher performance. In this study, we use vector indices and meteorological data as predictors to develop the ML models. We trained and validated seven ML algorithms, including an ensemble ML method, and compared their performance using the receiver operating characteristic (ROC) with the area under the curve (AUC), accuracy and F1 score. Our results show that an ensemble ML such as XG Boost, AdaBoost and Random Forest perform better than the logistics regression, Naïve Bayens, decision tree, and support vector machine (SVM), with XGBoost having the highest AUC, accuracy and F1 score. Analysis of the importance of the variables showed that the container index was the least important. By removing this variable, the ML models improved their performance by at least 6% in AUC and F1 score. Our result provides a framework for future studies on the use of predictive models in the development of an early warning system.
format Article
author Ong, Song Quan
Pradeep Isawasan
Ahmad Mohiddin Mohd Ngesom
Hanipah Shahar
As’malia Md Lasim
Gomesh Nair
author_facet Ong, Song Quan
Pradeep Isawasan
Ahmad Mohiddin Mohd Ngesom
Hanipah Shahar
As’malia Md Lasim
Gomesh Nair
author_sort Ong, Song Quan
title Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data
title_short Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data
title_full Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data
title_fullStr Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data
title_full_unstemmed Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data
title_sort predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data
publisher Springer Nature
publishDate 2023
url https://eprints.ums.edu.my/id/eprint/38089/1/ABSTRACT.pdf
https://eprints.ums.edu.my/id/eprint/38089/2/FULL%20TEXT.pdf
https://eprints.ums.edu.my/id/eprint/38089/
https://doi.org/10.1038/s41598-023-46342-2
_version_ 1789941993555623936