Thunderstorm prediction model using SMOTE sampling and machine learning approach.

Thunderstorms are one of the most destructive phenomena worldwide and are primarily associated with lightning and heavy rain that cause human fatalities, urban floods, and crop damage. Therefore, predicting thunderstorms with reasonable accuracy is one of the crucial requirements for the planning an...

Full description

Saved in:
Bibliographic Details
Main Authors: Rufus, Shirley Anak, Ahmad, Noor Azlinda, Abdul-Malek, Zulkurnain, Abdullah, Noradlina
Format: Conference or Workshop Item
Published: 2023
Subjects:
Online Access:http://eprints.utm.my/107746/
http://dx.doi.org/10.1109/APL57308.2023.10182046
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
id my.utm.107746
record_format eprints
spelling my.utm.1077462024-10-02T06:34:53Z http://eprints.utm.my/107746/ Thunderstorm prediction model using SMOTE sampling and machine learning approach. Rufus, Shirley Anak Ahmad, Noor Azlinda Abdul-Malek, Zulkurnain Abdullah, Noradlina TK Electrical engineering. Electronics Nuclear engineering Thunderstorms are one of the most destructive phenomena worldwide and are primarily associated with lightning and heavy rain that cause human fatalities, urban floods, and crop damage. Therefore, predicting thunderstorms with reasonable accuracy is one of the crucial requirements for the planning and management of many applications, including agriculture, flood control, and air traffic control. This study extensively applied the historical lightning and meteorological data from 2011 to 2018 of the southern regions of Peninsular Malaysia to predict thunderstorm occurrence. Positive CG lightning rarely occurs compared to negative CG lightning and also due to the non-linear and complex characteristics of the thunderstorm and lightning itself, leading to an imbalance in the dataset. The resampling technique called SMOTE is introduced to overcome the imbalance of the training dataset. Then the dataset is trained and tested with five Machine Learning (ML) algorithms, including Decision Trees (DT), Adaptive Boosting (AdaBoost), Random Forest (RF), Extra Trees (ET), and Gradient Boosting (GB). The results have shown a good prediction with accuracy (74% to 95%), recall (72% to 93%), precision (76% to 97%), and F1-Score (74% to 95%) with SMOTE. The SMOTE and GB model prediction model is the best algorithm for thunderstorm prediction for this region in terms of performance metrics. In the future, the prediction results based on the lightning pattern and weather dataset will likely alert the related authorities to make an early strategy to handle the occurrence of thunderstorms. 2023-07-19 Conference or Workshop Item PeerReviewed Rufus, Shirley Anak and Ahmad, Noor Azlinda and Abdul-Malek, Zulkurnain and Abdullah, Noradlina (2023) Thunderstorm prediction model using SMOTE sampling and machine learning approach. In: 12th Asia-Pacific International Conference on Lightning, APL 2023, 12 June 2023 - 15 June 2023, Langkawi, Kedah, Malaysia. http://dx.doi.org/10.1109/APL57308.2023.10182046
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic TK Electrical engineering. Electronics Nuclear engineering
spellingShingle TK Electrical engineering. Electronics Nuclear engineering
Rufus, Shirley Anak
Ahmad, Noor Azlinda
Abdul-Malek, Zulkurnain
Abdullah, Noradlina
Thunderstorm prediction model using SMOTE sampling and machine learning approach.
description Thunderstorms are one of the most destructive phenomena worldwide and are primarily associated with lightning and heavy rain that cause human fatalities, urban floods, and crop damage. Therefore, predicting thunderstorms with reasonable accuracy is one of the crucial requirements for the planning and management of many applications, including agriculture, flood control, and air traffic control. This study extensively applied the historical lightning and meteorological data from 2011 to 2018 of the southern regions of Peninsular Malaysia to predict thunderstorm occurrence. Positive CG lightning rarely occurs compared to negative CG lightning and also due to the non-linear and complex characteristics of the thunderstorm and lightning itself, leading to an imbalance in the dataset. The resampling technique called SMOTE is introduced to overcome the imbalance of the training dataset. Then the dataset is trained and tested with five Machine Learning (ML) algorithms, including Decision Trees (DT), Adaptive Boosting (AdaBoost), Random Forest (RF), Extra Trees (ET), and Gradient Boosting (GB). The results have shown a good prediction with accuracy (74% to 95%), recall (72% to 93%), precision (76% to 97%), and F1-Score (74% to 95%) with SMOTE. The SMOTE and GB model prediction model is the best algorithm for thunderstorm prediction for this region in terms of performance metrics. In the future, the prediction results based on the lightning pattern and weather dataset will likely alert the related authorities to make an early strategy to handle the occurrence of thunderstorms.
format Conference or Workshop Item
author Rufus, Shirley Anak
Ahmad, Noor Azlinda
Abdul-Malek, Zulkurnain
Abdullah, Noradlina
author_facet Rufus, Shirley Anak
Ahmad, Noor Azlinda
Abdul-Malek, Zulkurnain
Abdullah, Noradlina
author_sort Rufus, Shirley Anak
title Thunderstorm prediction model using SMOTE sampling and machine learning approach.
title_short Thunderstorm prediction model using SMOTE sampling and machine learning approach.
title_full Thunderstorm prediction model using SMOTE sampling and machine learning approach.
title_fullStr Thunderstorm prediction model using SMOTE sampling and machine learning approach.
title_full_unstemmed Thunderstorm prediction model using SMOTE sampling and machine learning approach.
title_sort thunderstorm prediction model using smote sampling and machine learning approach.
publishDate 2023
url http://eprints.utm.my/107746/
http://dx.doi.org/10.1109/APL57308.2023.10182046
_version_ 1814043512747851776