Machine learning approach of predicting Airline flight delay using Naïve Bayes Algorithm / Ahmad Adib Baihaqi Shukri ... [et al.]

The aviation industry plays a critical role in global transportation, facilitating economic growth and revolutionizing travel. However, flight delays have become a growing concern, impacting both airlines and passengers. This study aims to study the Naïve Bayes algorithm...

Full description

Saved in:
Bibliographic Details
Main Authors: Shukri, Ahmad Adib Baihaqi, Mohamed Yusoff, Syarifah Adilah, Warris, Saiful Nizam, Abu Bakar, Mohd Saifulnizam, Kadar, Rozita
Format: Article
Language:English
Published: UiTM Cawangan Perlis 2024
Subjects:
Online Access:https://ir.uitm.edu.my/id/eprint/103187/1/103187.pdf
https://ir.uitm.edu.my/id/eprint/103187/
https://jcrinn.com/index.php/jcrinn
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Mara
Language: English
Description
Summary:The aviation industry plays a critical role in global transportation, facilitating economic growth and revolutionizing travel. However, flight delays have become a growing concern, impacting both airlines and passengers. This study aims to study the Naïve Bayes algorithm for flight delay prediction. The objective is to develop a reliable flight delay prediction model using the Naïve Bayes algorithm and evaluate its performance. The data set that records flight delay and cancellation data from U.S Department of Transportation’s (DOT) was used for the prediction. This study has modified the parameter tuning for Gaussian Naïve Bayes to identify optimum values specifically to construct model for this flight delay dataset. The performance of parameters tuning Gaussian Naïve Bayes model was compared with another two well-known algorithms which are K-Nearest Neighbors (KNN) and Support Vector Machine (SVM)). The KNN and SVM algorithms were alsotrained and tested to complete the binary classification of flight delays for benchmarking purposes. The evaluation of algorithms was fulfilled by comparing the values of accuracy, specificity and ROC AUC score. The comparative analysis showed that the Gaussian Naïve Bayes has the best performance with an accuracy of 93% and KNN has the worst performance with ROC AUC score 63%.