IMPLEMENTATION OF NAIVE BAYES-BASED MACHINE LEARNING ALGORITHM IN THE FRAUD DETECTION SYSTEM

Technological developments have an impact on financial transaction methods in the current era. The appearance of mobile transaction services such as e-banking, m-banking, digital payment, and others has become usual in the community. One service that is currently developing is mobile money. This...

Full description

Saved in:
Bibliographic Details
Main Author: Nurulhaqi Syahidah, Aisyah
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/49929
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:49929
spelling id-itb.:499292020-09-21T14:13:32ZIMPLEMENTATION OF NAIVE BAYES-BASED MACHINE LEARNING ALGORITHM IN THE FRAUD DETECTION SYSTEM Nurulhaqi Syahidah, Aisyah Indonesia Final Project mobile money, fraud, machine learning, fraud detection, naive bayes, gaussian naive bayes, categorical naive bayes INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/49929 Technological developments have an impact on financial transaction methods in the current era. The appearance of mobile transaction services such as e-banking, m-banking, digital payment, and others has become usual in the community. One service that is currently developing is mobile money. This service makes it easy for users to access various financial transactions from cellular phones, for example, are Gopay, OVO, and Dana. Behind the convenience obtained by the user, some risks must be managed so as not to cause harm to both parties, namely users and financial service providers. One of the risks is a fraud that can be observed from the transactions that occur. In this final project, a machine learning algorithm based on Naive Bayes is used, which consists of two algorithms, namely Gaussian Naive Bayes and Categorical Naïve Bayes. The algorithm is implemented through five stages, which are understanding business needs, understanding data, pre-processing data, optimizing parameters and modeling, and evaluating. In the pre-processing data stage, data transformation is performed by the algorithm to be created, the handling of the data imbalance with a combination of oversampling and undersampling methods using the SMOTE algorithm and the RandomUndersampler. At the parameter optimization and modeling stage, the random search method is used with the k-fold crossvalidation algorithm. The evaluation is carried out using four different metrics, which are recall, specificity, precision, and F1 score. The results of this study indicate that when the algorithm is applied to the PaySim test data, the Gaussian Naive Bayes algorithm provides better performance than the Categorical Naive Bayes algorithm. The Gaussian Naive Bayes algorithm can achieve recall 0.926540, fpr 0.190328, specificity 0.809672, auc score 0.868106, precision 0.014658, and F1 score 0.028859. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description Technological developments have an impact on financial transaction methods in the current era. The appearance of mobile transaction services such as e-banking, m-banking, digital payment, and others has become usual in the community. One service that is currently developing is mobile money. This service makes it easy for users to access various financial transactions from cellular phones, for example, are Gopay, OVO, and Dana. Behind the convenience obtained by the user, some risks must be managed so as not to cause harm to both parties, namely users and financial service providers. One of the risks is a fraud that can be observed from the transactions that occur. In this final project, a machine learning algorithm based on Naive Bayes is used, which consists of two algorithms, namely Gaussian Naive Bayes and Categorical Naïve Bayes. The algorithm is implemented through five stages, which are understanding business needs, understanding data, pre-processing data, optimizing parameters and modeling, and evaluating. In the pre-processing data stage, data transformation is performed by the algorithm to be created, the handling of the data imbalance with a combination of oversampling and undersampling methods using the SMOTE algorithm and the RandomUndersampler. At the parameter optimization and modeling stage, the random search method is used with the k-fold crossvalidation algorithm. The evaluation is carried out using four different metrics, which are recall, specificity, precision, and F1 score. The results of this study indicate that when the algorithm is applied to the PaySim test data, the Gaussian Naive Bayes algorithm provides better performance than the Categorical Naive Bayes algorithm. The Gaussian Naive Bayes algorithm can achieve recall 0.926540, fpr 0.190328, specificity 0.809672, auc score 0.868106, precision 0.014658, and F1 score 0.028859.
format Final Project
author Nurulhaqi Syahidah, Aisyah
spellingShingle Nurulhaqi Syahidah, Aisyah
IMPLEMENTATION OF NAIVE BAYES-BASED MACHINE LEARNING ALGORITHM IN THE FRAUD DETECTION SYSTEM
author_facet Nurulhaqi Syahidah, Aisyah
author_sort Nurulhaqi Syahidah, Aisyah
title IMPLEMENTATION OF NAIVE BAYES-BASED MACHINE LEARNING ALGORITHM IN THE FRAUD DETECTION SYSTEM
title_short IMPLEMENTATION OF NAIVE BAYES-BASED MACHINE LEARNING ALGORITHM IN THE FRAUD DETECTION SYSTEM
title_full IMPLEMENTATION OF NAIVE BAYES-BASED MACHINE LEARNING ALGORITHM IN THE FRAUD DETECTION SYSTEM
title_fullStr IMPLEMENTATION OF NAIVE BAYES-BASED MACHINE LEARNING ALGORITHM IN THE FRAUD DETECTION SYSTEM
title_full_unstemmed IMPLEMENTATION OF NAIVE BAYES-BASED MACHINE LEARNING ALGORITHM IN THE FRAUD DETECTION SYSTEM
title_sort implementation of naive bayes-based machine learning algorithm in the fraud detection system
url https://digilib.itb.ac.id/gdl/view/49929
_version_ 1822928315209482240