Boosting the accuracy of phishing detection with less features using XGBOOST

Phishing has been for a long time a difficult threat in every society as it changes form with time and it has taken billions of dollars from governments, companies and individuals alike. It is an identity theft which employs a kind of social engineering attack to get vital information from individua...

Full description

Saved in:
Bibliographic Details
Main Authors: Musa, Hajara, Gital, A. Y., Bitrus, Mohzo Gideon, Juma'at, Nurul Farhana, Balde, Muhammad Abubakar
Format: Article
Published: iJournals Publication 2020
Subjects:
Online Access:http://eprints.utm.my/id/eprint/87308/
https://ijournals.in/ijshre-volume-8-issue-2/
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
id my.utm.87308
record_format eprints
spelling my.utm.873082020-10-31T12:29:35Z http://eprints.utm.my/id/eprint/87308/ Boosting the accuracy of phishing detection with less features using XGBOOST Musa, Hajara Gital, A. Y. Bitrus, Mohzo Gideon Juma'at, Nurul Farhana Balde, Muhammad Abubakar L Education (General) Phishing has been for a long time a difficult threat in every society as it changes form with time and it has taken billions of dollars from governments, companies and individuals alike. It is an identity theft which employs a kind of social engineering attack to get vital information from individuals or group of individuals. In this paper we focus on studying various features employed in different phishing attacks. So many studies have been conducted on single feature to have high accuracy for attack detection while others advanced on the use of many features to detect different attack behaviors with high accuracy. Researchers have advanced the study to the adoption and standardization of thirty (30) features to be examined in phishing attack in order to achieve high accuracy of detection. We examined all the features used so far and used XGBOOST classification model to categories the features into different kinds to detect important features. The analysis revealed that some features hampers on the accuracy and are unfruitful which also contributes in slowing the whole detection process. The model helps us to select useful features and weeds out the useless features. This yields higher accuracy and less time in detection process. iJournals Publication 2020-02 Article PeerReviewed Musa, Hajara and Gital, A. Y. and Bitrus, Mohzo Gideon and Juma'at, Nurul Farhana and Balde, Muhammad Abubakar (2020) Boosting the accuracy of phishing detection with less features using XGBOOST. International Journal of Software & Hardware Research in Engineering, 8 (2). pp. 81-90. ISSN 2347-4890 https://ijournals.in/ijshre-volume-8-issue-2/
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic L Education (General)
spellingShingle L Education (General)
Musa, Hajara
Gital, A. Y.
Bitrus, Mohzo Gideon
Juma'at, Nurul Farhana
Balde, Muhammad Abubakar
Boosting the accuracy of phishing detection with less features using XGBOOST
description Phishing has been for a long time a difficult threat in every society as it changes form with time and it has taken billions of dollars from governments, companies and individuals alike. It is an identity theft which employs a kind of social engineering attack to get vital information from individuals or group of individuals. In this paper we focus on studying various features employed in different phishing attacks. So many studies have been conducted on single feature to have high accuracy for attack detection while others advanced on the use of many features to detect different attack behaviors with high accuracy. Researchers have advanced the study to the adoption and standardization of thirty (30) features to be examined in phishing attack in order to achieve high accuracy of detection. We examined all the features used so far and used XGBOOST classification model to categories the features into different kinds to detect important features. The analysis revealed that some features hampers on the accuracy and are unfruitful which also contributes in slowing the whole detection process. The model helps us to select useful features and weeds out the useless features. This yields higher accuracy and less time in detection process.
format Article
author Musa, Hajara
Gital, A. Y.
Bitrus, Mohzo Gideon
Juma'at, Nurul Farhana
Balde, Muhammad Abubakar
author_facet Musa, Hajara
Gital, A. Y.
Bitrus, Mohzo Gideon
Juma'at, Nurul Farhana
Balde, Muhammad Abubakar
author_sort Musa, Hajara
title Boosting the accuracy of phishing detection with less features using XGBOOST
title_short Boosting the accuracy of phishing detection with less features using XGBOOST
title_full Boosting the accuracy of phishing detection with less features using XGBOOST
title_fullStr Boosting the accuracy of phishing detection with less features using XGBOOST
title_full_unstemmed Boosting the accuracy of phishing detection with less features using XGBOOST
title_sort boosting the accuracy of phishing detection with less features using xgboost
publisher iJournals Publication
publishDate 2020
url http://eprints.utm.my/id/eprint/87308/
https://ijournals.in/ijshre-volume-8-issue-2/
_version_ 1683230719041601536