Boosting the accuracy of phishing detection with less features using XGBOOST
Phishing has been for a long time a difficult threat in every society as it changes form with time and it has taken billions of dollars from governments, companies and individuals alike. It is an identity theft which employs a kind of social engineering attack to get vital information from individua...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Published: |
iJournals Publication
2020
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/87308/ https://ijournals.in/ijshre-volume-8-issue-2/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Teknologi Malaysia |
id |
my.utm.87308 |
---|---|
record_format |
eprints |
spelling |
my.utm.873082020-10-31T12:29:35Z http://eprints.utm.my/id/eprint/87308/ Boosting the accuracy of phishing detection with less features using XGBOOST Musa, Hajara Gital, A. Y. Bitrus, Mohzo Gideon Juma'at, Nurul Farhana Balde, Muhammad Abubakar L Education (General) Phishing has been for a long time a difficult threat in every society as it changes form with time and it has taken billions of dollars from governments, companies and individuals alike. It is an identity theft which employs a kind of social engineering attack to get vital information from individuals or group of individuals. In this paper we focus on studying various features employed in different phishing attacks. So many studies have been conducted on single feature to have high accuracy for attack detection while others advanced on the use of many features to detect different attack behaviors with high accuracy. Researchers have advanced the study to the adoption and standardization of thirty (30) features to be examined in phishing attack in order to achieve high accuracy of detection. We examined all the features used so far and used XGBOOST classification model to categories the features into different kinds to detect important features. The analysis revealed that some features hampers on the accuracy and are unfruitful which also contributes in slowing the whole detection process. The model helps us to select useful features and weeds out the useless features. This yields higher accuracy and less time in detection process. iJournals Publication 2020-02 Article PeerReviewed Musa, Hajara and Gital, A. Y. and Bitrus, Mohzo Gideon and Juma'at, Nurul Farhana and Balde, Muhammad Abubakar (2020) Boosting the accuracy of phishing detection with less features using XGBOOST. International Journal of Software & Hardware Research in Engineering, 8 (2). pp. 81-90. ISSN 2347-4890 https://ijournals.in/ijshre-volume-8-issue-2/ |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
topic |
L Education (General) |
spellingShingle |
L Education (General) Musa, Hajara Gital, A. Y. Bitrus, Mohzo Gideon Juma'at, Nurul Farhana Balde, Muhammad Abubakar Boosting the accuracy of phishing detection with less features using XGBOOST |
description |
Phishing has been for a long time a difficult threat in every society as it changes form with time and it has taken billions of dollars from governments, companies and individuals alike. It is an identity theft which employs a kind of social engineering attack to get vital information from individuals or group of individuals. In this paper we focus on studying various features employed in different phishing attacks. So many studies have been conducted on single feature to have high accuracy for attack detection while others advanced on the use of many features to detect different attack behaviors with high accuracy. Researchers have advanced the study to the adoption and standardization of thirty (30) features to be examined in phishing attack in order to achieve high accuracy of detection. We examined all the features used so far and used XGBOOST classification model to categories the features into different kinds to detect important features. The analysis revealed that some features hampers on the accuracy and are unfruitful which also contributes in slowing the whole detection process. The model helps us to select useful features and weeds out the useless features. This yields higher accuracy and less time in detection process. |
format |
Article |
author |
Musa, Hajara Gital, A. Y. Bitrus, Mohzo Gideon Juma'at, Nurul Farhana Balde, Muhammad Abubakar |
author_facet |
Musa, Hajara Gital, A. Y. Bitrus, Mohzo Gideon Juma'at, Nurul Farhana Balde, Muhammad Abubakar |
author_sort |
Musa, Hajara |
title |
Boosting the accuracy of phishing detection with less features using XGBOOST |
title_short |
Boosting the accuracy of phishing detection with less features using XGBOOST |
title_full |
Boosting the accuracy of phishing detection with less features using XGBOOST |
title_fullStr |
Boosting the accuracy of phishing detection with less features using XGBOOST |
title_full_unstemmed |
Boosting the accuracy of phishing detection with less features using XGBOOST |
title_sort |
boosting the accuracy of phishing detection with less features using xgboost |
publisher |
iJournals Publication |
publishDate |
2020 |
url |
http://eprints.utm.my/id/eprint/87308/ https://ijournals.in/ijshre-volume-8-issue-2/ |
_version_ |
1683230719041601536 |