Evaluation of feature selection algorithm for android malware detection
This paper synthesizes an evaluation of feature selection algorithm by utilizing Term Frequency Inverse Document Frequency (TF-IDF) as the main algorithm in Android malware detection. The Android features were filtered before detection process using TF-IDF algorithm. However, IDF is unaware to the t...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Published: |
Science Publishing Corporation
2018
|
Subjects: | |
Online Access: | http://eprints.uthm.edu.my/5019/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Tun Hussein Onn Malaysia |
id |
my.uthm.eprints.5019 |
---|---|
record_format |
eprints |
spelling |
my.uthm.eprints.50192022-01-03T06:14:49Z http://eprints.uthm.edu.my/5019/ Evaluation of feature selection algorithm for android malware detection Mazlan, Nurul Hidayah A Hamid, Isredza Rahmi TA Engineering (General). Civil engineering (General) TA168 Systems engineering This paper synthesizes an evaluation of feature selection algorithm by utilizing Term Frequency Inverse Document Frequency (TF-IDF) as the main algorithm in Android malware detection. The Android features were filtered before detection process using TF-IDF algorithm. However, IDF is unaware to the training class labels and give incorrect weight value to some features. Therefore, the proposed approach modified the TF-IDF algorithm, where the algorithm focused on both sample and feature. Proposed algorithm applied considers the feature based on its level of importance. The related best features in the sample are selected using weight and priority ranking process. This increases the effect of important malware features selected in the Android application sample. These experiments are conducted on a sample collected from DREBIN dataset. The comparison between existing TF-IDF algorithm and modified TF-IDF (MTF-IDF) algorithm have been tested in various conditions such as different number of sample, different number of feature and combination of different types of feature. The analysis results show feature selection using MTF-IDF can improve malware detection analysis. MTF-IDF proved either using various kinds of feature or various kinds of dataset size, algorithm still effective for Android malware detection. MTF-IDF algorithm also proved that it could give appropriate scaling for all features in analyzing Android malware detection. Science Publishing Corporation 2018 Article PeerReviewed Mazlan, Nurul Hidayah and A Hamid, Isredza Rahmi (2018) Evaluation of feature selection algorithm for android malware detection. International Journal of Engineering & Technology, 7 (4.31). pp. 311-315. ISSN 2227-524X |
institution |
Universiti Tun Hussein Onn Malaysia |
building |
UTHM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Tun Hussein Onn Malaysia |
content_source |
UTHM Institutional Repository |
url_provider |
http://eprints.uthm.edu.my/ |
topic |
TA Engineering (General). Civil engineering (General) TA168 Systems engineering |
spellingShingle |
TA Engineering (General). Civil engineering (General) TA168 Systems engineering Mazlan, Nurul Hidayah A Hamid, Isredza Rahmi Evaluation of feature selection algorithm for android malware detection |
description |
This paper synthesizes an evaluation of feature selection algorithm by utilizing Term Frequency Inverse Document Frequency (TF-IDF) as the main algorithm in Android malware detection. The Android features were filtered before detection process using TF-IDF algorithm. However, IDF is unaware to the training class labels and give incorrect weight value to some features. Therefore, the proposed approach modified the TF-IDF algorithm, where the algorithm focused on both sample and feature. Proposed algorithm applied considers the feature based on its level of importance. The related best features in the sample are selected using weight and priority ranking process. This increases the effect of important malware features selected in the Android application sample. These experiments are conducted on a sample collected from DREBIN dataset. The comparison between existing TF-IDF algorithm and modified TF-IDF (MTF-IDF) algorithm have been tested in various conditions such as different number of sample, different number of feature and combination of different types of feature. The analysis results show feature selection using MTF-IDF can improve malware detection analysis. MTF-IDF proved either using various kinds of feature or various kinds of dataset size, algorithm still effective for Android malware detection. MTF-IDF algorithm also proved that it could give appropriate scaling for all features in analyzing Android malware detection. |
format |
Article |
author |
Mazlan, Nurul Hidayah A Hamid, Isredza Rahmi |
author_facet |
Mazlan, Nurul Hidayah A Hamid, Isredza Rahmi |
author_sort |
Mazlan, Nurul Hidayah |
title |
Evaluation of feature selection algorithm for android malware detection |
title_short |
Evaluation of feature selection algorithm for android malware detection |
title_full |
Evaluation of feature selection algorithm for android malware detection |
title_fullStr |
Evaluation of feature selection algorithm for android malware detection |
title_full_unstemmed |
Evaluation of feature selection algorithm for android malware detection |
title_sort |
evaluation of feature selection algorithm for android malware detection |
publisher |
Science Publishing Corporation |
publishDate |
2018 |
url |
http://eprints.uthm.edu.my/5019/ |
_version_ |
1738581325885997056 |