Evaluation of feature selection algorithm for android malware detection

This paper synthesizes an evaluation of feature selection algorithm by utilizing Term Frequency Inverse Document Frequency (TF-IDF) as the main algorithm in Android malware detection. The Android features were filtered before detection process using TF-IDF algorithm. However, IDF is unaware to the t...

Full description

Saved in:
Bibliographic Details
Main Authors: Mazlan, Nurul Hidayah, A Hamid, Isredza Rahmi
Format: Article
Published: Science Publishing Corporation 2018
Subjects:
Online Access:http://eprints.uthm.edu.my/5019/
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Tun Hussein Onn Malaysia
id my.uthm.eprints.5019
record_format eprints
spelling my.uthm.eprints.50192022-01-03T06:14:49Z http://eprints.uthm.edu.my/5019/ Evaluation of feature selection algorithm for android malware detection Mazlan, Nurul Hidayah A Hamid, Isredza Rahmi TA Engineering (General). Civil engineering (General) TA168 Systems engineering This paper synthesizes an evaluation of feature selection algorithm by utilizing Term Frequency Inverse Document Frequency (TF-IDF) as the main algorithm in Android malware detection. The Android features were filtered before detection process using TF-IDF algorithm. However, IDF is unaware to the training class labels and give incorrect weight value to some features. Therefore, the proposed approach modified the TF-IDF algorithm, where the algorithm focused on both sample and feature. Proposed algorithm applied considers the feature based on its level of importance. The related best features in the sample are selected using weight and priority ranking process. This increases the effect of important malware features selected in the Android application sample. These experiments are conducted on a sample collected from DREBIN dataset. The comparison between existing TF-IDF algorithm and modified TF-IDF (MTF-IDF) algorithm have been tested in various conditions such as different number of sample, different number of feature and combination of different types of feature. The analysis results show feature selection using MTF-IDF can improve malware detection analysis. MTF-IDF proved either using various kinds of feature or various kinds of dataset size, algorithm still effective for Android malware detection. MTF-IDF algorithm also proved that it could give appropriate scaling for all features in analyzing Android malware detection. Science Publishing Corporation 2018 Article PeerReviewed Mazlan, Nurul Hidayah and A Hamid, Isredza Rahmi (2018) Evaluation of feature selection algorithm for android malware detection. International Journal of Engineering & Technology, 7 (4.31). pp. 311-315. ISSN 2227-524X
institution Universiti Tun Hussein Onn Malaysia
building UTHM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Tun Hussein Onn Malaysia
content_source UTHM Institutional Repository
url_provider http://eprints.uthm.edu.my/
topic TA Engineering (General). Civil engineering (General)
TA168 Systems engineering
spellingShingle TA Engineering (General). Civil engineering (General)
TA168 Systems engineering
Mazlan, Nurul Hidayah
A Hamid, Isredza Rahmi
Evaluation of feature selection algorithm for android malware detection
description This paper synthesizes an evaluation of feature selection algorithm by utilizing Term Frequency Inverse Document Frequency (TF-IDF) as the main algorithm in Android malware detection. The Android features were filtered before detection process using TF-IDF algorithm. However, IDF is unaware to the training class labels and give incorrect weight value to some features. Therefore, the proposed approach modified the TF-IDF algorithm, where the algorithm focused on both sample and feature. Proposed algorithm applied considers the feature based on its level of importance. The related best features in the sample are selected using weight and priority ranking process. This increases the effect of important malware features selected in the Android application sample. These experiments are conducted on a sample collected from DREBIN dataset. The comparison between existing TF-IDF algorithm and modified TF-IDF (MTF-IDF) algorithm have been tested in various conditions such as different number of sample, different number of feature and combination of different types of feature. The analysis results show feature selection using MTF-IDF can improve malware detection analysis. MTF-IDF proved either using various kinds of feature or various kinds of dataset size, algorithm still effective for Android malware detection. MTF-IDF algorithm also proved that it could give appropriate scaling for all features in analyzing Android malware detection.
format Article
author Mazlan, Nurul Hidayah
A Hamid, Isredza Rahmi
author_facet Mazlan, Nurul Hidayah
A Hamid, Isredza Rahmi
author_sort Mazlan, Nurul Hidayah
title Evaluation of feature selection algorithm for android malware detection
title_short Evaluation of feature selection algorithm for android malware detection
title_full Evaluation of feature selection algorithm for android malware detection
title_fullStr Evaluation of feature selection algorithm for android malware detection
title_full_unstemmed Evaluation of feature selection algorithm for android malware detection
title_sort evaluation of feature selection algorithm for android malware detection
publisher Science Publishing Corporation
publishDate 2018
url http://eprints.uthm.edu.my/5019/
_version_ 1738581325885997056