CLASSIFICATION OF MALWARE USING MACHINE LEARNING BASED ON IMAGE PROCESSING
Malware or Malicious Software is malicious software designed to damage, steal important information or data, interfere with computer performance, and other criminal acts on computers or devices that can harm users. The National Cyber and Crypto Agency (BSSN) said cyber-attacks of a technical nature...
Saved in:
Main Author: | |
---|---|
Format: | Theses |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/57190 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:57190 |
---|---|
spelling |
id-itb.:571902021-07-29T08:54:39ZCLASSIFICATION OF MALWARE USING MACHINE LEARNING BASED ON IMAGE PROCESSING Akbar Abhesa, Radifa Indonesia Theses Malicious Software, CNN, image processing, machine learning INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/57190 Malware or Malicious Software is malicious software designed to damage, steal important information or data, interfere with computer performance, and other criminal acts on computers or devices that can harm users. The National Cyber and Crypto Agency (BSSN) said cyber-attacks of a technical nature in 2020 reached 495,337,202 in Indonesia. This number has doubled compared to 2019 which only reached 228,277,875. To prevent the spread and harm caused by malware, there are various methods such as using machine learning to detect and classify software suspected of being malware. The malware analysis method consists of a static method, where the suspected malware is not executed and a dynamic method, when the software is run to see and analyze its behavior. However, such an approach still takes a long time because it requires various kinds of feature analysis obtained from various types of malwares (feature extraction). In this thesis, a different malware analysis method will be proposed, namely using program visualization and image processing. This method is considered capable of producing a faster analysis process because the analysis process is uniform and overall based on visuals. This thesis aims to explain the process of classifying malware using machine learning methods based on image processing. The steps taken are to convert the software program suspected of being malware into binary bits, then convert them into strings, 8-bit vectors, and then into grayscale images. Convolutional Neural Network (CNN) is used to process malware visualization datasets so that visual patterns can be found with each other. The final model is expected to identify malware into one of the categories/families of an operating system. Parameter testing carried out in the form of measurement of accuracy, error, precision, and sensitivity of the model using a confusion matrix. In the end, the experiment was able to produce a machine learning model with an accuracy rate of 94%. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
Malware or Malicious Software is malicious software designed to damage, steal important information or data, interfere with computer performance, and other criminal acts on computers or devices that can harm users. The National Cyber and Crypto Agency (BSSN) said cyber-attacks of a technical nature in 2020 reached 495,337,202 in Indonesia. This number has doubled compared to 2019 which only reached 228,277,875. To prevent the spread and harm caused by malware, there are various methods such as using machine learning to detect and classify software suspected of being malware. The malware analysis method consists of a static method, where the suspected malware is not executed and a dynamic method, when the software is run to see and analyze its behavior. However, such an approach still takes a long time because it requires various kinds of feature analysis obtained from various types of malwares (feature extraction). In this thesis, a different malware analysis method will be proposed, namely using program visualization and image processing. This method is considered capable of producing a faster analysis process because the analysis process is uniform and overall based on visuals.
This thesis aims to explain the process of classifying malware using machine learning methods based on image processing. The steps taken are to convert the software program suspected of being malware into binary bits, then convert them into strings, 8-bit vectors, and then into grayscale images. Convolutional Neural Network (CNN) is used to process malware visualization datasets so that visual patterns can be found with each other. The final model is expected to identify malware into one of the categories/families of an operating system. Parameter testing carried out in the form of measurement of accuracy, error, precision, and sensitivity of the model using a confusion matrix. In the end, the experiment was able to produce a machine learning model with an accuracy rate of 94%.
|
format |
Theses |
author |
Akbar Abhesa, Radifa |
spellingShingle |
Akbar Abhesa, Radifa CLASSIFICATION OF MALWARE USING MACHINE LEARNING BASED ON IMAGE PROCESSING |
author_facet |
Akbar Abhesa, Radifa |
author_sort |
Akbar Abhesa, Radifa |
title |
CLASSIFICATION OF MALWARE USING MACHINE LEARNING BASED ON IMAGE PROCESSING |
title_short |
CLASSIFICATION OF MALWARE USING MACHINE LEARNING BASED ON IMAGE PROCESSING |
title_full |
CLASSIFICATION OF MALWARE USING MACHINE LEARNING BASED ON IMAGE PROCESSING |
title_fullStr |
CLASSIFICATION OF MALWARE USING MACHINE LEARNING BASED ON IMAGE PROCESSING |
title_full_unstemmed |
CLASSIFICATION OF MALWARE USING MACHINE LEARNING BASED ON IMAGE PROCESSING |
title_sort |
classification of malware using machine learning based on image processing |
url |
https://digilib.itb.ac.id/gdl/view/57190 |
_version_ |
1822002566736642048 |