Detecting malware using deep learning techniques

Over the years, malware is getting stronger and growing to become a powerful threat in the Information Technological Sector. Once infected on a computing system, the malicious software can perform malicious activities such as employing different encryption algorithms to encrypt users’ data, hinderin...

Full description

Saved in:
Bibliographic Details
Main Author: Koh, Darrell Nern Wei
Other Authors: Zhang Tianwei
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/148025
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:Over the years, malware is getting stronger and growing to become a powerful threat in the Information Technological Sector. Once infected on a computing system, the malicious software can perform malicious activities such as employing different encryption algorithms to encrypt users’ data, hindering the lives of many in the cyber community. Therefore, the importance of mitigating such a cybersecurity risk becomes increasing relevant in the today’s society. Given the advancement of machine learning in recent years, machine learning has achieved recognition for providing solutions to complex classification tasks. Deep learning has also become one of the machine learning technique to be incorporate into anti-malware solutions to identify malwares. The primary purpose of this project is to obtain malware features datasets which is extracted from malicious and benign Windows Portable Executable (PE) files. The 3 static analysis features namely PE section header, PE imports and PE file raw byte stream and 1 dynamic analysis feature known as API call sequences will be extracted for this project. Using the distinct features as input, 4 deep learning models are implemented to perform binary malware classification Moreover, a deep learning ensemble model, combining the above 4 neural networks classifiers, is developed to be utilise as an extension to existing endpoint security software for malware detection. The results of evaluating the ensemble model on unseen data shows a high accuracy of 99.31%, indicating a high prediction capability to classify new and unseen malware samples.