Android malware detection based on novel representations of apps

In the past decade, advancements in computer vision (CV) and natural language processing (NLP) have been driven significantly by deep representation learning. This progress has made image and text representation learning appealing for applications in fields like malware detection, where deep learnin...

Full description

Saved in:
Bibliographic Details
Main Authors: SUN, Tiezhu, DAOUDI, Nadia, ALLIX, Kevin, SAMHI, Jordan, KIM, Kisub, ZHOU, Xin, KABORE, Abdoul K., KIM, Dongsun, David LO, BISSYANDE, Tegawende F., KLEIN, Jacques
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2025
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/9863
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-10863
record_format dspace
spelling sg-smu-ink.sis_research-108632024-12-24T02:24:02Z Android malware detection based on novel representations of apps SUN, Tiezhu DAOUDI, Nadia ALLIX, Kevin SAMHI, Jordan KIM, Kisub ZHOU, Xin KABORE, Abdoul K. KIM, Dongsun David LO, BISSYANDE, Tegawende F. KLEIN, Jacques In the past decade, advancements in computer vision (CV) and natural language processing (NLP) have been driven significantly by deep representation learning. This progress has made image and text representation learning appealing for applications in fields like malware detection, where deep learning methods can overcome the limitations of traditional hand-crafted feature-based approaches, offering enhanced adaptability to various malware variants. This chapter introduces two novel approaches in malware representation learning that leverage these advancements: DexRay and DexBERT. DexRay employs image-based techniques, transforming DEX file bytecode of apps into grayscale “vector” images. These images are then analyzed using a one-dimensional convolutional neural model to determine the presence of malware. DexBERT, inspired by the BERT language model, processes Smali instructions disassembled from bytecode to generate high-level embedding vectors. These vectors are pivotal for tasks such as malicious code localization and malware detection. Both DexRay and DexBERT have demonstrated significant improvements over traditional machine learning methods in malware detection, particularly in terms of accuracy, efficiency, and adaptability to new malware types. This chapter delves into the methodologies and experimental results of these techniques, highlighting their contributions to the field of malware detection and offering insights into their potential for broader applications in cybersecurity. 2025-01-01T08:00:00Z text https://ink.library.smu.edu.sg/sis_research/9863 info:doi/10.1007/978-3-031-66245-4_8 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Information Security Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Information Security
Software Engineering
spellingShingle Information Security
Software Engineering
SUN, Tiezhu
DAOUDI, Nadia
ALLIX, Kevin
SAMHI, Jordan
KIM, Kisub
ZHOU, Xin
KABORE, Abdoul K.
KIM, Dongsun
David LO,
BISSYANDE, Tegawende F.
KLEIN, Jacques
Android malware detection based on novel representations of apps
description In the past decade, advancements in computer vision (CV) and natural language processing (NLP) have been driven significantly by deep representation learning. This progress has made image and text representation learning appealing for applications in fields like malware detection, where deep learning methods can overcome the limitations of traditional hand-crafted feature-based approaches, offering enhanced adaptability to various malware variants. This chapter introduces two novel approaches in malware representation learning that leverage these advancements: DexRay and DexBERT. DexRay employs image-based techniques, transforming DEX file bytecode of apps into grayscale “vector” images. These images are then analyzed using a one-dimensional convolutional neural model to determine the presence of malware. DexBERT, inspired by the BERT language model, processes Smali instructions disassembled from bytecode to generate high-level embedding vectors. These vectors are pivotal for tasks such as malicious code localization and malware detection. Both DexRay and DexBERT have demonstrated significant improvements over traditional machine learning methods in malware detection, particularly in terms of accuracy, efficiency, and adaptability to new malware types. This chapter delves into the methodologies and experimental results of these techniques, highlighting their contributions to the field of malware detection and offering insights into their potential for broader applications in cybersecurity.
format text
author SUN, Tiezhu
DAOUDI, Nadia
ALLIX, Kevin
SAMHI, Jordan
KIM, Kisub
ZHOU, Xin
KABORE, Abdoul K.
KIM, Dongsun
David LO,
BISSYANDE, Tegawende F.
KLEIN, Jacques
author_facet SUN, Tiezhu
DAOUDI, Nadia
ALLIX, Kevin
SAMHI, Jordan
KIM, Kisub
ZHOU, Xin
KABORE, Abdoul K.
KIM, Dongsun
David LO,
BISSYANDE, Tegawende F.
KLEIN, Jacques
author_sort SUN, Tiezhu
title Android malware detection based on novel representations of apps
title_short Android malware detection based on novel representations of apps
title_full Android malware detection based on novel representations of apps
title_fullStr Android malware detection based on novel representations of apps
title_full_unstemmed Android malware detection based on novel representations of apps
title_sort android malware detection based on novel representations of apps
publisher Institutional Knowledge at Singapore Management University
publishDate 2025
url https://ink.library.smu.edu.sg/sis_research/9863
_version_ 1820027802662993920