Chameleon: Diverse detection of web malware

Malware is mostly hidden in JavaScript codes but there are existing tools that detects the malicious codes. Some features of the JavaScript code could not determine the behavior of a script e.g. obfuscated code. Obfuscated code is a source code which is difficult to read for human or machines. Machi...

Full description

Saved in:
Bibliographic Details
Main Authors: Mancia, Riegel Sean D., Penafiel, Gamaliel Micah T., Sia, Kim Patrick A.
Format: text
Language:English
Published: Animo Repository 2012
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/etd_bachelors/11124
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
Language: English
id oai:animorepository.dlsu.edu.ph:etd_bachelors-11769
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:etd_bachelors-117692022-03-02T02:48:32Z Chameleon: Diverse detection of web malware Mancia, Riegel Sean D. Penafiel, Gamaliel Micah T. Sia, Kim Patrick A. Malware is mostly hidden in JavaScript codes but there are existing tools that detects the malicious codes. Some features of the JavaScript code could not determine the behavior of a script e.g. obfuscated code. Obfuscated code is a source code which is difficult to read for human or machines. Machine learning algorithms can be used to classify and detect the malicious code hidden inside the web page. In order to have a classifier, data sets are first needed to be gathered before training classifiers. These data sets are hard to come by and gathering them are more reliable, rather than relying already made data. Chameleon is a plug-in which is capable to detect malicious web page. A trained classifier model is incorporated into plug-in installed in a web browser, Mozilla Firefox. Machine learning algorithms are applied to address unseen malicious threats and instances of malicious obfuscated JavaScript code. Classification algorithms are used for training and testing to build a classifier model. Random Forest is the classification algorithm used to train the classifier model. Data sets, composed of begin and malicious pages, are gathered using web crawler and malicious pages are analyzed with the use of detection tool. Benign web pages are gathered from the top list websites. Malicious web pages sort to repeat their types of attack and change the structure of their every code once in a while. Due to the changing of the attacks, frequently train classifier models with new data or better, to use adaptive learners. 2012-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/etd_bachelors/11124 Bachelor's Theses English Animo Repository Malware (Computer software) Computer Sciences
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
language English
topic Malware (Computer software)
Computer Sciences
spellingShingle Malware (Computer software)
Computer Sciences
Mancia, Riegel Sean D.
Penafiel, Gamaliel Micah T.
Sia, Kim Patrick A.
Chameleon: Diverse detection of web malware
description Malware is mostly hidden in JavaScript codes but there are existing tools that detects the malicious codes. Some features of the JavaScript code could not determine the behavior of a script e.g. obfuscated code. Obfuscated code is a source code which is difficult to read for human or machines. Machine learning algorithms can be used to classify and detect the malicious code hidden inside the web page. In order to have a classifier, data sets are first needed to be gathered before training classifiers. These data sets are hard to come by and gathering them are more reliable, rather than relying already made data. Chameleon is a plug-in which is capable to detect malicious web page. A trained classifier model is incorporated into plug-in installed in a web browser, Mozilla Firefox. Machine learning algorithms are applied to address unseen malicious threats and instances of malicious obfuscated JavaScript code. Classification algorithms are used for training and testing to build a classifier model. Random Forest is the classification algorithm used to train the classifier model. Data sets, composed of begin and malicious pages, are gathered using web crawler and malicious pages are analyzed with the use of detection tool. Benign web pages are gathered from the top list websites. Malicious web pages sort to repeat their types of attack and change the structure of their every code once in a while. Due to the changing of the attacks, frequently train classifier models with new data or better, to use adaptive learners.
format text
author Mancia, Riegel Sean D.
Penafiel, Gamaliel Micah T.
Sia, Kim Patrick A.
author_facet Mancia, Riegel Sean D.
Penafiel, Gamaliel Micah T.
Sia, Kim Patrick A.
author_sort Mancia, Riegel Sean D.
title Chameleon: Diverse detection of web malware
title_short Chameleon: Diverse detection of web malware
title_full Chameleon: Diverse detection of web malware
title_fullStr Chameleon: Diverse detection of web malware
title_full_unstemmed Chameleon: Diverse detection of web malware
title_sort chameleon: diverse detection of web malware
publisher Animo Repository
publishDate 2012
url https://animorepository.dlsu.edu.ph/etd_bachelors/11124
_version_ 1726158596093247488