Anti-scraping tool: Sentinel

Anti-scraping tools (AST) are network security applications that are tasked to ensure that no bots or scrapers can access the website and extract different information. Certain flaws of most anti-scraping tools are only blocking and preventing bots and scrapers intruding in the website. Websites can...

Full description

Saved in:
Bibliographic Details
Main Authors: Alejandro, Paul Eivanhoe C., De La Paz, Algene Kevin H., Guevara, John Christopher, Ong David, John Frederick S.
Format: text
Language:English
Published: Animo Repository 2014
Online Access:https://animorepository.dlsu.edu.ph/etd_bachelors/11812
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
Language: English
id oai:animorepository.dlsu.edu.ph:etd_bachelors-12457
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:etd_bachelors-124572021-09-07T04:03:13Z Anti-scraping tool: Sentinel Alejandro, Paul Eivanhoe C. De La Paz, Algene Kevin H. Guevara, John Christopher Ong David, John Frederick S. Anti-scraping tools (AST) are network security applications that are tasked to ensure that no bots or scrapers can access the website and extract different information. Certain flaws of most anti-scraping tools are only blocking and preventing bots and scrapers intruding in the website. Websites can suffer from repetitive intrusion intervals and bots or scrapers can alter their algorithm in order to bypass the security. Existing anti-scraping tools are dependent on their predetermined list of IP addresses which can be circumvented by attaining a new authentic IP address. The study aims to develop an anti scraping tool that does not solely depend on predetermined IP addresses and diminishes the occurrences of scrapers attacking with new authentic IP addresses. Experiments show to deter automatic scrapers and keep them from repeatedly attacking with short intervals in between each attack. Sentinel detects automatic scrapers through the use of the Blacklist, Rate, Limiter, Test Provider, and Test Checker Modules, feeds the detected scrapers fake information with the use of the Web Trap Module, provides additional information on them with the IP Address Lookup Module, and delays the scrapers it cannot detect using the Computed Hide method explained later on in Chapter 5.8. 2014-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/etd_bachelors/11812 Bachelor's Theses English Animo Repository
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
language English
description Anti-scraping tools (AST) are network security applications that are tasked to ensure that no bots or scrapers can access the website and extract different information. Certain flaws of most anti-scraping tools are only blocking and preventing bots and scrapers intruding in the website. Websites can suffer from repetitive intrusion intervals and bots or scrapers can alter their algorithm in order to bypass the security. Existing anti-scraping tools are dependent on their predetermined list of IP addresses which can be circumvented by attaining a new authentic IP address. The study aims to develop an anti scraping tool that does not solely depend on predetermined IP addresses and diminishes the occurrences of scrapers attacking with new authentic IP addresses. Experiments show to deter automatic scrapers and keep them from repeatedly attacking with short intervals in between each attack. Sentinel detects automatic scrapers through the use of the Blacklist, Rate, Limiter, Test Provider, and Test Checker Modules, feeds the detected scrapers fake information with the use of the Web Trap Module, provides additional information on them with the IP Address Lookup Module, and delays the scrapers it cannot detect using the Computed Hide method explained later on in Chapter 5.8.
format text
author Alejandro, Paul Eivanhoe C.
De La Paz, Algene Kevin H.
Guevara, John Christopher
Ong David, John Frederick S.
spellingShingle Alejandro, Paul Eivanhoe C.
De La Paz, Algene Kevin H.
Guevara, John Christopher
Ong David, John Frederick S.
Anti-scraping tool: Sentinel
author_facet Alejandro, Paul Eivanhoe C.
De La Paz, Algene Kevin H.
Guevara, John Christopher
Ong David, John Frederick S.
author_sort Alejandro, Paul Eivanhoe C.
title Anti-scraping tool: Sentinel
title_short Anti-scraping tool: Sentinel
title_full Anti-scraping tool: Sentinel
title_fullStr Anti-scraping tool: Sentinel
title_full_unstemmed Anti-scraping tool: Sentinel
title_sort anti-scraping tool: sentinel
publisher Animo Repository
publishDate 2014
url https://animorepository.dlsu.edu.ph/etd_bachelors/11812
_version_ 1712577547033640960