Anti-scraping tool: Sentinel
Anti-scraping tools (AST) are network security applications that are tasked to ensure that no bots or scrapers can access the website and extract different information. Certain flaws of most anti-scraping tools are only blocking and preventing bots and scrapers intruding in the website. Websites can...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Animo Repository
2014
|
Online Access: | https://animorepository.dlsu.edu.ph/etd_bachelors/11812 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | De La Salle University |
Language: | English |
id |
oai:animorepository.dlsu.edu.ph:etd_bachelors-12457 |
---|---|
record_format |
eprints |
spelling |
oai:animorepository.dlsu.edu.ph:etd_bachelors-124572021-09-07T04:03:13Z Anti-scraping tool: Sentinel Alejandro, Paul Eivanhoe C. De La Paz, Algene Kevin H. Guevara, John Christopher Ong David, John Frederick S. Anti-scraping tools (AST) are network security applications that are tasked to ensure that no bots or scrapers can access the website and extract different information. Certain flaws of most anti-scraping tools are only blocking and preventing bots and scrapers intruding in the website. Websites can suffer from repetitive intrusion intervals and bots or scrapers can alter their algorithm in order to bypass the security. Existing anti-scraping tools are dependent on their predetermined list of IP addresses which can be circumvented by attaining a new authentic IP address. The study aims to develop an anti scraping tool that does not solely depend on predetermined IP addresses and diminishes the occurrences of scrapers attacking with new authentic IP addresses. Experiments show to deter automatic scrapers and keep them from repeatedly attacking with short intervals in between each attack. Sentinel detects automatic scrapers through the use of the Blacklist, Rate, Limiter, Test Provider, and Test Checker Modules, feeds the detected scrapers fake information with the use of the Web Trap Module, provides additional information on them with the IP Address Lookup Module, and delays the scrapers it cannot detect using the Computed Hide method explained later on in Chapter 5.8. 2014-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/etd_bachelors/11812 Bachelor's Theses English Animo Repository |
institution |
De La Salle University |
building |
De La Salle University Library |
continent |
Asia |
country |
Philippines Philippines |
content_provider |
De La Salle University Library |
collection |
DLSU Institutional Repository |
language |
English |
description |
Anti-scraping tools (AST) are network security applications that are tasked to ensure that no bots or scrapers can access the website and extract different information. Certain flaws of most anti-scraping tools are only blocking and preventing bots and scrapers intruding in the website. Websites can suffer from repetitive intrusion intervals and bots or scrapers can alter their algorithm in order to bypass the security. Existing anti-scraping tools are dependent on their predetermined list of IP addresses which can be circumvented by attaining a new authentic IP address. The study aims to develop an anti scraping tool that does not solely depend on predetermined IP addresses and diminishes the occurrences of scrapers attacking with new authentic IP addresses. Experiments show to deter automatic scrapers and keep them from repeatedly attacking with short intervals in between each attack. Sentinel detects automatic scrapers through the use of the Blacklist, Rate, Limiter, Test Provider, and Test Checker Modules, feeds the detected scrapers fake information with the use of the Web Trap Module, provides additional information on them with the IP Address Lookup Module, and delays the scrapers it cannot detect using the Computed Hide method explained later on in Chapter 5.8. |
format |
text |
author |
Alejandro, Paul Eivanhoe C. De La Paz, Algene Kevin H. Guevara, John Christopher Ong David, John Frederick S. |
spellingShingle |
Alejandro, Paul Eivanhoe C. De La Paz, Algene Kevin H. Guevara, John Christopher Ong David, John Frederick S. Anti-scraping tool: Sentinel |
author_facet |
Alejandro, Paul Eivanhoe C. De La Paz, Algene Kevin H. Guevara, John Christopher Ong David, John Frederick S. |
author_sort |
Alejandro, Paul Eivanhoe C. |
title |
Anti-scraping tool: Sentinel |
title_short |
Anti-scraping tool: Sentinel |
title_full |
Anti-scraping tool: Sentinel |
title_fullStr |
Anti-scraping tool: Sentinel |
title_full_unstemmed |
Anti-scraping tool: Sentinel |
title_sort |
anti-scraping tool: sentinel |
publisher |
Animo Repository |
publishDate |
2014 |
url |
https://animorepository.dlsu.edu.ph/etd_bachelors/11812 |
_version_ |
1712577547033640960 |