DOMAIN SPECIFIC LANGUAGES FOR SCRAPER DEVELOPMENT

<p align="justify">The process of web scraping is done by using an application called scraper. Various tools have been built to facilitate the process of scraper development, but these tools can only be used by programmers, while users who need web scraping not only programmers but a...

Full description

Saved in:
Bibliographic Details
Main Author: ARIFANTO - NIM: 23515014, RANDY
Format: Theses
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/30207
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:30207
spelling id-itb.:302072018-03-16T10:19:15ZDOMAIN SPECIFIC LANGUAGES FOR SCRAPER DEVELOPMENT ARIFANTO - NIM: 23515014, RANDY Indonesia Theses INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/30207 <p align="justify">The process of web scraping is done by using an application called scraper. Various tools have been built to facilitate the process of scraper development, but these tools can only be used by programmers, while users who need web scraping not only programmers but also non-programmers. Non programmer referred to in this research is someone who is able to run the computer through the shell and less experienced in programming. It takes a suitable approach in the construction of scrapers by non-programmers. Therefore, in this research will be built scrapingDSL, a domain specific languages that can be used by non-programmers to build scraper. Scrapers built using scrapingDSL must be able to do web scraping against all types of websites with different structures and should be easy to understand, write and modify. To build a scrapingDSL that can meet those needs, the researcher analyzed the tools used in web scraping to determine the right tool for the scrapingDSL engine, analyzed the website structure to determine the need for the scrapingDSL and the translator application, and analyzed the the website content retrieval flow to determine the required language scrapingDSL elements. At the end of the study a test was conducted to determine whether scrapingDSL could be used by non-programmers to build scrapers. Testing is done by requesting 12 non-programmer respondents to build scraper using scrapingDSL.<p align="justify"> text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description <p align="justify">The process of web scraping is done by using an application called scraper. Various tools have been built to facilitate the process of scraper development, but these tools can only be used by programmers, while users who need web scraping not only programmers but also non-programmers. Non programmer referred to in this research is someone who is able to run the computer through the shell and less experienced in programming. It takes a suitable approach in the construction of scrapers by non-programmers. Therefore, in this research will be built scrapingDSL, a domain specific languages that can be used by non-programmers to build scraper. Scrapers built using scrapingDSL must be able to do web scraping against all types of websites with different structures and should be easy to understand, write and modify. To build a scrapingDSL that can meet those needs, the researcher analyzed the tools used in web scraping to determine the right tool for the scrapingDSL engine, analyzed the website structure to determine the need for the scrapingDSL and the translator application, and analyzed the the website content retrieval flow to determine the required language scrapingDSL elements. At the end of the study a test was conducted to determine whether scrapingDSL could be used by non-programmers to build scrapers. Testing is done by requesting 12 non-programmer respondents to build scraper using scrapingDSL.<p align="justify">
format Theses
author ARIFANTO - NIM: 23515014, RANDY
spellingShingle ARIFANTO - NIM: 23515014, RANDY
DOMAIN SPECIFIC LANGUAGES FOR SCRAPER DEVELOPMENT
author_facet ARIFANTO - NIM: 23515014, RANDY
author_sort ARIFANTO - NIM: 23515014, RANDY
title DOMAIN SPECIFIC LANGUAGES FOR SCRAPER DEVELOPMENT
title_short DOMAIN SPECIFIC LANGUAGES FOR SCRAPER DEVELOPMENT
title_full DOMAIN SPECIFIC LANGUAGES FOR SCRAPER DEVELOPMENT
title_fullStr DOMAIN SPECIFIC LANGUAGES FOR SCRAPER DEVELOPMENT
title_full_unstemmed DOMAIN SPECIFIC LANGUAGES FOR SCRAPER DEVELOPMENT
title_sort domain specific languages for scraper development
url https://digilib.itb.ac.id/gdl/view/30207
_version_ 1822923170006433792