A Comparison of Open Source Web Crawlers for E-Commerce Websites


Bibliographic Details
Main Authors: Desheng Yang, Pree Thiengburanathum
Format: Conference Proceeding
Published: 2020
Online Access: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85085598345&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/70144
Institution: Chiang Mai University
Description
Summary: © 2020 IEEE. Web crawlers are important tools for retrieving data such as text or images from the internet. A web crawler is an automated program that retrieves information from websites. E-commerce websites are an important application area for crawlers. However, large-scale crawling runs into many problems on the internet, and poorly performing web crawlers can waste considerable development and maintenance resources. Choosing a suitable open source crawler is therefore a significant challenge. This paper reviews previously published studies on open source crawlers. It focuses on summarizing the performance evaluation methods used for open source web crawlers, possible research trends, and related research gaps. In addition, a framework for evaluating open source crawlers is proposed.