Cost-benefit analysis of bags in a web warehouse

Bibliographic Details
Main Authors: BHOWMICK, Sourav S., MADRIA, Sanjay Kumar, NG, Wee-Keong, LIM, Ee Peng
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 1999
Online Access: https://ink.library.smu.edu.sg/sis_research/931
http://doi.org/10.1109/IDEAS.1999.787249
Institution: Singapore Management University
Description
Summary: Sets and bags are closely related structures and have been studied in relational databases. A bag differs from a set in that it is sensitive to the number of times an element occurs, while a set is not. In this paper, we introduce the concept of a web bag in the context of a web warehouse called WHOWEDA (Warehouse Of Web Data), which we are currently building. Informally, a web bag is a web table that allows multiple occurrences of identical web tuples. A web bag helps to discover useful knowledge from a web table, such as visible documents (or web sites), luminous documents and luminous paths. In this paper, we provide a cost-benefit analysis of materializing web bags as compared to web tables with distinct web tuples.
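
The set/bag distinction described in the summary can be illustrated with a minimal sketch. This is not WHOWEDA's data model: the representation of a web tuple as a plain Python tuple of URLs, the example URLs, and the helper name `multiplicity` are all assumptions made for illustration only.

```python
from collections import Counter

# Hypothetical web tuples: each one is modeled here as a tuple of document URLs.
# (An assumption for illustration; the paper's web tuples are richer structures.)
web_tuples = [
    ("a.example/index", "b.example/page"),
    ("a.example/index", "b.example/page"),  # identical tuple, second occurrence
    ("a.example/index", "c.example/page"),
]

# Set semantics: a table with distinct tuples only; occurrence counts are lost.
as_set = set(web_tuples)

# Bag semantics: each tuple is kept together with its number of occurrences.
as_bag = Counter(web_tuples)


def multiplicity(bag, web_tuple):
    """Return how many times web_tuple occurs in the bag (0 if absent)."""
    return bag[web_tuple]
```

In this sketch the set holds two entries while the bag still accounts for all three occurrences, which is the information a table of distinct web tuples discards.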