Cost-benefit analysis of bags in a web warehouse
Sets and bags are closely related structures and have been studied in relational databases. A bag is different from a set in that it is sensitive to the number of times an element occurs while a set is not. In this paper, we introduce the concept of web bag in the context of a web warehouse called W...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
1999
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/931 http://doi.org/10.1109/IDEAS.1999.787249 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
Summary: | Sets and bags are closely related structures and have been studied in relational databases. A bag is different from a set in that it is sensitive to the number of times an element occurs while a set is not. In this paper, we introduce the concept of web bag in the context of a web warehouse called WHOWEDA (Warehouse Of Weda Data) which we are currently building. Informally, a web bag is a web table which allows multiple occurrences of identical web tuples.Web bag helps to discover useful knowledge from a web table such as visible documents (or web sites), luminous docu-ments and luminous paths. In this paper, we provide a cost-benefit analysis of materializing web bags as compared to web tables with distinct web tuples. |
---|