Pi-Web Join in a Web Warehouse

Bibliographic Details
Main Authors: BHOWMICK, Sourav S., MADRIA, Sanjay Kumar, NG, Wee-Keong, LIM, Ee Peng
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 1999
Online Access: https://ink.library.smu.edu.sg/sis_research/981
https://ink.library.smu.edu.sg/context/sis_research/article/1980/viewcontent/58899c350f8888cf875ed7a01c49185f6493.pdf
Institution: Singapore Management University
Description
Summary: With the enormous amount of data stored in the World Wide Web, it is increasingly important to design and develop powerful web warehousing tools. The key objective of our web warehousing project, called WHOWEDA (Warehouse of Web Data), is to design and implement a web warehouse that materializes and manages useful information from the Web. In this paper, we introduce the concept of Pi-web join in the context of WHOWEDA. The Pi-web join operator is a web information manipulation operator that combines relevant web information residing in two web tables. Informally, it is a combination of the web join and web project operators that filters out irrelevant information from a joined web table. We show how to construct the Pi-joined web table and its schema, and we highlight the benefits of the Pi-web join operator.