Storage management of a historical web warehousing system

Bibliographic Details
Main Authors: CAO, Yinyan, LIM, Ee Peng, NG, Wee-Keong
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 2000
Online Access: https://ink.library.smu.edu.sg/sis_research/983
http://doi.org/10.1007/3-540-44469-6_43
Institution: Singapore Management University
Description
Summary: In this paper, we present the storage management of the WHOWEDA web warehousing system, which warehouses historical web information. To facilitate inter-table and intra-table sharing of web pages, we propose a three-layer storage architecture consisting of tuple, table, and pool layers of storage modules, each storing a different part of the warehoused web information. To improve retrieval efficiency, we have chosen to replicate some mode attributes across web tables in the table layer while keeping only unique copies of web pages at the pool layer. Separating table-layer and pool-layer storage also allows multiple web tables to maintain different valid times for the same web pages, because global coupling runs on different schedules across web tables. As the sharing of web pages may lead to valid time inconsistency between different web tables, we propose an update synchronization scheme that resolves the valid time differences on user request.
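
The separation between the table and pool layers described in the summary can be illustrated with a minimal Python sketch. It is not the WHOWEDA implementation. The class names (Pool, WebTable), the couple and synchronize functions, the row fields, and the rule that synchronized tables adopt the later valid time are all assumptions made for illustration. The tuple layer is omitted.

```python
from dataclasses import dataclass, field


@dataclass
class Pool:
    """Pool layer (hypothetical): one stored copy per page version, shared by all tables."""
    pages: dict = field(default_factory=dict)  # (url, version) -> content

    def put(self, url, version, content):
        # setdefault keeps the existing copy if this version is already pooled.
        self.pages.setdefault((url, version), content)


@dataclass
class WebTable:
    """Table layer (hypothetical): references into the pool plus per-table metadata."""
    name: str
    # url -> {"version": int, "title": str, "valid_until": int}
    rows: dict = field(default_factory=dict)

    def couple(self, pool, url, version, content, title, now):
        """Assumed stand-in for global coupling: pool the page, then reference it."""
        pool.put(url, version, content)
        # "title" is an attribute replicated in the table for fast retrieval.
        self.rows[url] = {"version": version, "title": title, "valid_until": now}


def synchronize(a, b):
    """On request, align rows in two tables that reference the same page version.

    Assumed rule (not taken from the paper): both tables adopt the later valid time.
    """
    for url in a.rows.keys() & b.rows.keys():
        ra, rb = a.rows[url], b.rows[url]
        if ra["version"] == rb["version"]:
            latest = max(ra["valid_until"], rb["valid_until"])
            ra["valid_until"] = rb["valid_until"] = latest


pool = Pool()
news = WebTable("news")
sports = WebTable("sports")
# Two tables couple the same page on different schedules (times 10 and 25).
news.couple(pool, "http://example.org/", 1, "<html>v1</html>", "Example", now=10)
sports.couple(pool, "http://example.org/", 1, "<html>v1</html>", "Example", now=25)
synchronize(news, sports)
```

In this sketch the pool holds a single copy of the shared page even though two tables reference it, and the two tables disagree on the valid time until synchronize is called.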