On warehousing historical web information

We present a temporal web data model designed for warehousing historical data from World Wide Web that changes with time. As the Web is now populated with large volume of web information, it has become necessary to capture some useful web information in a data warehouse that supports further intelli...

Full description

Saved in:
Bibliographic Details
Main Authors: CAO, Yinyan, LIM, Ee Peng, NG, Wee-Keong
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2000
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/917
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-1916
record_format dspace
spelling sg-smu-ink.sis_research-19162018-06-25T03:57:41Z On warehousing historical web information CAO, Yinyan LIM, Ee Peng NG, Wee-Keong We present a temporal web data model designed for warehousing historical data from World Wide Web that changes with time. As the Web is now populated with large volume of web information, it has become necessary to capture some useful web information in a data warehouse that supports further intelligent data analysis. Nevertheless, due to the unstructured and dynamic nature of Web, the traditional relational model and its temporal variants could not be used to build such a data warehouse. In this paper, we therefore propose a temporal web data model that captures the connectivities of web documents and their content in the form of temporal web tables. To support the analysis of web data that evolve with time, valid time intervals are associated with each web document. To manipulate temporal web tables, we define a variety of web operators and illustrate their usefulness using some realistic motivating examples. 2000-10-01T07:00:00Z text https://ink.library.smu.edu.sg/sis_research/917 info:doi/10.1007/3-540-45393-8_19 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle Databases and Information Systems
Numerical Analysis and Scientific Computing
CAO, Yinyan
LIM, Ee Peng
NG, Wee-Keong
On warehousing historical web information
description We present a temporal web data model designed for warehousing historical data from World Wide Web that changes with time. As the Web is now populated with large volume of web information, it has become necessary to capture some useful web information in a data warehouse that supports further intelligent data analysis. Nevertheless, due to the unstructured and dynamic nature of Web, the traditional relational model and its temporal variants could not be used to build such a data warehouse. In this paper, we therefore propose a temporal web data model that captures the connectivities of web documents and their content in the form of temporal web tables. To support the analysis of web data that evolve with time, valid time intervals are associated with each web document. To manipulate temporal web tables, we define a variety of web operators and illustrate their usefulness using some realistic motivating examples.
format text
author CAO, Yinyan
LIM, Ee Peng
NG, Wee-Keong
author_facet CAO, Yinyan
LIM, Ee Peng
NG, Wee-Keong
author_sort CAO, Yinyan
title On warehousing historical web information
title_short On warehousing historical web information
title_full On warehousing historical web information
title_fullStr On warehousing historical web information
title_full_unstemmed On warehousing historical web information
title_sort on warehousing historical web information
publisher Institutional Knowledge at Singapore Management University
publishDate 2000
url https://ink.library.smu.edu.sg/sis_research/917
_version_ 1770570769441488896