Tuple Source Relational Model: A Source-Aware Data Model for Multidatabases

In some integration projects, complete integration of database instances may not be necessary. It may also be too costly and impossible to do so due to poor local data quality and insufficient instance-level knowledge. In this research, we study how multidatabases with global schemas should be repre...

Full description

Saved in:
Bibliographic Details
Main Authors: LIM, Ee Peng, CHIANG, Roger Hsiang-Li, Cao, Yinyan
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 1998
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/60
https://ink.library.smu.edu.sg/context/sis_research/article/1059/viewcontent/1_s2.0_S0169023X9900021X_main.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
Description
Summary:In some integration projects, complete integration of database instances may not be necessary. It may also be too costly and impossible to do so due to poor local data quality and insufficient instance-level knowledge. In this research, we study how multidatabases with global schemas should be represented and manipulated when the data instances from the local databases do not require to be fully integrated. We propose the tuple source (TS) relational model to represent multidatabases under such an integration requirement. This model extends the classical relational model by augmenting every relation with a source attribute to identify the local database that the tuples come from. The source attribute can also be used to specify the right context to interpret global data instances. To manipulate TS relations, we have developed a set of tuple source relational algebraic operations and an extended SQL query language known as TS-SQL. With TS relational model, flexible multidatabase queries that involve instances from different local databases can be formulated easily. In this paper, we also reported our distributed query processing and optimization strategies and their implementation.