An Algebraic Transformation Framework for Multidatabase Queries

Existence of semantic conflicts between component databases severely impacts query processing in a multidatabase system. In this paper, we describe two types of semantic conflicts that have to be dealt with in the integration of databases modeling information about related sets of real-world entitie...


Bibliographic Details
Main Authors: LIM, Ee Peng, SRIVASTAVA, Jaideep, HWANG, San-Yih
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 1995
Online Access: https://ink.library.smu.edu.sg/sis_research/13
https://doi.org/10.1007/BF01418060
Institution: Singapore Management University
id sg-smu-ink.sis_research-1012
record_format dspace
spelling sg-smu-ink.sis_research-1012 2018-06-12T07:30:25Z An Algebraic Transformation Framework for Multidatabase Queries LIM, Ee Peng SRIVASTAVA, Jaideep HWANG, San-Yih 1995-07-01T07:00:00Z text https://ink.library.smu.edu.sg/sis_research/13 info:doi/10.1007/BF01418060 https://doi.org/10.1007/BF01418060 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University multidatabase query integration operation algebraic transformation constrained query tree outerjoin graph Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic multidatabase query
integration operation
algebraic transformation
constrained query tree
outerjoin graph
Databases and Information Systems
Numerical Analysis and Scientific Computing
description The existence of semantic conflicts between component databases severely impacts query processing in a multidatabase system. In this paper, we describe two types of semantic conflicts that have to be dealt with in the integration of databases modeling information about related sets of real-world entities. These are the entity identification problem and the attribute value conflict problem. While the two-way outerjoin operation has been commonly used for resolving the entity identification problem between two component relations, outerjoins using regular equality comparisons between component relation keys are shown to produce counter-intuitive entity identification results. We remedy this by defining a new key-equality comparator that replaces the regular equality comparator in outerjoins. For the attribute value conflict problem, we define a Generalized Attribute Derivation (GAD) operation, which allows user-defined attribute derivation functions to be used to compute new attributes from the component relations' attributes. Once two-way outerjoin and GAD are added to the set of relational operations, the traditional algebraic transformation framework for relational queries is no longer adequate for multidatabase query processing and optimization. As a result, we introduce the constrained query tree as the multidatabase query representation. We show that some knowledge about query predicates and attribute derivation functions can be used to simplify queries. Such knowledge is modeled as an outerjoin graph attached to every outerjoin operation in the query tree. Based on this, we further extend the traditional algebraic transformation framework to include two-way outerjoins and GAD operations. Our framework demonstrates that properties of selection/join predicates and attribute derivation functions can be used to provide interesting transformation alternatives. This framework also serves as a formal ground for developing optimization strategies for multidatabase queries.
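The two integration operations named in the description can be illustrated with a minimal Python sketch. It is not taken from the paper: the relation contents, the attribute names (`name`, `rating`), the `r_`/`s_` prefixes, and the functions `two_way_outerjoin`, `gad`, `coalesce_name`, and `mean_rating` are all assumptions made for illustration. The record does not define the paper's key-equality comparator, so the sketch matches keys with regular equality, which is the baseline the description says gives counter-intuitive results in some cases.

```python
from typing import Any, Callable, Dict, List, Optional

Row = Dict[str, Any]


def _prefixed(row: Optional[Row], prefix: str, attrs: List[str]) -> Row:
    # Copy a row under prefixed attribute names; a missing row becomes all None.
    return {prefix + a: (row.get(a) if row is not None else None) for a in attrs}


def two_way_outerjoin(r: List[Row], s: List[Row], key: str) -> List[Row]:
    """Full (two-way) outerjoin of r and s on `key`, using regular equality.

    Assumption: plain `==` on the key stands in for the paper's key-equality
    comparator, which this record does not define.
    """
    r_attrs = sorted({a for row in r for a in row})
    s_attrs = sorted({a for row in s for a in row})
    out: List[Row] = []
    matched_s = set()
    for r_row in r:
        partners = [i for i, s_row in enumerate(s) if s_row[key] == r_row[key]]
        if partners:
            for i in partners:
                matched_s.add(i)
                out.append({**_prefixed(r_row, "r_", r_attrs),
                            **_prefixed(s[i], "s_", s_attrs)})
        else:
            # Entity known only to r: pad the s side with None.
            out.append({**_prefixed(r_row, "r_", r_attrs),
                        **_prefixed(None, "s_", s_attrs)})
    for i, s_row in enumerate(s):
        if i not in matched_s:
            # Entity known only to s: pad the r side with None.
            out.append({**_prefixed(None, "r_", r_attrs),
                        **_prefixed(s_row, "s_", s_attrs)})
    return out


def gad(rows: List[Row], derivations: Dict[str, Callable[[Row], Any]]) -> List[Row]:
    """GAD-style step: compute each output attribute with a user-defined function."""
    return [{name: f(row) for name, f in derivations.items()} for row in rows]


def coalesce_name(row: Row) -> Any:
    # Hypothetical derivation: take the name from whichever side has one.
    return row["r_name"] if row["r_name"] is not None else row["s_name"]


def mean_rating(row: Row) -> float:
    # Hypothetical conflict resolution: average the ratings that are present.
    vals = [v for v in (row["r_rating"], row["s_rating"]) if v is not None]
    return sum(vals) / len(vals)


if __name__ == "__main__":
    r = [{"name": "A", "rating": 4}, {"name": "B", "rating": 3}]
    s = [{"name": "B", "rating": 5}, {"name": "C", "rating": 2}]
    joined = two_way_outerjoin(r, s, "name")
    for row in gad(joined, {"name": coalesce_name, "rating": mean_rating}):
        print(row)
```

In this sketch the outerjoin handles entity identification (which rows describe the same real-world entity) and the derivation functions handle the attribute value conflict (what value to report when the two sources disagree). Averaging is only one possible resolution policy; any user-defined function of the joined row could be substituted.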
format text
author LIM, Ee Peng
SRIVASTAVA, Jaideep
HWANG, San-Yih
title An Algebraic Transformation Framework for Multidatabase Queries
publisher Institutional Knowledge at Singapore Management University
publishDate 1995
url https://ink.library.smu.edu.sg/sis_research/13
https://doi.org/10.1007/BF01418060
_version_ 1770568847811674112