Semantic query transformation for the intelligent integration of information sources

In recent years, there has been a dramatic growth in the number of publicly accessible databases on the Internet and all indicators suggest that this growth should continue in the years to come. Unfortunately, retrieving information from these databases is not easy for several reasons. The first com...

Bibliographic Details
Main Author: IBRAHIM, Ismail Khalil
Format: Theses and Dissertations NonPeerReviewed
Published: Universitas Gadjah Mada 2001
Subjects: Database Management; Mechanical Engineering
Online Access: https://repository.ugm.ac.id/171815/
http://etd.repository.ugm.ac.id/index.php?mod=penelitian_detail&sub=PenelitianDetail&act=view&typ=html&buku_id=96
Institution: Universitas Gadjah Mada
id id-ugm-repo.171815
record_format dspace
spelling id-ugm-repo.171815 2017-08-30T04:53:03Z https://repository.ugm.ac.id/171815/ IBRAHIM, Ismail Khalil (2001) Semantic query transformation for the intelligent integration of information sources. Other thesis, Universitas Gadjah Mada. http://etd.repository.ugm.ac.id/index.php?mod=penelitian_detail&sub=PenelitianDetail&act=view&typ=html&buku_id=96
institution Universitas Gadjah Mada
building UGM Library
country Indonesia
collection Repository Civitas UGM
topic Database Management
Mechanical Engineering
description In recent years, the number of publicly accessible databases on the Internet has grown dramatically, and all indicators suggest that this growth will continue in the years to come. Unfortunately, retrieving information from these databases is not easy, for several reasons. The first complication is distribution. Not every query can be answered by the data in a single database. Useful relations may be broken into fragments that are distributed among distinct databases. In horizontal fragmentation, the rows of a relation are split across multiple databases. In vertical fragmentation, the columns are split. Distributed databases can exhibit mixtures of these types of fragmentation. A second complication in database integration is heterogeneity, which may be notational or conceptual. Notational heterogeneity concerns access languages and protocols. One source may require SQL while another requires OQL and a third uses an ad hoc notation. This sort of heterogeneity can usually be handled through commercial products (such as the Sybase Openserver). However, even if we assume that all databases use a standard language and protocol, there can still be conceptual heterogeneity, i.e., differences in the relational schema and vocabulary. Distinct databases may use different words to refer to the same concept, and/or they may use the same word to refer to different concepts. Reassembling the distributed fragments of a database in the face of heterogeneity is doubly difficult. Mediation is a technology that inserts intelligent processing modules, called mediators, between servers and clients to provide value-added processing. A number of contractors now have the capability to build the required application interfaces and to implement the architecture. The number of platforms and languages varies, and there is some discussion on style, such as whether fat or thin mediators are preferable. These contractors interact with their customers to acquire domain knowledge.
As more implementations enter practice, the infrastructure grows, and we expect that mediators can be installed rapidly and maintained by their owners. The main goal of this research is to transform the problem of answering queries using views into a semantic query optimization problem, which we call semantic query transformation because it interleaves the query planning and query execution processes. It also aims to show that additional semantic knowledge in the form of integrity constraints can help generate more efficient query plans, suitable for data integration systems over network-bound, autonomous data sources ranging from conventional databases on a LAN or intranet to Web-based sources (both HTML and XML) across the Internet. In doing so, four derived goals were identified: to present a language for modeling the contents of the information sources, to propose algorithms that transform the problem of answering queries using views into a semantic query optimization problem, to extend the algorithms to find the maximally contained query plans in the presence of functional dependencies in the world schema, and to test the completeness and soundness of the algorithms.
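
The two kinds of fragmentation described in the abstract can be illustrated with a minimal sketch. This is not taken from the thesis: the employee relation, the source names (site_a, site_b, salaries), and the helper functions are all hypothetical, chosen only to show that horizontal fragments are reassembled by a union of rows and vertical fragments by a join on a shared key.

```python
# Hypothetical illustration only; names and data are assumptions, not from the thesis.

def union_rows(*fragments):
    """Reassemble a horizontally fragmented relation by unioning its rows."""
    seen = set()
    result = []
    for fragment in fragments:
        for row in fragment:
            key = tuple(sorted(row.items()))  # hashable form of the row
            if key not in seen:               # drop rows repeated across sources
                seen.add(key)
                result.append(row)
    return result


def join_columns(left, right, key):
    """Reassemble a vertically fragmented relation by joining on a shared key."""
    right_by_key = {row[key]: row for row in right}
    return [
        {**row, **right_by_key[row[key]]}
        for row in left
        if row[key] in right_by_key
    ]


# Horizontal fragments: the same columns, different rows, at two sources.
site_a = [{"id": 1, "name": "Ana"}, {"id": 2, "name": "Budi"}]
site_b = [{"id": 3, "name": "Citra"}]

# Vertical fragment: the salary column is held at a third source.
salaries = [
    {"id": 1, "salary": 500},
    {"id": 2, "salary": 650},
    {"id": 3, "salary": 700},
]

names = union_rows(site_a, site_b)
employees = join_columns(names, salaries, "id")
```

The sketch assumes all sources already agree on column names and key values. Conceptual heterogeneity, as the abstract notes, is exactly the case where that assumption fails and a mediator has to translate between vocabularies first.
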
format Theses and Dissertations
NonPeerReviewed
author IBRAHIM, Ismail Khalil
title Semantic query transformation for the intelligent integration of information sources
publisher Universitas Gadjah Mada
publishDate 2001
url https://repository.ugm.ac.id/171815/
http://etd.repository.ugm.ac.id/index.php?mod=penelitian_detail&sub=PenelitianDetail&act=view&typ=html&buku_id=96
_version_ 1681234248434974720