Entity identification in database integration

The objective of entity identification is to determine the correspondence between object instances from more than one database. This paper examines the problem at the instance level assuming that schema level heterogeneity has been resolved a priori. Soundness and completeness are defined as the...

Bibliographic Details
Main Authors: LIM, Ee Peng; SRIVASTAVA, Jaideep; PRABHAKAR, Satya; RICHARDSON, James
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 1996
Subjects: Databases and Information Systems; Numerical Analysis and Scientific Computing
Online Access: https://ink.library.smu.edu.sg/sis_research/24
https://ink.library.smu.edu.sg/context/sis_research/article/1023/viewcontent/1_s2.0_0020025595001859_main.pdf
Institution: Singapore Management University
id sg-smu-ink.sis_research-1023
record_format dspace
spelling sg-smu-ink.sis_research-1023
2018-06-20T04:30:19Z
Entity identification in database integration
LIM, Ee Peng
SRIVASTAVA, Jaideep
PRABHAKAR, Satya
RICHARDSON, James
The objective of entity identification is to determine the correspondence between object instances from more than one database. This paper examines the problem at the instance level assuming that schema level heterogeneity has been resolved a priori. Soundness and completeness are defined as the desired properties of any entity-identification technique. To achieve soundness, a set of identity and distinctness rules have to be established for the entities in the integrated world. We then propose the use of extended key, which is the union of keys (and possibly other attributes) from the relations to be matched, and its corresponding identity rule to determine the equivalence between tuples from relations that may not share any common key. Instance level functional dependencies (ILFD), a form of semantic constraint information about the real-world entities, are used to derive the missing extended key attribute values of a tuple. Formal properties of ILFDs are derived. Results from a Prolog-based prototype entity-identification system are presented.
1996-02-01T08:00:00Z
text
application/pdf
https://ink.library.smu.edu.sg/sis_research/24
info:doi/10.1016/0020-0255(95)00185-9
https://ink.library.smu.edu.sg/context/sis_research/article/1023/viewcontent/1_s2.0_0020025595001859_main.pdf
http://creativecommons.org/licenses/by-nc-nd/4.0/
Research Collection School Of Computing and Information Systems
eng
Institutional Knowledge at Singapore Management University
Databases and Information Systems
Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Numerical Analysis and Scientific Computing
description The objective of entity identification is to determine the correspondence between object instances from more than one database. This paper examines the problem at the instance level assuming that schema level heterogeneity has been resolved a priori. Soundness and completeness are defined as the desired properties of any entity-identification technique. To achieve soundness, a set of identity and distinctness rules have to be established for the entities in the integrated world. We then propose the use of extended key, which is the union of keys (and possibly other attributes) from the relations to be matched, and its corresponding identity rule to determine the equivalence between tuples from relations that may not share any common key. Instance level functional dependencies (ILFD), a form of semantic constraint information about the real-world entities, are used to derive the missing extended key attribute values of a tuple. Formal properties of ILFDs are derived. Results from a Prolog-based prototype entity-identification system are presented.
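The extended-key idea in the description can be illustrated with a small sketch. This is not the authors' Prolog prototype: it is a Python approximation under stated assumptions. The relations `R` and `S`, the attribute names, the rule list `ILFDS`, and the helpers `derive`, `same_entity`, and `match` are all hypothetical. It assumes `R` is keyed on (name, cuisine) and `S` on (name, speciality), takes the extended key to be the union of those keys, and treats an ILFD as "if a tuple has these attribute values, then it has this value for another attribute". Distinctness rules and the formal properties of ILFDs are not modelled.

```python
# Hypothetical relations with no common key.
R = [
    {"name": "Golden Wok", "cuisine": "Chinese", "street": "Main St"},
    {"name": "Golden Wok", "cuisine": "Thai", "street": "Oak Ave"},
]
S = [
    {"name": "Golden Wok", "speciality": "Hunan", "county": "Ramsey"},
]

# Extended key: union of the two (assumed) keys.
EXTENDED_KEY = ("name", "cuisine", "speciality")

# Hypothetical ILFDs: (conditions, (derived attribute, derived value)).
ILFDS = [
    ({"speciality": "Hunan"}, ("cuisine", "Chinese")),
    ({"name": "Golden Wok", "street": "Main St"}, ("speciality", "Hunan")),
]


def derive(row, ilfds):
    """Fill in missing attribute values by applying ILFDs until nothing changes."""
    out = dict(row)
    changed = True
    while changed:
        changed = False
        for conditions, (attr, value) in ilfds:
            missing = out.get(attr) is None
            holds = all(out.get(a) == v for a, v in conditions.items())
            if missing and holds:
                out[attr] = value
                changed = True
    return out


def same_entity(r, s, key):
    """Identity rule: both tuples have known, equal values on every key attribute."""
    return all(r.get(a) is not None and r.get(a) == s.get(a) for a in key)


def match(r_rows, s_rows, key, ilfds):
    """Return index pairs (i, j) where r_rows[i] and s_rows[j] are judged identical."""
    r_ext = [derive(r, ilfds) for r in r_rows]
    s_ext = [derive(s, ilfds) for s in s_rows]
    return [
        (i, j)
        for i, r in enumerate(r_ext)
        for j, s in enumerate(s_ext)
        if same_entity(r, s, key)
    ]


pairs = match(R, S, EXTENDED_KEY, ILFDS)
print(pairs)
```

In this sketch a pair is reported only when every extended-key value is known and equal on both sides, so a tuple whose key values cannot be derived is left unmatched rather than guessed. That leans toward soundness at the cost of completeness.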
format text
author LIM, Ee Peng
SRIVASTAVA, Jaideep
PRABHAKAR, Satya
RICHARDSON, James
title Entity identification in database integration
publisher Institutional Knowledge at Singapore Management University
publishDate 1996
_version_ 1770568852921384960