SourceVote: Fusing multi-valued data via inter-source agreements

Data fusion is a fundamental research problem of identifying true values of data items of interest from conflicting multi-sourced data. Although considerable research efforts have been conducted on this topic, existing approaches generally assume every data item has exactly one true value, which fai...

Full description

Saved in:
Bibliographic Details
Main Authors: FANG, Xiu Susie, SHENG, Quan Z., WANG, Xianzhi, BARHAMGI, Mahmoud, YAO, Lina, NGU, Anne H.H.
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2017
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/3941
https://ink.library.smu.edu.sg/context/sis_research/article/4943/viewcontent/101007_2F978_3_319_69904_2_13.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-4943
record_format dspace
spelling sg-smu-ink.sis_research-49432018-10-03T08:56:48Z SourceVote: Fusing multi-valued data via inter-source agreements FANG, Xiu Susie SHENG, Quan Z. WANG, Xianzhi BARHAMGI, Mahmoud YAO, Lina NGU, Anne H.H. Data fusion is a fundamental research problem of identifying true values of data items of interest from conflicting multi-sourced data. Although considerable research efforts have been conducted on this topic, existing approaches generally assume every data item has exactly one true value, which fails to reflect the real world where data items with multiple true values widely exist. In this paper, we propose a novel approach,SourceVote, to estimate value veracity for multi-valued data items. SourceVote models the endorsement relations among sources by quantifying their two-sided inter-source agreements. In particular, two graphs are constructed to model inter-source relations. Then two aspects of source reliability are derived from these graphs and are used for estimating value veracity and initializing existing data fusion methods. Empirical studies on two large real-world datasets demonstrate the effectiveness of our approach. 2017-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3941 info:doi/10.1007/978-3-319-69904-2_13 https://ink.library.smu.edu.sg/context/sis_research/article/4943/viewcontent/101007_2F978_3_319_69904_2_13.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Data integration Data fusion Multi-valued data items Inter-source agreements Databases and Information Systems Data Storage Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Data integration
Data fusion
Multi-valued data items
Inter-source agreements
Databases and Information Systems
Data Storage Systems
spellingShingle Data integration
Data fusion
Multi-valued data items
Inter-source agreements
Databases and Information Systems
Data Storage Systems
FANG, Xiu Susie
SHENG, Quan Z.
WANG, Xianzhi
BARHAMGI, Mahmoud
YAO, Lina
NGU, Anne H.H.
SourceVote: Fusing multi-valued data via inter-source agreements
description Data fusion is a fundamental research problem of identifying true values of data items of interest from conflicting multi-sourced data. Although considerable research efforts have been conducted on this topic, existing approaches generally assume every data item has exactly one true value, which fails to reflect the real world where data items with multiple true values widely exist. In this paper, we propose a novel approach,SourceVote, to estimate value veracity for multi-valued data items. SourceVote models the endorsement relations among sources by quantifying their two-sided inter-source agreements. In particular, two graphs are constructed to model inter-source relations. Then two aspects of source reliability are derived from these graphs and are used for estimating value veracity and initializing existing data fusion methods. Empirical studies on two large real-world datasets demonstrate the effectiveness of our approach.
format text
author FANG, Xiu Susie
SHENG, Quan Z.
WANG, Xianzhi
BARHAMGI, Mahmoud
YAO, Lina
NGU, Anne H.H.
author_facet FANG, Xiu Susie
SHENG, Quan Z.
WANG, Xianzhi
BARHAMGI, Mahmoud
YAO, Lina
NGU, Anne H.H.
author_sort FANG, Xiu Susie
title SourceVote: Fusing multi-valued data via inter-source agreements
title_short SourceVote: Fusing multi-valued data via inter-source agreements
title_full SourceVote: Fusing multi-valued data via inter-source agreements
title_fullStr SourceVote: Fusing multi-valued data via inter-source agreements
title_full_unstemmed SourceVote: Fusing multi-valued data via inter-source agreements
title_sort sourcevote: fusing multi-valued data via inter-source agreements
publisher Institutional Knowledge at Singapore Management University
publishDate 2017
url https://ink.library.smu.edu.sg/sis_research/3941
https://ink.library.smu.edu.sg/context/sis_research/article/4943/viewcontent/101007_2F978_3_319_69904_2_13.pdf
_version_ 1770574021519212544