SourceVote: Fusing multi-valued data via inter-source agreements
Data fusion is a fundamental research problem of identifyingtrue values of data items of interest from conflicting multi-sourceddata. Although considerable research efforts have been conducted on thistopic, existing approaches generally assume every data item has exactlyone true value, which fails t...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2017
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/3857 https://ink.library.smu.edu.sg/context/sis_research/article/4859/viewcontent/101007_2F978_3_319_69904_2_13.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-4859 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-48592017-11-30T06:54:45Z SourceVote: Fusing multi-valued data via inter-source agreements FANG, Xiu Susie SHENG, Quan Z. WANG, Xianzhi BARHAMGI, Mahmoud YAO, Lina NGU, Anne H.H. Data fusion is a fundamental research problem of identifyingtrue values of data items of interest from conflicting multi-sourceddata. Although considerable research efforts have been conducted on thistopic, existing approaches generally assume every data item has exactlyone true value, which fails to reflect the real world where data items withmultiple true values widely exist. In this paper, we propose a novel approach,SourceVote, to estimate value veracity for multi-valued data items.SourceVote models the endorsement relations among sources by quantifyingtheir two-sided inter-source agreements. In particular, two graphs areconstructed to model inter-source relations. Then two aspects of sourcereliability are derived from these graphs and are used for estimatingvalue veracity and initializing existing data fusion methods. Empiricalstudies on two large real-world datasets demonstrate the effectiveness ofour approach. 2017-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3857 info:doi/10.1007/978-3-319-69904-2_13 https://ink.library.smu.edu.sg/context/sis_research/article/4859/viewcontent/101007_2F978_3_319_69904_2_13.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Data integration Data fusion Multi-valued data items Inter-source agreements Databases and Information Systems Data Storage Systems |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Data integration Data fusion Multi-valued data items Inter-source agreements Databases and Information Systems Data Storage Systems |
spellingShingle |
Data integration Data fusion Multi-valued data items Inter-source agreements Databases and Information Systems Data Storage Systems FANG, Xiu Susie SHENG, Quan Z. WANG, Xianzhi BARHAMGI, Mahmoud YAO, Lina NGU, Anne H.H. SourceVote: Fusing multi-valued data via inter-source agreements |
description |
Data fusion is a fundamental research problem of identifyingtrue values of data items of interest from conflicting multi-sourceddata. Although considerable research efforts have been conducted on thistopic, existing approaches generally assume every data item has exactlyone true value, which fails to reflect the real world where data items withmultiple true values widely exist. In this paper, we propose a novel approach,SourceVote, to estimate value veracity for multi-valued data items.SourceVote models the endorsement relations among sources by quantifyingtheir two-sided inter-source agreements. In particular, two graphs areconstructed to model inter-source relations. Then two aspects of sourcereliability are derived from these graphs and are used for estimatingvalue veracity and initializing existing data fusion methods. Empiricalstudies on two large real-world datasets demonstrate the effectiveness ofour approach. |
format |
text |
author |
FANG, Xiu Susie SHENG, Quan Z. WANG, Xianzhi BARHAMGI, Mahmoud YAO, Lina NGU, Anne H.H. |
author_facet |
FANG, Xiu Susie SHENG, Quan Z. WANG, Xianzhi BARHAMGI, Mahmoud YAO, Lina NGU, Anne H.H. |
author_sort |
FANG, Xiu Susie |
title |
SourceVote: Fusing multi-valued data via inter-source agreements |
title_short |
SourceVote: Fusing multi-valued data via inter-source agreements |
title_full |
SourceVote: Fusing multi-valued data via inter-source agreements |
title_fullStr |
SourceVote: Fusing multi-valued data via inter-source agreements |
title_full_unstemmed |
SourceVote: Fusing multi-valued data via inter-source agreements |
title_sort |
sourcevote: fusing multi-valued data via inter-source agreements |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2017 |
url |
https://ink.library.smu.edu.sg/sis_research/3857 https://ink.library.smu.edu.sg/context/sis_research/article/4859/viewcontent/101007_2F978_3_319_69904_2_13.pdf |
_version_ |
1770573827424649216 |