Top-k Dominating Queries on Incomplete Data
The top-k dominating (TKD) query returns the k objects that dominate the maximum number of objects in a given dataset. It combines the advantages of skyline and top-k queries, and plays an important role in many decision support applications. Incomplete data exists in a wide spectrum of real dataset...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2016
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/2894 https://ink.library.smu.edu.sg/context/sis_research/article/3894/viewcontent/ZhengBH_2016_TopkDominatingQueriesIncompleteData.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-3894 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-38942016-09-26T12:17:00Z Top-k Dominating Queries on Incomplete Data MIAO, Xiaoye GAO, Yunjun ZHENG, Baihua CHEN, Gang CUI, Huiyong The top-k dominating (TKD) query returns the k objects that dominate the maximum number of objects in a given dataset. It combines the advantages of skyline and top-k queries, and plays an important role in many decision support applications. Incomplete data exists in a wide spectrum of real datasets, due to device failure, privacy preservation, data loss, and so on. In this paper, for the first time, we carry out a systematic study of TKD queries on incomplete data, which involves the data having some missing dimensional value(s). We formalize this problem, and propose a suite of efficient algorithms for answering TKD queries over incomplete data. Our methods employ some novel techniques, such as upper bound score pruning, bitmap pruning, and partial score pruning, to boost query efficiency. Extensive experimental evaluation using both real and synthetic datasets demonstrates the effectiveness of our developed pruning heuristics and the performance of our presented algorithms. 2016-01-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2894 info:doi/10.1109/TKDE.2015.2460742 https://ink.library.smu.edu.sg/context/sis_research/article/3894/viewcontent/ZhengBH_2016_TopkDominatingQueriesIncompleteData.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Top-k dominating query Incomplete data Query processing Dominance relationship Algorithm Computer Sciences Databases and Information Systems Numerical Analysis and Scientific Computing |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Top-k dominating query Incomplete data Query processing Dominance relationship Algorithm Computer Sciences Databases and Information Systems Numerical Analysis and Scientific Computing |
spellingShingle |
Top-k dominating query Incomplete data Query processing Dominance relationship Algorithm Computer Sciences Databases and Information Systems Numerical Analysis and Scientific Computing MIAO, Xiaoye GAO, Yunjun ZHENG, Baihua CHEN, Gang CUI, Huiyong Top-k Dominating Queries on Incomplete Data |
description |
The top-k dominating (TKD) query returns the k objects that dominate the maximum number of objects in a given dataset. It combines the advantages of skyline and top-k queries, and plays an important role in many decision support applications. Incomplete data exists in a wide spectrum of real datasets, due to device failure, privacy preservation, data loss, and so on. In this paper, for the first time, we carry out a systematic study of TKD queries on incomplete data, which involves the data having some missing dimensional value(s). We formalize this problem, and propose a suite of efficient algorithms for answering TKD queries over incomplete data. Our methods employ some novel techniques, such as upper bound score pruning, bitmap pruning, and partial score pruning, to boost query efficiency. Extensive experimental evaluation using both real and synthetic datasets demonstrates the effectiveness of our developed pruning heuristics and the performance of our presented algorithms. |
format |
text |
author |
MIAO, Xiaoye GAO, Yunjun ZHENG, Baihua CHEN, Gang CUI, Huiyong |
author_facet |
MIAO, Xiaoye GAO, Yunjun ZHENG, Baihua CHEN, Gang CUI, Huiyong |
author_sort |
MIAO, Xiaoye |
title |
Top-k Dominating Queries on Incomplete Data |
title_short |
Top-k Dominating Queries on Incomplete Data |
title_full |
Top-k Dominating Queries on Incomplete Data |
title_fullStr |
Top-k Dominating Queries on Incomplete Data |
title_full_unstemmed |
Top-k Dominating Queries on Incomplete Data |
title_sort |
top-k dominating queries on incomplete data |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2016 |
url |
https://ink.library.smu.edu.sg/sis_research/2894 https://ink.library.smu.edu.sg/context/sis_research/article/3894/viewcontent/ZhengBH_2016_TopkDominatingQueriesIncompleteData.pdf |
_version_ |
1770572666559791104 |