Top-k Dominating Queries on Incomplete Data

The top-k dominating (TKD) query returns the k objects that dominate the maximum number of objects in a given dataset. It combines the advantages of skyline and top-k queries, and plays an important role in many decision support applications. Incomplete data exists in a wide spectrum of real dataset...

Full description

Saved in:
Bibliographic Details
Main Authors: MIAO, Xiaoye, GAO, Yunjun, ZHENG, Baihua, CHEN, Gang, CUI, Huiyong
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2016
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/2894
https://ink.library.smu.edu.sg/context/sis_research/article/3894/viewcontent/ZhengBH_2016_TopkDominatingQueriesIncompleteData.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-3894
record_format dspace
spelling sg-smu-ink.sis_research-38942016-09-26T12:17:00Z Top-k Dominating Queries on Incomplete Data MIAO, Xiaoye GAO, Yunjun ZHENG, Baihua CHEN, Gang CUI, Huiyong The top-k dominating (TKD) query returns the k objects that dominate the maximum number of objects in a given dataset. It combines the advantages of skyline and top-k queries, and plays an important role in many decision support applications. Incomplete data exists in a wide spectrum of real datasets, due to device failure, privacy preservation, data loss, and so on. In this paper, for the first time, we carry out a systematic study of TKD queries on incomplete data, which involves the data having some missing dimensional value(s). We formalize this problem, and propose a suite of efficient algorithms for answering TKD queries over incomplete data. Our methods employ some novel techniques, such as upper bound score pruning, bitmap pruning, and partial score pruning, to boost query efficiency. Extensive experimental evaluation using both real and synthetic datasets demonstrates the effectiveness of our developed pruning heuristics and the performance of our presented algorithms. 2016-01-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2894 info:doi/10.1109/TKDE.2015.2460742 https://ink.library.smu.edu.sg/context/sis_research/article/3894/viewcontent/ZhengBH_2016_TopkDominatingQueriesIncompleteData.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Top-k dominating query Incomplete data Query processing Dominance relationship Algorithm Computer Sciences Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Top-k dominating query
Incomplete data
Query processing
Dominance relationship
Algorithm
Computer Sciences
Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle Top-k dominating query
Incomplete data
Query processing
Dominance relationship
Algorithm
Computer Sciences
Databases and Information Systems
Numerical Analysis and Scientific Computing
MIAO, Xiaoye
GAO, Yunjun
ZHENG, Baihua
CHEN, Gang
CUI, Huiyong
Top-k Dominating Queries on Incomplete Data
description The top-k dominating (TKD) query returns the k objects that dominate the maximum number of objects in a given dataset. It combines the advantages of skyline and top-k queries, and plays an important role in many decision support applications. Incomplete data exists in a wide spectrum of real datasets, due to device failure, privacy preservation, data loss, and so on. In this paper, for the first time, we carry out a systematic study of TKD queries on incomplete data, which involves the data having some missing dimensional value(s). We formalize this problem, and propose a suite of efficient algorithms for answering TKD queries over incomplete data. Our methods employ some novel techniques, such as upper bound score pruning, bitmap pruning, and partial score pruning, to boost query efficiency. Extensive experimental evaluation using both real and synthetic datasets demonstrates the effectiveness of our developed pruning heuristics and the performance of our presented algorithms.
format text
author MIAO, Xiaoye
GAO, Yunjun
ZHENG, Baihua
CHEN, Gang
CUI, Huiyong
author_facet MIAO, Xiaoye
GAO, Yunjun
ZHENG, Baihua
CHEN, Gang
CUI, Huiyong
author_sort MIAO, Xiaoye
title Top-k Dominating Queries on Incomplete Data
title_short Top-k Dominating Queries on Incomplete Data
title_full Top-k Dominating Queries on Incomplete Data
title_fullStr Top-k Dominating Queries on Incomplete Data
title_full_unstemmed Top-k Dominating Queries on Incomplete Data
title_sort top-k dominating queries on incomplete data
publisher Institutional Knowledge at Singapore Management University
publishDate 2016
url https://ink.library.smu.edu.sg/sis_research/2894
https://ink.library.smu.edu.sg/context/sis_research/article/3894/viewcontent/ZhengBH_2016_TopkDominatingQueriesIncompleteData.pdf
_version_ 1770572666559791104