Coresets for minimum enclosing balls over sliding windows
Coresets are important tools to generate concise summaries of massive datasets for approximate analysis. A coreset is a small subset of points extracted from the original point set such that certain geometric properties are preserved with provable guarantees. This paper investigates the problem of m...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2019
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/4618 https://ink.library.smu.edu.sg/context/sis_research/article/5621/viewcontent/1905.03718.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-5621 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-56212020-01-02T08:51:47Z Coresets for minimum enclosing balls over sliding windows WANG, Yanhao LI, Yuchen TAN, Kian-Lee Coresets are important tools to generate concise summaries of massive datasets for approximate analysis. A coreset is a small subset of points extracted from the original point set such that certain geometric properties are preserved with provable guarantees. This paper investigates the problem of maintaining a coreset to preserve the minimum enclosing ball (MEB) for a sliding window of points that are continuously updated in a data stream. Although the problem has been extensively studied in batch and append-only streaming settings, no efficient sliding-window solution is available yet. In this work, we first introduce an algorithm, called AOMEB, to build a coreset for MEB in an append-only stream. AOMEB improves the practical performance of the state-of-the-art algorithm while having the same approximation ratio. Furthermore, using AOMEB as a building block, we propose two novel algorithms, namely SWMEB and SWMEB+, to maintain coresets for MEB over the sliding window with constant approximation ratios. The proposed algorithms also support coresets for MEB in a reproducing kernel Hilbert space (RKHS). Finally, extensive experiments on real-world and synthetic datasets demonstrate that SWMEB and SWMEB+ achieve speedups of up to four orders of magnitude over the state-of-the-art batch algorithm while providing coresets for MEB with rather small errors compared to the optimal ones. 2019-08-08T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4618 info:doi/10.1145/3292500.3330826 https://ink.library.smu.edu.sg/context/sis_research/article/5621/viewcontent/1905.03718.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Computer Engineering |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Computer Engineering |
spellingShingle |
Computer Engineering WANG, Yanhao LI, Yuchen TAN, Kian-Lee Coresets for minimum enclosing balls over sliding windows |
description |
Coresets are important tools to generate concise summaries of massive datasets for approximate analysis. A coreset is a small subset of points extracted from the original point set such that certain geometric properties are preserved with provable guarantees. This paper investigates the problem of maintaining a coreset to preserve the minimum enclosing ball (MEB) for a sliding window of points that are continuously updated in a data stream. Although the problem has been extensively studied in batch and append-only streaming settings, no efficient sliding-window solution is available yet. In this work, we first introduce an algorithm, called AOMEB, to build a coreset for MEB in an append-only stream. AOMEB improves the practical performance of the state-of-the-art algorithm while having the same approximation ratio. Furthermore, using AOMEB as a building block, we propose two novel algorithms, namely SWMEB and SWMEB+, to maintain coresets for MEB over the sliding window with constant approximation ratios. The proposed algorithms also support coresets for MEB in a reproducing kernel Hilbert space (RKHS). Finally, extensive experiments on real-world and synthetic datasets demonstrate that SWMEB and SWMEB+ achieve speedups of up to four orders of magnitude over the state-of-the-art batch algorithm while providing coresets for MEB with rather small errors compared to the optimal ones. |
format |
text |
author |
WANG, Yanhao LI, Yuchen TAN, Kian-Lee |
author_facet |
WANG, Yanhao LI, Yuchen TAN, Kian-Lee |
author_sort |
WANG, Yanhao |
title |
Coresets for minimum enclosing balls over sliding windows |
title_short |
Coresets for minimum enclosing balls over sliding windows |
title_full |
Coresets for minimum enclosing balls over sliding windows |
title_fullStr |
Coresets for minimum enclosing balls over sliding windows |
title_full_unstemmed |
Coresets for minimum enclosing balls over sliding windows |
title_sort |
coresets for minimum enclosing balls over sliding windows |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2019 |
url |
https://ink.library.smu.edu.sg/sis_research/4618 https://ink.library.smu.edu.sg/context/sis_research/article/5621/viewcontent/1905.03718.pdf |
_version_ |
1770574942108123136 |