Parallel streaming frequency-based aggregates
We present efficient parallel streaming algorithms for fundamental frequency-based aggregates in both the sliding window and the infinite window settings. In the sliding window setting, we give a parallel algorithm for maintaining a space-bounded block counter (SBBC). Using SBBC, we derive algorithm...
Saved in:
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Published: |
2018
|
Subjects: | |
Online Access: | https://repository.li.mahidol.ac.th/handle/123456789/33760 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Mahidol University |
id |
th-mahidol.33760 |
---|---|
record_format |
dspace |
spelling |
th-mahidol.337602018-11-09T09:31:30Z Parallel streaming frequency-based aggregates Kanat Tangwongsan Srikanta Tirthapura Kun Lung Wu Mahidol University Iowa State University IBM Thomas J. Watson Research Center Computer Science Mathematics We present efficient parallel streaming algorithms for fundamental frequency-based aggregates in both the sliding window and the infinite window settings. In the sliding window setting, we give a parallel algorithm for maintaining a space-bounded block counter (SBBC). Using SBBC, we derive algorithms for basic counting, frequency estimation, and heavy hitters that perform no more work than their best sequential counterparts. In the infinite window setting, we present algorithms for frequency estimation, heavy hitters, and count-min sketch. For both the infinite window and sliding window settings, our parallel algorithms process a "minibatch" of items using linear work and polylog parallel depth. We also prove a lower bound showing that the work of the parallel algorithm is optimal in the case of heavy hitters and frequency estimation. To our knowledge, these are the first parallel algorithms for these problems that are provably work efficient and have low depth. 2018-11-09T02:11:43Z 2018-11-09T02:11:43Z 2014-01-01 Conference Paper Annual ACM Symposium on Parallelism in Algorithms and Architectures. (2014), 236-245 10.1145/2612669.2612695 2-s2.0-84904514402 https://repository.li.mahidol.ac.th/handle/123456789/33760 Mahidol University SCOPUS https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84904514402&origin=inward |
institution |
Mahidol University |
building |
Mahidol University Library |
continent |
Asia |
country |
Thailand Thailand |
content_provider |
Mahidol University Library |
collection |
Mahidol University Institutional Repository |
topic |
Computer Science Mathematics |
spellingShingle |
Computer Science Mathematics Kanat Tangwongsan Srikanta Tirthapura Kun Lung Wu Parallel streaming frequency-based aggregates |
description |
We present efficient parallel streaming algorithms for fundamental frequency-based aggregates in both the sliding window and the infinite window settings. In the sliding window setting, we give a parallel algorithm for maintaining a space-bounded block counter (SBBC). Using SBBC, we derive algorithms for basic counting, frequency estimation, and heavy hitters that perform no more work than their best sequential counterparts. In the infinite window setting, we present algorithms for frequency estimation, heavy hitters, and count-min sketch. For both the infinite window and sliding window settings, our parallel algorithms process a "minibatch" of items using linear work and polylog parallel depth. We also prove a lower bound showing that the work of the parallel algorithm is optimal in the case of heavy hitters and frequency estimation. To our knowledge, these are the first parallel algorithms for these problems that are provably work efficient and have low depth. |
author2 |
Mahidol University |
author_facet |
Mahidol University Kanat Tangwongsan Srikanta Tirthapura Kun Lung Wu |
format |
Conference or Workshop Item |
author |
Kanat Tangwongsan Srikanta Tirthapura Kun Lung Wu |
author_sort |
Kanat Tangwongsan |
title |
Parallel streaming frequency-based aggregates |
title_short |
Parallel streaming frequency-based aggregates |
title_full |
Parallel streaming frequency-based aggregates |
title_fullStr |
Parallel streaming frequency-based aggregates |
title_full_unstemmed |
Parallel streaming frequency-based aggregates |
title_sort |
parallel streaming frequency-based aggregates |
publishDate |
2018 |
url |
https://repository.li.mahidol.ac.th/handle/123456789/33760 |
_version_ |
1763489889520713728 |