Parallel streaming frequency-based aggregates

We present efficient parallel streaming algorithms for fundamental frequency-based aggregates in both the sliding window and the infinite window settings. In the sliding window setting, we give a parallel algorithm for maintaining a space-bounded block counter (SBBC). Using SBBC, we derive algorithm...

Full description

Saved in:
Bibliographic Details
Main Authors: Kanat Tangwongsan, Srikanta Tirthapura, Kun Lung Wu
Other Authors: Mahidol University
Format: Conference or Workshop Item
Published: 2018
Subjects:
Online Access:https://repository.li.mahidol.ac.th/handle/123456789/33760
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Mahidol University
id th-mahidol.33760
record_format dspace
spelling th-mahidol.337602018-11-09T09:31:30Z Parallel streaming frequency-based aggregates Kanat Tangwongsan Srikanta Tirthapura Kun Lung Wu Mahidol University Iowa State University IBM Thomas J. Watson Research Center Computer Science Mathematics We present efficient parallel streaming algorithms for fundamental frequency-based aggregates in both the sliding window and the infinite window settings. In the sliding window setting, we give a parallel algorithm for maintaining a space-bounded block counter (SBBC). Using SBBC, we derive algorithms for basic counting, frequency estimation, and heavy hitters that perform no more work than their best sequential counterparts. In the infinite window setting, we present algorithms for frequency estimation, heavy hitters, and count-min sketch. For both the infinite window and sliding window settings, our parallel algorithms process a "minibatch" of items using linear work and polylog parallel depth. We also prove a lower bound showing that the work of the parallel algorithm is optimal in the case of heavy hitters and frequency estimation. To our knowledge, these are the first parallel algorithms for these problems that are provably work efficient and have low depth. 2018-11-09T02:11:43Z 2018-11-09T02:11:43Z 2014-01-01 Conference Paper Annual ACM Symposium on Parallelism in Algorithms and Architectures. (2014), 236-245 10.1145/2612669.2612695 2-s2.0-84904514402 https://repository.li.mahidol.ac.th/handle/123456789/33760 Mahidol University SCOPUS https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84904514402&origin=inward
institution Mahidol University
building Mahidol University Library
continent Asia
country Thailand
Thailand
content_provider Mahidol University Library
collection Mahidol University Institutional Repository
topic Computer Science
Mathematics
spellingShingle Computer Science
Mathematics
Kanat Tangwongsan
Srikanta Tirthapura
Kun Lung Wu
Parallel streaming frequency-based aggregates
description We present efficient parallel streaming algorithms for fundamental frequency-based aggregates in both the sliding window and the infinite window settings. In the sliding window setting, we give a parallel algorithm for maintaining a space-bounded block counter (SBBC). Using SBBC, we derive algorithms for basic counting, frequency estimation, and heavy hitters that perform no more work than their best sequential counterparts. In the infinite window setting, we present algorithms for frequency estimation, heavy hitters, and count-min sketch. For both the infinite window and sliding window settings, our parallel algorithms process a "minibatch" of items using linear work and polylog parallel depth. We also prove a lower bound showing that the work of the parallel algorithm is optimal in the case of heavy hitters and frequency estimation. To our knowledge, these are the first parallel algorithms for these problems that are provably work efficient and have low depth.
author2 Mahidol University
author_facet Mahidol University
Kanat Tangwongsan
Srikanta Tirthapura
Kun Lung Wu
format Conference or Workshop Item
author Kanat Tangwongsan
Srikanta Tirthapura
Kun Lung Wu
author_sort Kanat Tangwongsan
title Parallel streaming frequency-based aggregates
title_short Parallel streaming frequency-based aggregates
title_full Parallel streaming frequency-based aggregates
title_fullStr Parallel streaming frequency-based aggregates
title_full_unstemmed Parallel streaming frequency-based aggregates
title_sort parallel streaming frequency-based aggregates
publishDate 2018
url https://repository.li.mahidol.ac.th/handle/123456789/33760
_version_ 1763489889520713728