Parallel streaming frequency-based aggregates

We present efficient parallel streaming algorithms for fundamental frequency-based aggregates in both the sliding window and the infinite window settings. In the sliding window setting, we give a parallel algorithm for maintaining a space-bounded block counter (SBBC). Using SBBC, we derive algorithm...

全面介紹

Saved in:
書目詳細資料
Main Authors: Kanat Tangwongsan, Srikanta Tirthapura, Kun Lung Wu
其他作者: Mahidol University
格式: Conference or Workshop Item
出版: 2018
主題:
在線閱讀:https://repository.li.mahidol.ac.th/handle/123456789/33760
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Mahidol University
id th-mahidol.33760
record_format dspace
spelling th-mahidol.337602018-11-09T09:31:30Z Parallel streaming frequency-based aggregates Kanat Tangwongsan Srikanta Tirthapura Kun Lung Wu Mahidol University Iowa State University IBM Thomas J. Watson Research Center Computer Science Mathematics We present efficient parallel streaming algorithms for fundamental frequency-based aggregates in both the sliding window and the infinite window settings. In the sliding window setting, we give a parallel algorithm for maintaining a space-bounded block counter (SBBC). Using SBBC, we derive algorithms for basic counting, frequency estimation, and heavy hitters that perform no more work than their best sequential counterparts. In the infinite window setting, we present algorithms for frequency estimation, heavy hitters, and count-min sketch. For both the infinite window and sliding window settings, our parallel algorithms process a "minibatch" of items using linear work and polylog parallel depth. We also prove a lower bound showing that the work of the parallel algorithm is optimal in the case of heavy hitters and frequency estimation. To our knowledge, these are the first parallel algorithms for these problems that are provably work efficient and have low depth. 2018-11-09T02:11:43Z 2018-11-09T02:11:43Z 2014-01-01 Conference Paper Annual ACM Symposium on Parallelism in Algorithms and Architectures. (2014), 236-245 10.1145/2612669.2612695 2-s2.0-84904514402 https://repository.li.mahidol.ac.th/handle/123456789/33760 Mahidol University SCOPUS https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84904514402&origin=inward
institution Mahidol University
building Mahidol University Library
continent Asia
country Thailand
Thailand
content_provider Mahidol University Library
collection Mahidol University Institutional Repository
topic Computer Science
Mathematics
spellingShingle Computer Science
Mathematics
Kanat Tangwongsan
Srikanta Tirthapura
Kun Lung Wu
Parallel streaming frequency-based aggregates
description We present efficient parallel streaming algorithms for fundamental frequency-based aggregates in both the sliding window and the infinite window settings. In the sliding window setting, we give a parallel algorithm for maintaining a space-bounded block counter (SBBC). Using SBBC, we derive algorithms for basic counting, frequency estimation, and heavy hitters that perform no more work than their best sequential counterparts. In the infinite window setting, we present algorithms for frequency estimation, heavy hitters, and count-min sketch. For both the infinite window and sliding window settings, our parallel algorithms process a "minibatch" of items using linear work and polylog parallel depth. We also prove a lower bound showing that the work of the parallel algorithm is optimal in the case of heavy hitters and frequency estimation. To our knowledge, these are the first parallel algorithms for these problems that are provably work efficient and have low depth.
author2 Mahidol University
author_facet Mahidol University
Kanat Tangwongsan
Srikanta Tirthapura
Kun Lung Wu
format Conference or Workshop Item
author Kanat Tangwongsan
Srikanta Tirthapura
Kun Lung Wu
author_sort Kanat Tangwongsan
title Parallel streaming frequency-based aggregates
title_short Parallel streaming frequency-based aggregates
title_full Parallel streaming frequency-based aggregates
title_fullStr Parallel streaming frequency-based aggregates
title_full_unstemmed Parallel streaming frequency-based aggregates
title_sort parallel streaming frequency-based aggregates
publishDate 2018
url https://repository.li.mahidol.ac.th/handle/123456789/33760
_version_ 1763489889520713728