On low-risk heavy hitters and sparse recovery schemes

We study the heavy hitters and related sparse recovery problems in the low failure probability regime. This regime is not well-understood, and the main previous work on this is by Gilbert et al. (ICALP'13). We recognize an error in their analysis, improve their results, and contribute new spars...

Full description

Saved in:
Bibliographic Details
Main Authors: Li, Yi, Nakos, Vasileios, Woodruff, David P.
Other Authors: School of Physical and Mathematical Sciences
Format: Article
Language:English
Published: 2018
Subjects:
Online Access:https://hdl.handle.net/10356/89386
http://hdl.handle.net/10220/46212
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-89386
record_format dspace
spelling sg-ntu-dr.10356-893862023-02-28T19:36:21Z On low-risk heavy hitters and sparse recovery schemes Li, Yi Nakos, Vasileios Woodruff, David P. School of Physical and Mathematical Sciences Heavy Hitters DRNTU::Science::Mathematics Sparse Recovery We study the heavy hitters and related sparse recovery problems in the low failure probability regime. This regime is not well-understood, and the main previous work on this is by Gilbert et al. (ICALP'13). We recognize an error in their analysis, improve their results, and contribute new sparse recovery algorithms, as well as provide upper and lower bounds for the heavy hitters problem with low failure probability. Our results are summarized as follows: 1) (Heavy Hitters) We study three natural variants for finding heavy hitters in the strict turnstile model, where the variant depends on the quality of the desired output. For the weakest variant, we give a randomized algorithm improving the failure probability analysis of the ubiquitous Count-Min data structure. We also give a new lower bound for deterministic schemes, resolving a question about this variant posed in Question 4 in the IITK Workshop on Algorithms for Data Streams (2006). Under the strongest and well-studied l_{infty}/ l_2 variant, we show that the classical Count-Sketch data structure is optimal for very low failure probabilities, which was previously unknown. 2) (Sparse Recovery Algorithms) For non-adaptive sparse-recovery, we give sublinear-time algorithms with low-failure probability, which improve upon Gilbert et al. (ICALP'13). In the adaptive case, we improve the failure probability from a constant by Indyk et al. (FOCS '11) to e^{-k^{0.99}}, where k is the sparsity parameter. 3) (Optimal Average-Case Sparse Recovery Bounds) We give matching upper and lower bounds in all parameters, including the failure probability, for the measurement complexity of the l_2/l_2 sparse recovery problem in the spiked-covariance model, completely settling its complexity in this model. Published version 2018-10-03T07:30:34Z 2019-12-06T17:24:21Z 2018-10-03T07:30:34Z 2019-12-06T17:24:21Z 2018 Journal Article Li, Y., Nakos, V., & Woodruff, D. P. (2018). On low-risk heavy hitters and sparse recovery schemes. Leibniz International Proceedings in Informatics, 116, 19-. doi:10.4230/LIPIcs.APPROX-RANDOM.2018.19 https://hdl.handle.net/10356/89386 http://hdl.handle.net/10220/46212 10.4230/LIPIcs.APPROX-RANDOM.2018.19 en Leibniz International Proceedings in Informatics © 2018 Yi Li, Vasileios Nakos, and David P. Woodruff; licensed under Creative Commons License CC-BY. 13 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Heavy Hitters
DRNTU::Science::Mathematics
Sparse Recovery
spellingShingle Heavy Hitters
DRNTU::Science::Mathematics
Sparse Recovery
Li, Yi
Nakos, Vasileios
Woodruff, David P.
On low-risk heavy hitters and sparse recovery schemes
description We study the heavy hitters and related sparse recovery problems in the low failure probability regime. This regime is not well-understood, and the main previous work on this is by Gilbert et al. (ICALP'13). We recognize an error in their analysis, improve their results, and contribute new sparse recovery algorithms, as well as provide upper and lower bounds for the heavy hitters problem with low failure probability. Our results are summarized as follows: 1) (Heavy Hitters) We study three natural variants for finding heavy hitters in the strict turnstile model, where the variant depends on the quality of the desired output. For the weakest variant, we give a randomized algorithm improving the failure probability analysis of the ubiquitous Count-Min data structure. We also give a new lower bound for deterministic schemes, resolving a question about this variant posed in Question 4 in the IITK Workshop on Algorithms for Data Streams (2006). Under the strongest and well-studied l_{infty}/ l_2 variant, we show that the classical Count-Sketch data structure is optimal for very low failure probabilities, which was previously unknown. 2) (Sparse Recovery Algorithms) For non-adaptive sparse-recovery, we give sublinear-time algorithms with low-failure probability, which improve upon Gilbert et al. (ICALP'13). In the adaptive case, we improve the failure probability from a constant by Indyk et al. (FOCS '11) to e^{-k^{0.99}}, where k is the sparsity parameter. 3) (Optimal Average-Case Sparse Recovery Bounds) We give matching upper and lower bounds in all parameters, including the failure probability, for the measurement complexity of the l_2/l_2 sparse recovery problem in the spiked-covariance model, completely settling its complexity in this model.
author2 School of Physical and Mathematical Sciences
author_facet School of Physical and Mathematical Sciences
Li, Yi
Nakos, Vasileios
Woodruff, David P.
format Article
author Li, Yi
Nakos, Vasileios
Woodruff, David P.
author_sort Li, Yi
title On low-risk heavy hitters and sparse recovery schemes
title_short On low-risk heavy hitters and sparse recovery schemes
title_full On low-risk heavy hitters and sparse recovery schemes
title_fullStr On low-risk heavy hitters and sparse recovery schemes
title_full_unstemmed On low-risk heavy hitters and sparse recovery schemes
title_sort on low-risk heavy hitters and sparse recovery schemes
publishDate 2018
url https://hdl.handle.net/10356/89386
http://hdl.handle.net/10220/46212
_version_ 1759855336806678528