Truly Perfect Samplers for Data Streams and Sliding Windows [article]

Rajesh Jayaram, David P. Woodruff, Samson Zhou
<span title="2021-08-26">2021</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
In the G-sampling problem, the goal is to output an index i of a vector f ∈ℝ^n, such that for all coordinates j ∈ [n], Pr[i=j] = (1 ±ϵ) G(f_j)/∑_k∈[n] G(f_k) + γ, where G:ℝ→ℝ_≥ 0 is some non-negative function. If ϵ = 0 and γ = 1/poly(n), the sampler is called perfect. In the data stream model, f is defined implicitly by a sequence of updates to its coordinates, and the goal is to design such a sampler in small space. Jayaram and Woodruff (FOCS 2018) gave the first perfect L_p samplers in
more &raquo; ... le streams, where G(x)=|x|^p, using polylog(n) space for p∈(0,2]. However, to date all known sampling algorithms are not truly perfect, since their output distribution is only point-wise γ = 1/poly(n) close to the true distribution. This small error can be significant when samplers are run many times on successive portions of a stream, and leak potentially sensitive information about the data stream. In this work, we initiate the study of truly perfect samplers, with ϵ = γ = 0, and comprehensively investigate their complexity in the data stream and sliding window models. Abstract truncated due to arXiv limits; please see paper for full abstract.
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2108.12017v1">arXiv:2108.12017v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/urlnlecdwfam5il6mdac6b7ywa">fatcat:urlnlecdwfam5il6mdac6b7ywa</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210906052116/https://arxiv.org/pdf/2108.12017v1.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/3a/b1/3ab1d9738411fc188494a98b345cb169d5bc893c.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2108.12017v1" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>