Nearly Optimal Distinct Elements and Heavy Hitters on Sliding Windows

APPROX-RANDOM(2018)

引用 23|浏览80
暂无评分
摘要
We study the distinct elements and ℓ_p-heavy hitters problems in the sliding window model, where only the most recent n elements in the data stream form the underlying set. We first introduce the composable histogram, a simple twist on the exponential (Datar et al., SODA 2002) and smooth histograms (Braverman and Ostrovsky, FOCS 2007) that may be of independent interest. We then show that the composable histogram along with a careful combination of existing techniques to track either the identity or frequency of a few specific items suffices to obtain algorithms for both distinct elements and ℓ_p-heavy hitters that are nearly optimal in both n and ϵ. Applying our new composable histogram framework, we provide an algorithm that outputs a (1+ϵ)-approximation to the number of distinct elements in the sliding window model and uses 𝒪(1/ϵ^2log nlog1/ϵloglog n+1/ϵlog^2 n) bits of space. For ℓ_p-heavy hitters, we provide an algorithm using space 𝒪(1/ϵ^plog^3 n(loglog n+log1/ϵ)) for 0更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要