HeavySeparation: A Generic Framework for Stream Processing Faster and More Accurate

Computer Communications(2024)

引用 0|浏览2
暂无评分
摘要
Sketch as a probability data structure has been widely used in high volume, fast data streams. At the cost of a tiny accuracy in frequency estimation, it achieves a high speed with small memory usage. However, skewed data streams pose a significant challenge for existing sketches in terms of accuracy and speed using limited memory. To address this issue, we proposed a framework, called HeavySeparation, to enhance existing sketches by filtering elephant flow efficiently and accurately. We adopt a power-weakening increment strategy to allow sufficient competition in the early stages of identifying elephant flows and amplifying relative advantage when the frequency of candidate flow is large. To verify the effectiveness and efficiency of our framework, we apply the framework to two typical sketches and two common stream processing tasks. Results show that HeavySeparation framework reduces the error by around 1∼2 orders of magnitude on average compared to the state-of-the-art in frequency estimation.
更多
查看译文
关键词
Traffic Measurement,Approximate Algorithms,Sketch
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要