One-Pass Wavelet Synopses for Maximum-Error Metrics.

VLDB '05: Proceedings of the 31st international conference on Very large data bases(2005)

引用 102|浏览11
暂无评分
摘要
We study the problem of computing wavelet-based synopses for massive data sets in static and streaming environments. A compact representation of a data set is obtained after a thresholding process is applied on the coefficients of its wavelet decomposition. Existing polynomial-time thresholding schemes that minimize maximum error metrics are disadvantaged by impracticable time and space complexities and are not applicable in a data stream context. This is a cardinal issue, as the problem at hand in its most practically interesting form involves the time-efficient approximation of huge amounts of data, potentially in a streaming environment. In this paper we fill this gap by developing efficient and practicable wavelet thresholding algorithms for maximum-error metrics, for both a static and a streaming case. Our algorithms achieve near-optimal accuracy and superior runtime performance, as our experiments show, under frugal space requirements in both contexts.
更多
查看译文
关键词
data set,data stream context,massive data set,polynomial-time thresholding scheme,thresholding process,frugal space requirement,maximum error metrics,maximum-error metrics,practicable wavelet,space complexity,One-pass wavelet synopsis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要