Sliding-Window Probabilistic Threshold Aggregate Queries on Uncertain Data Streams

Donghui Chen, Ling Chen

Information Sciences(2020)

引用 4|浏览53
暂无评分
摘要
Uncertain data streams are ubiquitous in many sensing and networking environments. Probabilistic aggregate query that returns a probability distribution to denote possible answers is extensively used on such streams. In many monitoring applications, it is only necessary to know whether the result distribution exceeds user-defined thresholds. In this paper, we formalize two important query types: sliding-window probabilistic threshold sum query and sliding-window probabilistic threshold count query, which introduce two thresholds (probability and score) into the probability distribution. An intuition solution is to use existing probabilistic aggregate algorithms to obtain the probability distribution and then apply the thresholds to this probability distribution. However, this solution separates the threshold processing from query processing and results in low efficiency. Our work uses Gaussian mixture models to represent uncertain data. Based on the Gaussian properties and probability theory of this model, we design efficient algorithms to answer these queries, which include filtering strategies and exact calculations. Several techniques (e.g., characteristic function, incremental calculation, pruning strategy, and state transition equation) are integrated into the exact calculations to improve time and space efficiency. Experiments on real and synthetic datasets demonstrate that our algorithms outperform existing algorithms.
更多
查看译文
关键词
Uncertain data,Sliding-window,Probabilistic aggregate queries,Probabilistic threshold queries
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要