Database-support for continuous prediction queries over streaming data

PVLDB(2010)

引用 17|浏览49
暂无评分
摘要
Prediction is emerging as an essential ingredient for real-time monitoring, planning and decision support applications such as intrusion detection, e-commerce pricing and automated resource management. This paper presents a system that efficiently supports continuous prediction queries (CPQs) over streaming data using seamlessly-integrated probabilistic models. Specifically, we describe how to execute and optimize CPQs using discrete (Dynamic) Bayesian Networks as the underlying predictive model. Our primary contribution is a novel cost-based optimization framework that employs materialization, sharing, and model-specific optimization techniques to enable highly-efficient point- and range-based CPQ execution. Furthermore, we support efficient execution of top-k and threshold-based high probability queries. We characterize the behavior of our system and demonstrate significant performance gains using a prototype implementation operating on real-world network intrusion data and deployed as part of a real-time software-performance monitoring system.
更多
查看译文
关键词
real-time monitoring,optimize cpqs,optimization framework,real-time software-performance monitoring system,model-specific optimization technique,decision support application,efficient execution,range-based cpq execution,continuous prediction query,intrusion detection,decision support,probabilistic model,software performance,model specification,dynamic bayesian network,prediction model,real time,e commerce
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要