Top-K Query Processing In Probabilistic Databases With Non-Materialized Views

ICDE '13 Proceedings of the 2013 IEEE International Conference on Data Engineering (ICDE 2013)(2013)

引用 46|浏览3
暂无评分
摘要
We investigate a novel approach of computing confidence bounds for top-k ranking queries in probabilistic databases with non-materialized views. Unlike related approaches, we present an exact pruning algorithm for finding the top-ranked query answers according to their marginal probabilities without the need to first materialize all answer candidates via the views. Specifically, we consider conjunctive queries over multiple levels of select-project-join views, the latter of which are cast into Datalog rules which we ground in a top-down fashion directly at query processing time. To our knowledge, this work is the first to address integrated data and confidence computations for intensional query evaluations in the context of probabilistic databases by considering confidence bounds over first-order lineage formulas. We extend our query processing techniques by a tool-suite of scheduling strategies based on selectivity estimation and the expected impact on confidence bounds. Further extensions to our query processing strategies include improved top-k bounds in the case when sorted relations are available as input, as well as the consideration of recursive rules. Experiments with large datasets demonstrate significant runtime improvements of our approach compared to both exact and sampling-based top-k methods over probabilistic data.
更多
查看译文
关键词
confidence bound,probabilistic databases,conjunctive query,intensional query evaluation,query processing strategy,query processing technique,query processing time,top-k ranking query,top-ranked query,confidence computation,Top-k query processing,non-materialized view
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要