Sensitivity Analysis of Answer Ordering from Probabilistic Databases.

DEXA (2) (2013)

Abstract
Queries over probabilistic databases return probabilistic answers, which are often ranked according to certain criteria. Because the probabilities of the basic tuples may be imprecise or erroneous, and their perturbation may greatly change the answer ordering, sensitivity analysis questions arise, such as "which basic input probability change can substantially alter the ranked result?", "which basic probability change will make a certain element top-ranked?", and "which basic probability change will swap the positions of the first- and second-ranked elements?". Sensitivity analysis of top-K probabilistic queries has been touched upon in the literature, mainly concerning changes to the answer list as a set. However, the ordering of the elements in the answer list matters greatly for certain applications. In this paper, we categorize a variety of such ordering sensitivity questions as list-oriented or element-oriented, and formulate the sensitivity analysis problem for the answer ordering returned by probabilistic top-K queries and probabilistic top-K aggregation queries. We develop a modular approach to quantitatively compute the sensitivity of answer ordering, in which four basic processing modules are identified. Optimization strategies are also presented to improve performance. Experimental results on both synthetic and real data demonstrate the effectiveness and efficiency of the proposed solutions. © 2013 Springer-Verlag.
Keywords
aggregation query, answer ordering, probabilistic database, sensitivity analysis, top-k query
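
To make the element-oriented question from the abstract concrete ("which basic probability change will swap the positions of the first- and second-ranked elements?"), here is a minimal brute-force sketch. It is not the paper's modular approach, which the abstract does not describe. The data, the names `basic`, `lineage`, `answer_prob`, `ranking`, and `smallest_swap`, and the assumption that each answer is a disjunction of independent basic tuples are all hypothetical and chosen only for illustration.

```python
from math import prod

# Hypothetical toy data (an assumption, not from the paper): each answer's
# lineage is a disjunction of independent basic tuples, so
# P(answer) = 1 - prod(1 - p_i) over its supporting tuples.
basic = {"t1": 0.6, "t2": 0.5, "t3": 0.7, "t4": 0.2}
lineage = {"a": ["t1", "t2"], "b": ["t3", "t4"], "c": ["t4"]}


def answer_prob(tuples, probs):
    """Probability that at least one supporting basic tuple is present."""
    return 1.0 - prod(1.0 - probs[t] for t in tuples)


def ranking(probs):
    """Answers sorted by descending probability (ties broken by name)."""
    scores = {a: answer_prob(ts, probs) for a, ts in lineage.items()}
    return sorted(scores, key=lambda a: (-scores[a], a))


def smallest_swap(probs, steps=1000):
    """Grid-search the smallest change to ONE basic probability that
    swaps the first- and second-ranked answers.

    Returns (tuple_name, new_probability, absolute_change) or None.
    """
    first, second = ranking(probs)[:2]
    best = None
    for t, p in probs.items():
        for k in range(steps + 1):
            q = k / steps
            trial = dict(probs)
            trial[t] = q  # perturb a single basic tuple
            if ranking(trial)[:2] == [second, first]:
                delta = abs(q - p)
                if best is None or delta < best[2]:
                    best = (t, q, delta)
    return best


if __name__ == "__main__":
    print("original ranking:", ranking(basic))
    print("smallest single-tuple swap:", smallest_swap(basic))
```

With this toy input the original order is a, b, c, and the search reports that raising `t3` to just above 0.75 is the cheapest single change that puts `b` ahead of `a`. An exhaustive grid search like this scales poorly; it only illustrates what the question asks, not how to answer it efficiently.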