Fast discovery of unexpected patterns in data, relative to a Bayesian network

KDD(2005)

引用 69|浏览44
暂无评分
摘要
We consider a model in which background knowledge on a given domain of interest is available in terms of a Bayesian network, in addition to a large database. The mining problem is to discover unexpected patterns: our goal is to find the strongest discrepancies between network and database. This problem is intrinsically difficult because it requires inference in a Bayesian network and processing the entire, potentially very large, database. A sampling-based method that we introduce is efficient and yet provably finds the approximately most interesting unexpected patterns. We give a rigorous proof of the method's correctness. Experiments shed light on its efficiency and practicality for large-scale Bayesian networks and databases.
更多
查看译文
关键词
strongest discrepancy,large database,sampling-based method,bayesian network,rigorous proof,unexpected pattern,mining problem,fast discovery,large-scale bayesian network,interesting unexpected pattern,bayesian networks,very large database,association rules,sampling,association rule
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要