Efficient discovery of frequent subgraph patterns in uncertain graph databases

EDBT/ICDT '11: Proceedings of the 14th International Conference on Extending Database Technology(2011)

引用 58|浏览0
暂无评分
摘要
Mining frequent subgraph patterns in graph databases is a challenging and important problem with applications in several domains. Recently, there is a growing interest in generalizing the problem to uncertain graphs, which can model the inherent uncertainty in the data of many applications. The main difficulty in solving this problem results from the large number of candidate subgraph patterns to be examined and the large number of subgraph isomorphism tests required to find the graphs that contain a given pattern. The latter becomes even more challenging, when dealing with uncertain graphs. In this paper, we propose a method that uses an index of the uncertain graph database to reduce the number of comparisons needed to find frequent subgraph patterns. The proposed algorithm relies on the apriori property for enumerating candidate subgraph patterns efficiently. Then, the index is used to reduce the number of comparisons required for computing the expected support of each candidate pattern. It also enables additional optimizations with respect to scheduling and early termination, that further increase the efficiency of the method. The evaluation of our approach on three real-world datasets as well as on synthetic uncertain graph databases demonstrates the significant cost savings with respect to the state-of-the-art approach.
更多
查看译文
关键词
subgraph isomorphism test,candidate pattern,large number,efficient discovery,frequent subgraph pattern,candidate subgraph pattern,uncertain graph database,graph databases,enumerating candidate subgraph pattern,synthetic uncertain graph databases,uncertain graph,clustering,directed graphs,indexation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要