gIceberg: Towards iceberg analysis in large graphs

ICDE '13 Proceedings of the 2013 IEEE International Conference on Data Engineering (ICDE 2013)(2013)

引用 20|浏览2
暂无评分
摘要
Traditional multi-dimensional data analysis techniques such as iceberg cube cannot be directly applied to graphs for finding interesting or anomalous vertices due to the lack of dimensionality in graphs. In this paper, we introduce the concept of graph icebergs that refer to vertices for which the concentration (aggregation) of an attribute in their vicinities is abnormally high. Intuitively, these vertices shall be “close” to the attribute of interest in the graph space. Based on this intuition, we propose a novel framework, called gIceberg, which performs aggregation using random walks, rather than traditional SUM and AVG aggregate functions. This proposed framework scores vertices by their different levels of interestingness and finds important vertices that meet a user-specified threshold. To improve scalability, two aggregation strategies, forward and backward aggregation, are proposed with corresponding optimization techniques and bounds. Experiments on both real-world and synthetic large graphs demonstrate that gIceberg is effective and scalable.
更多
查看译文
关键词
aggregation strategy,graph iceberg,graph space,novel framework,proposed framework scores vertex,synthetic large graph,traditional SUM,traditional multi-dimensional data analysis,AVG aggregate function,abnormally high,Towards iceberg analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要