Fast computation of distance-generalized cores using sampling

2021 IEEE International Conference on Data Mining (ICDM)(2023)

Abstract
Core decomposition is a classic technique for discovering densely connected regions in a graph, with a wide range of applications. Formally, a k-core is a maximal subgraph where each vertex has at least k neighbors. A natural extension of a k-core is a (k, h)-core, where each node must have at least k nodes that can be reached with a path of length at most h. The downside of using (k, h)-core decomposition is the significant increase in computational complexity: whereas the standard core decomposition can be done in 𝒪(m) time, the generalization can require 𝒪(n^2 m) time, where n and m are the number of nodes and edges in the given graph. In this paper, we propose a randomized algorithm that produces an ϵ-approximation of the (k, h)-core decomposition with a probability of 1 - δ in 𝒪(ϵ^-2 hm (log^2 n - log δ)) time. The approximation is based on sampling the neighborhoods of nodes, and we use Chernoff bounds to prove the approximation guarantee. We also study distance-generalized dense subgraphs, show that the problem is NP-hard, provide an algorithm for discovering such graphs with approximate core decompositions, and provide theoretical guarantees for the quality of the discovered subgraphs. We demonstrate empirically that approximating the decomposition complements the exact computation: computing the approximation is significantly faster than computing the exact solution for the networks where computing the exact solution is slow.
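To make the (k, h)-core definition concrete, the following is a minimal Python sketch of a naive exact decomposition by peeling. It is not the randomized sampling algorithm proposed in the paper, and it makes no attempt to match the running times quoted above. The function names `h_degree` and `naive_kh_core_numbers` and the adjacency-dictionary input format are assumptions made for illustration only.

```python
from collections import deque


def h_degree(adj, v, h, alive):
    """Count nodes in `alive` (other than v) reachable from v by a path
    of length at most h that stays inside `alive`."""
    seen = {v}
    queue = deque([(v, 0)])
    while queue:
        u, dist = queue.popleft()
        if dist == h:
            continue  # do not expand beyond distance h
        for w in adj[u]:
            if w in alive and w not in seen:
                seen.add(w)
                queue.append((w, dist + 1))
    return len(seen) - 1


def naive_kh_core_numbers(adj, h):
    """Naive peeling (illustrative assumption, not the paper's method):
    repeatedly remove a node of minimum h-degree, recomputing all
    h-degrees from scratch after every removal."""
    alive = set(adj)
    core = {}
    k = 0
    while alive:
        degs = {v: h_degree(adj, v, h, alive) for v in alive}
        v = min(alive, key=degs.get)
        k = max(k, degs[v])  # core numbers never decrease during peeling
        core[v] = k
        alive.remove(v)
    return core


# A triangle 0-1-2 with a pendant node 3 attached to node 2.
adj = {0: [1, 2], 1: [0, 2], 2: [0, 1, 3], 3: [2]}
print(naive_kh_core_numbers(adj, 1))  # h = 1 is the standard k-core case
print(naive_kh_core_numbers(adj, 2))
```

With h = 1 the h-degree is the ordinary degree, so the sketch reduces to the standard core decomposition. Recomputing every h-degree after each removal is what makes this version slow, which is the cost that the sampling-based approximation is meant to avoid.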
Keywords
Distance-generalized core decomposition, Sampling, Approximation algorithm, Chernoff bounds, Dense subgraph