Recovering A Hidden Community Beyond The Kesten-Stigum Threshold In O(Vertical Bar E Vertical Bar Log* Vertical Bar V Vertical Bar) Time

JOURNAL OF APPLIED PROBABILITY(2018)

引用 8|浏览18
暂无评分
摘要
Community detection is considered for a stochastic block model graph of n vertices, with K vertices in the planted community, edge probability p for pairs of vertices both in the community, and edge probability q for other pairs of vertices. The main focus of the paper is on weak recovery of the community based on the graph G, with o(K) misclassified vertices on average, in the sublinear regime n(1-o(1)) <= K <= o(n). A critical parameter is the effective signal-to-noise ratio lambda = K-2(p - q)(2)/ ((n - K) q), with lambda = 1 corresponding to the Kesten-Stigum threshold. We show that a belief propagation (BP) algorithm achieves weak recovery if lambda > 1/e, beyond the Kesten-Stigum threshold by a factor of 1/e. The BP algorithm only needs to run for log* n + O(1) iterations, with the total time complexity O(vertical bar E vertical bar log* n), where log* n is the iterated logarithm of n. Conversely, if lambda <= 1/e, no local algorithm can asymptotically outperform trivial random guessing. Furthermore, a linear message-passing algorithm that corresponds to applying a power iteration to the nonbacktracking matrix of the graph is shown to attain weak recovery if and only if lambda > 1. In addition, the BP algorithm can be combined with a linear-time voting procedure to achieve the information limit of exact recovery (correctly classify all vertices with high probability) for all K >= (n/ log n)( rho(BP) + o(1)), where rho(BP) is a function of p/q.
更多
查看译文
关键词
Hidden community, belief propagation, message passing, spectral algorithms, high-dimensional statistics
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要