$s$

Efficient Algorithms for Group Hitting Probability Queries on Large Graphs

IEEE Transactions on Knowledge and Data Engineering(2024)

引用 0|浏览3
暂无评分
摘要
Given a source node $s$ and a target node $t$ , the hitting probability tells us how likely an $\alpha$ -terminating random walk (which stops with probability $\alpha$ at each step) starting from $s$ can hit $t$ before it stops. This concept originates from the hitting time, a classic concept in random walks. In this paper, we focus on the group hitting probability (GHP) where the target is a set of nodes, measuring the node-to-group structural proximity. For this group version of the hitting probability, we present efficient algorithms for two types of GHP queries: the pairwise query which returns the GHP value of a target set $T$ with respect to (w.r.t.) a source node $s$ , and the top- $k$ query which returns the top- $k$ target sets with the largest GHP value w.r.t. a source node $s$ . We first develop an efficient algorithm named SAMBA for the pairwise query, which is built on a group local push algorithm tailored for GHP, with rigorous analysis for correctness. Next, we show how to speed up SAMBA by combining the group local push algorithm with the Monte Carlo approach, where GHP brings new challenges as it might need to consider every hop of the random walk. We tackle this issue with a new formulation of the GHP and show how to provide approximation guarantees with a detailed theoretical analysis. With SAMBA as the backbone, we develop an iterative algorithm for top- $k$ queries, which adaptively refines the bounds for the candidate target sets, and terminates as soon as it meets the stopping condition, thus saving unnecessary computational costs. We further present an optimization technique to accelerate the top- $k$ query, improving its practical performance. Extensive experiments show that our solutions are orders of magnitude faster than their competitors.
更多
查看译文
关键词
Data mining,graph algorithms,graphs and networks
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要