${{\sf MEB}}$ ) aims to find a bicliqu"/>

Efficient Maximum Edge-Weighted Biclique Search on Large Bipartite Graphs

IEEE Transactions on Knowledge and Data Engineering(2023)

引用 1|浏览6
暂无评分
摘要
Given a bipartite graph, the maximum edge biclique problem ( ${{\sf MEB}}$ ) aims to find a biclique with the largest number of edges. ${{\sf MEB}}$ is a fundamental problem with many real applications, such as community analysis, E-commerce services and bioinformatics. However, in some scenarios, the weight of an edge reflects valuable and important information on the relationship between two entities. Motivated by this, in this paper, we investigate the problem of maximum edge-weighted biclique search ( ${{\sf MEWB}}$ ), which finds a biclique with the largest total weight of edges in a weighted bipartite graph. ${{\sf MEWB}}$ has many real applications, including item recommendation, fraud detection, gene clustering, etc. Although we show that ${{\sf MEWB}}$ can be resolved by adapting the search algorithm designed for ${{\sf MEB}}$ , the performance of this method is yet unsatisfactory. To improve the computation efficiency, two optimizations in terms of upper bound and search order are proposed. For the upper bound, we consider the degree distribution for vertices in the candidate set, and thus have a chance to discard a few edges to tighten the upper bound. For search order, we theoretically show that a vertex order generating the most similar search depth on vertices can achieve the least time cost for ${{\sf MEWB}}$ . Guided by this fact, we propose the global summation vertex order. To further accelerate the computation, we extend our approach to a parallel environment, and develop a heuristic approach to deal with large-scale graphs by slightly sacrificing the answer quality. Extensive performance studies conducted on real datasets demonstrate that our proposals can significantly outperform the baseline method by up to two orders of magnitude. Besides, our heuristic approach gives the optimal result on 8 out of 10 real datasets, while achieving more than an order of magnitude of speed-up.
更多
查看译文
关键词
Maximum edge-weighted biclique, upper bound, vertex order, heuristic, parallelization
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要