RStream: Marrying Relational Algebra with Streaming for Efficient Graph Mining on A Single Machine.

OSDI (2018)

Citations: 110 | Views: 94
Abstract
Graph mining is an important category of graph algorithms that aim to discover structural patterns such as cliques and motifs in a graph. While a great deal of work has been done recently on graph computation such as PageRank, systems support for scalable graph mining is still limited. Existing mining systems such as Arabesque focus on distributed computing and need large amounts of compute and memory resources. We built RStream, the first single-machine, out-of-core mining system that leverages disk support to store intermediate data. At its core are two innovations: (1) a rich programming model that exposes relational algebra for developers to express a wide variety of mining tasks; and (2) a runtime engine that implements relational algebra efficiently with tuple streaming. A comparison between RStream and four state-of-the-art distributed mining/Datalog systems (Arabesque, ScaleMine, DistGraph, and BigDatalog) demonstrates that RStream outperforms all of them, each running on a 10-node cluster, e.g., by at least a factor of 1.7x, and that it can process large graphs on an inexpensive machine.
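To make the idea of expressing a mining task in relational algebra concrete, the following is a minimal sketch of triangle listing written as joins over an edge relation, with tuples produced lazily by generators. It is not RStream's API or runtime: the names `build_index`, `wedges`, and `triangles` are hypothetical, and the edge relation is held in an in-memory dictionary, whereas the abstract describes an out-of-core engine that streams tuples from disk.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, Set, Tuple

Edge = Tuple[int, int]
Triple = Tuple[int, int, int]


def build_index(edges: Iterable[Edge]) -> Dict[int, Set[int]]:
    """Store the edge relation E(u, v) with u < v, keyed on u.

    Orienting each edge from the lower to the higher vertex ID removes
    duplicates and self-loops, so each triangle is reported exactly once.
    (Hypothetical helper; an in-memory stand-in for an on-disk relation.)
    """
    index: Dict[int, Set[int]] = defaultdict(set)
    for u, v in edges:
        if u != v:
            lo, hi = (u, v) if u < v else (v, u)
            index[lo].add(hi)
    return dict(index)


def wedges(index: Dict[int, Set[int]]) -> Iterator[Triple]:
    """Join E(a, b) with E(b, c) and stream the resulting (a, b, c) tuples."""
    for a, neighbours in index.items():
        for b in neighbours:
            for c in index.get(b, ()):
                yield (a, b, c)


def triangles(index: Dict[int, Set[int]]) -> Iterator[Triple]:
    """Keep only wedges closed by an edge E(a, c): a semi-join against E."""
    for a, b, c in wedges(index):
        if c in index.get(a, ()):
            yield (a, b, c)


# Example: one triangle (1, 2, 3) plus a dangling edge and a duplicate edge.
example_edges = [(1, 2), (2, 3), (1, 3), (3, 4), (2, 1)]
example_result = sorted(triangles(build_index(example_edges)))
print(example_result)  # [(1, 2, 3)]
```

Because `wedges` and `triangles` are generators, intermediate tuples are consumed one at a time rather than materialized as a full list, which loosely mirrors the tuple-streaming idea at toy scale.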