Twister2: TSet High-Performance Iterative Dataflow

2019 International Conference on High Performance Big Data and Intelligent Systems (HPBD&IS)(2019)

引用 13|浏览60
暂无评分
摘要
The dataflow model is gradually becoming the de facto standard for big data applications. While many popular frameworks are built around this model, very little research has been done on understanding its inner workings, which in turn has led to inefficiencies in existing frameworks. It is important to note that understanding the relationship between dataflow and HPC building blocks allows us to address and alleviate many of these fundamental inefficiencies by learning from the extensive research literature in the HPC community. In this paper we present TSet’s, the dataflow abstraction of Twister2, which is a big data framework designed for high-performance dataflow and iterative computations. We discuss the dataflow model adopted by TSet’s and the rationale behind implementing iteration handling at the worker level. Finally, we evaluate TSet’s to show the performance of the framework.
更多
查看译文
关键词
Task analysis,Data models,Programming,Big Data,Sparks,Computational modeling,Data analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要