Fast State Machine Replication for Cloud Services

semanticscholar(2019)

引用 0|浏览0
暂无评分
摘要
SAGAR JHA, Cornell University, USA JONATHAN BEHRENS, Cornell University, USA and MIT, USA THEO GKOUNTOUVAS, Cornell University, USA MATTHEW MILANO, Cornell University, USA WEIJIA SONG, Cornell University, USA EDWARD TREMEL, Cornell University, USA SYDNEY ZINK, Cornell University, USA KENNETH P. BIRMAN, Cornell University, USA ROBBERT VAN RENESSE, Cornell University, USA Cloud computing services often replicate data and may require ways to coordinate distributed actions. Here we present Derecho, a library for such tasks. The API provides interfaces for structuring applications into patterns of subgroups and shards, supports state machine replication within them, and includes mechanisms that assist in restart after failures. Running over 100Gbps RDMA, Derecho can send millions of events per second in each subgroup or shard and throughput peaks at 16GB/s, substantially outperforming prior solutions. Configured to run purely on TCP, Derecho is still substantially faster than comparable widely used, highly-tuned, standard tools. The key insight is that on modern hardware (including non-RDMA networks), data-intensive protocols should be built from non-blocking data-flow components. CCS Concepts: • Computer systems organization→ Dependable and fault tolerant systems and networks; • Software and its engineering→ Cloud computing; Additional
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要