Capacity planning for dependable services

International Symposium on Stabilization, Safety, and Security of Distributed Systems (SSS) (2023)

Abstract
Fault-tolerance techniques rely on replication to improve availability, at the price of higher infrastructure costs. This creates a fundamental trade-off: fault-tolerant services must satisfy given availability and performance constraints while minimising the number of replicated resources. These constraints pose capacity planning challenges for service operators, who must minimise replication costs without degrading availability. To this end, we present PCRAFT (Performant, Cheap, Reliable and Available Fault Tolerance), a practical process to enable capacity planning of dependable services. PCRAFT's capacity planning process takes a hybrid approach that combines empirical performance measurements with probabilistic modelling of availability based on fault injection. In particular, we integrate traditional service-level availability mechanisms (active-route-anywhere and passive-failover) and deployment schemes (cloud and on-premises) to quantify the number of nodes needed to satisfy the given availability and performance constraints. Our evaluation, based on real-world applications, shows that cloud deployments require fewer nodes than on-premises deployments. Additionally, for on-premises deployments, we show that passive-failover requires fewer nodes than active-route-anywhere. Furthermore, our evaluation quantifies the quality enhancement provided by additional integrity mechanisms and how it affects the number of nodes needed. (c) 2023 Elsevier B.V. All rights reserved.
Keywords
Dependability, Capacity planning, Modelling
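
The capacity question posed in the abstract (how many nodes are needed to meet an availability target while still serving the load) can be illustrated with a minimal sketch. This is not PCRAFT's model, which the abstract describes only as combining empirical measurements with fault-injection-based probabilistic modelling. The sketch assumes independent node failures, a single per-node availability `a`, and a performance requirement expressed as "at least `k` nodes must be up"; the function names `availability` and `min_nodes` and the `max_nodes` cap are hypothetical.

```python
from math import comb


def availability(n: int, k: int, a: float) -> float:
    """Probability that at least k of n nodes are up.

    Assumption (not from the paper): nodes fail independently and each
    is up with the same probability a.
    """
    return sum(comb(n, i) * a**i * (1 - a) ** (n - i) for i in range(k, n + 1))


def min_nodes(k: int, a: float, target: float, max_nodes: int = 1000) -> int:
    """Smallest n >= k such that availability(n, k, a) >= target.

    k stands in for the performance constraint (nodes needed to carry
    the load); target stands in for the availability constraint.
    """
    for n in range(k, max_nodes + 1):
        if availability(n, k, a) >= target:
            return n
    raise ValueError("target not reachable within max_nodes")


if __name__ == "__main__":
    # Hypothetical inputs: 2 nodes needed for load, each node up 90% of the time.
    print(min_nodes(k=2, a=0.9, target=0.99))
```

Under these assumptions, raising either `k` or `target` increases the node count, which mirrors the cost versus availability trade-off the abstract describes.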