Auto-scaling containerized cloud applications: A workload-driven approach

Simulation Modelling Practice and Theory(2022)

引用 7|浏览1
暂无评分
摘要
Today, cloud computing presents new business opportunities as it offers various technological advantages including elastic computing and efficient pricing strategies. Although, cloud users have access to large amount of resources, it is yet a challenging task to efficiently manage the hardware resources in a cloud environment. In this article, we present PACE (Performance-aware Auto-scaler for Cloud Elasticity), a framework for auto-scaling containerized cloud applications based on workload demand. The framework offers a) reactive auto-scaling using threshold-based rules to avoid application failures during intensive workload tasks and b) proactive auto-scaling using convolutional neural networks (CNN) and K-means to generate elastic scaling policies that incorporate future workload demands. The experimental analysis is based on the Yahoo! Cloud Serving Benchmark (YCSB) executed in Redis containers deployed on the Google Cloud Platform. The proposed framework can automatically adjust cloud resources to satisfy workload demand and ensure Quality of Service (QoS) requirements.
更多
查看译文
关键词
Cloud computing,Elastic scaling,Convolutional neural network,K-means,Time series analysis,Machine learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要