GOLGI: Performance-Aware, Resource-Efficient Function Scheduling for Serverless Computing

Suyi Li,Wei Wang, Jun Yang, Guangzhen Chen, Daohe Lu

PROCEEDINGS OF THE 2023 ACM SYMPOSIUM ON CLOUD COMPUTING, SOCC 2023(2023)

引用 0|浏览7
暂无评分
摘要
This paper introduces GOLGI, a novel scheduling system designed for serverless functions, with the goal of minimizing resource provisioning costs while meeting the function latency requirements. To achieve this, GOLGI judiciously over-commits functions based on their past resource usage. To ensure overcommitment does not cause significant performance degradation, GOLGI identifies nine low-level metrics to capture the runtime performance of functions, encompassing factors like request load, resource allocation, and contention on shared resources. These metrics enable accurate prediction of function performance using the Mondrian Forest, a classification model that is continuously updated in real-time for optimal accuracy without extensive offline training. GOLGI employs a conservative exploration-exploitation strategy for request routing. By default, it routes requests to non-overcommitted instances to ensure satisfactory performance. However, it actively explores opportunities for using more resource-efficient overcommitted instances, while maintaining the specified latency SLOs. Golgi also performs vertical scaling to dynamically adjust the concurrency of overcommitted instances, maximizing request throughput and enhancing system robustness to prediction errors. We have prototyped GOLGI and evaluated it in both EC2 cluster and a small production cluster. The results show that GOLGI can meet the SLOs while reducing the resource provisioning cost by 42% (30%) in EC2 cluster (our production cluster).
更多
查看译文
关键词
Serverless Computing,Resource Management,Scheduling
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要