Energy-Efficient GPU-Intensive Workload Scheduling for Data Centers.

Matthew Smith, Luke Zhao, Jonathan Cordova, Xunfei Jiang, Mahdi Ebrahimi

International Conference on Machine Learning and Applications (2023)

Abstract
Cooling costs account for a significant part of the total energy consumption in data centers, and previous research has mainly focused on thermal-aware workload distribution strategies for CPU-intensive workloads. This paper introduces a novel machine learning-based approach that aims to reduce energy consumption through thermal-aware workload distribution, with the goal of building energy-efficient data centers for GPU-intensive workloads. To achieve this goal, the study employs the GpuCloudSim Plus simulator, which models the distribution of GPU-intensive applications under diverse workloads and utilizations. The integration of machine learning models allows for accurate temperature predictions and a comprehensive evaluation of the proposed algorithm's performance. We propose a new workload scheduling algorithm, ThermalAwareGpu, to reduce the energy cost of GPU-intensive workloads. We evaluated the algorithm on three common workload patterns, and it saved up to 12.79% of computing cost compared to the baseline algorithms. Our future work includes exploring the estimation of data center cooling energy and conducting in-depth comparisons of different workload balancing algorithms on various compute-intensive workloads.
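The abstract does not describe how ThermalAwareGpu works internally, so the following is only a minimal sketch of the general idea of thermal-aware placement, not the authors' algorithm. It assumes a hypothetical linear temperature model in place of the learned predictor, and the names Gpu, predicted_temp, and place_task, along with all numeric parameters, are illustrative assumptions. Each task goes to the GPU whose predicted temperature after placement is lowest.

```python
from dataclasses import dataclass, field


@dataclass
class Gpu:
    name: str
    idle_temp_c: float        # hypothetical idle temperature in degrees C
    temp_per_util_c: float    # hypothetical degrees C added per unit of utilization
    utilization: float = 0.0  # fraction of capacity in use, 0.0 to 1.0
    tasks: list = field(default_factory=list)

    def predicted_temp(self, extra_util: float = 0.0) -> float:
        # Assumption: a linear stand-in for a learned temperature model.
        return self.idle_temp_c + self.temp_per_util_c * (self.utilization + extra_util)


def place_task(gpus, task_name, task_util):
    """Greedy placement: pick the GPU with the lowest predicted temperature
    after adding the task. Returns the chosen Gpu, or None if nothing fits."""
    candidates = [g for g in gpus if g.utilization + task_util <= 1.0]
    if not candidates:
        return None
    best = min(candidates, key=lambda g: g.predicted_temp(task_util))
    best.utilization += task_util
    best.tasks.append(task_name)
    return best


# Two hypothetical GPUs with different thermal characteristics.
gpus = [
    Gpu("gpu0", idle_temp_c=35.0, temp_per_util_c=40.0),
    Gpu("gpu1", idle_temp_c=30.0, temp_per_util_c=48.0),
]

# Place three tasks that each need half of a GPU.
placements = [place_task(gpus, f"task{i}", 0.5).name for i in range(3)]
print(placements)
```

In this toy setup the first task lands on gpu1 because it idles cooler, while the next two go to gpu0 because gpu1 heats up faster under load. A real scheduler would replace predicted_temp with a trained model and would also account for cooling energy, which the abstract lists as future work.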