Renewable Energy-Aware Big Data Analytics in Geo-distributed Data Centers with Reinforcement Learning

IEEE Transactions on Network Science and Engineering(2020)

引用 80|浏览47
暂无评分
摘要
In the age of big data, companies tend to deploy their services in data centers rather than their own servers. The demands of big data analytics grow significantly, which leads to an extremely high electricity consumption at data centers. In this paper, we investigate the cost minimization problem of big data analytics on geo-distributed data centers connected to renewable energy sources with unpredictable capacity. To solve this problem, we propose a Reinforcement Learning (RL) based job scheduling algorithm by combining RL with neural network (NN). Moreover, two techniques are developed to enhance the performance of our proposal. Specifically, Random Pool Sampling (RPS) is proposed to retrain the NN via accumulated training data, and a novel Unidirectional Bridge Network (UBN) structure is designed for further enhancing the training speed by using the historical knowledge stored in the trained NN. Experiment results on real Google cluster traces and electricity price from Energy Information Administration show that our approach is able to reduce the data centersu0027 cost significantly compared with other benchmark algorithms.
更多
查看译文
关键词
Data centers,Renewable energy sources,Big Data,Artificial neural networks,Scheduling,Energy consumption,Green products
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要