Accelerated Deep Reinforcement Learning for Wireless Coded Caching
2019 IEEE/CIC International Conference on Communications in China (ICCC)(2019)
摘要
Coded caching is effective in leveraging the accumulated storage size in wireless networks by distributing different coded segments of a file in different cache nodes. This paper aims to find a wireless coded caching policy to minimize the long-term network cost, which involves both transmission delay and cache replacement cost, using tools from machine learning. The problem is known to be challenging due to the unknown content popularity and the continuous, high-dimensional action space. We first predict the number of file requests using historical information. Based on the predicted results, we then propose a supervised deep deterministic policy gradient (SDDPG) approach. This approach, on one hand, can learn the caching policy in continuous action space by using the actor network to make cache decision and the critic network to evaluate cache decision. On the other hand, it accelerates the learning process by pre-training the actor network with the results of an approximate problem that minimizes the per-slot cost. Real-world trace-based numerical results show that the proposed SDDPG approach is effective.
更多查看译文
关键词
accelerated deep reinforcement,wireless networks,wireless coded caching policy,long-term network cost,cache replacement cost,machine learning,high-dimensional action space,supervised deep deterministic policy gradient approach,continuous action space,actor network,cache decision,learning process,storage size,cache node,coded segments,SDDPG approach
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络