Self-Paced Prioritized Curriculum Learning With Coverage Penalty in Deep Reinforcement Learning.

IEEE Transactions on Neural Networks and Learning Systems(2018)

引用 109|浏览24
暂无评分
摘要
In this paper, a new training paradigm is proposed for deep reinforcement learning using self-paced prioritized curriculum learning with coverage penalty. The proposed deep curriculum reinforcement learning (DCRL) takes the most advantage of experience replay by adaptively selecting appropriate transitions from replay memory based on the complexity of each transition. The criteria of complexity in...
更多
查看译文
关键词
Training,Learning (artificial intelligence),Machine learning,Complexity theory,Training data,Games,Robustness
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要