The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning

2020 25th International Conference on Pattern Recognition (ICPR)(2020)

引用 18|浏览9
暂无评分
摘要
Multi-step (also called n-step) methods in Reinforcement Learning (RL) have been shown to be more efficient than the 1-step method due to faster propagation of the reward signal, both theoretically and empirically, in tasks exploiting tabular representation of the value-function. Recently, research in Deep Reinforcement Learning (DRL) also shows that multi-step methods improve learning speed and f...
更多
查看译文
关键词
Computational modeling,Neural networks,Buildings,Reinforcement learning,Approximation error,Approximation algorithms,Pattern recognition
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要