Value activation for bias alleviation: Generalized-activated deep double deterministic policy gradients

Neurocomputing (2023)

Abstract
• We propose a novel generalized-activated weighting operator for bias alleviation in deep reinforcement learning.
• We show theoretically and experimentally that the generalized-activated weighting operator helps alleviate both underestimation bias and overestimation bias.
• We find that simple activation functions are enough to achieve strong performance, without any tricks or special design of the activation function.
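The highlights above name a generalized-activated weighting operator but do not define it. As a rough illustration only, the sketch below assumes one plausible reading: the value estimate is an average of sampled Q-values, each weighted by an activation function applied to that Q-value. The function name `activated_weighted_value`, the sampling setup, the fallback for all-zero weights, and the example activations are assumptions made here, not details taken from the paper.

```python
import numpy as np


def activated_weighted_value(q_values, activation):
    """Activation-weighted average of sampled Q-values (illustrative sketch).

    Assumption: `activation` is non-negative and non-decreasing, so larger
    Q-values never receive smaller weights than smaller Q-values.
    """
    q = np.asarray(q_values, dtype=float)
    weights = activation(q)
    total = weights.sum()
    if total <= 0.0:
        # Assumed fallback: if every weight is zero, use the plain mean.
        return float(q.mean())
    return float((weights * q).sum() / total)


# Hypothetical Q-values for a few sampled actions in one state.
q_samples = np.array([1.0, 2.0, 3.0])

# A constant activation recovers the plain mean of the samples.
mean_like = activated_weighted_value(q_samples, lambda q: np.ones_like(q))

# An exponential activation shifts the estimate toward the maximum.
exp_like = activated_weighted_value(q_samples, np.exp)
```

Under these assumptions the estimate always lies between the mean and the maximum of the sampled Q-values, which is one way to picture how a choice of activation could trade off underestimation against overestimation. Note that `np.exp` can overflow for large Q-values; a practical version would need to guard against that.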
Keywords
Reinforcement learning, Estimation bias, Activation function, Continuous control