Distributed Distributional Deterministic Policy Gradients
ICLR, Volume abs/1804.08617, 2018.
Abstract:
This work adopts the very successful distributional perspective on reinforcement learning and adapts it to the continuous control setting. We combine this within a distributed framework for off-policy learning in order to develop what we call the Distributed Distributional Deep Deterministic Policy Gradient algorithm, D4PG. We also combin...