Distributed Distributional Deterministic Policy Gradients
ICLR, Volume abs/1804.08617, 2018.
This work adopts the very successful distributional perspective on reinforcement learning and adapts it to the continuous control setting. We combine this within a distributed framework for off-policy learning in order to develop what we call the Distributed Distributional Deep Deterministic Policy Gradient algorithm, D4PG. We also combin...