Distributed Distributional Deterministic Policy Gradients

ICLR, Volume abs/1804.08617, 2018.

Cited by: 172|Bibtex|Views101|Links
EI

Abstract:

This work adopts the very successful distributional perspective on reinforcement learning and adapts it to the continuous control setting. We combine this within a distributed framework for off-policy learning in order to develop what we call the Distributed Distributional Deep Deterministic Policy Gradient algorithm, D4PG. We also combin...More

Code:

Data:

Your rating :
0

 

Tags
Comments