Evolutionary Stochastic Policy Distillation


Abstract:

Solving the Goal-Conditioned Reward Sparse (GCRS) task is a challenging reinforcement learning problem due to the sparsity of reward signals. In this work, we propose a new formulation of GCRS tasks from the perspective of the drifted random walk on the state space, and design a novel method called Evolutionary Stochastic Policy Distill...
