Wasserstein Reinforcement Learning

CoRR, 2019.

Cited by: 0|Views32
EI

Abstract:

We propose behavior-driven optimization via Wasserstein distances (WDs) to improve several classes of state-of-the-art reinforcement learning (RL) algorithms. We show that WD regularizers acting on appropriate policy embeddings efficiently incorporate behavioral characteristics into policy optimization. We demonstrate that they improve ...More

Code:

Data:

Full Text
Bibtex
Your rating :
0

 

Tags
Comments