Wasserstein Reinforcement Learning
CoRR, 2019.
EI
Abstract:
We propose behavior-driven optimization via Wasserstein distances (WDs) to improve several classes of state-of-the-art reinforcement learning (RL) algorithms. We show that WD regularizers acting on appropriate policy embeddings efficiently incorporate behavioral characteristics into policy optimization. We demonstrate that they improve ...More
Code:
Data:
Tags
Comments