Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning
Abstract:
Reinforcement learning (RL) methods usually treat reward functions as black boxes. As such, these methods must extensively interact with the environment in order to discover rewards and optimal policies. In most RL applications, however, users have to program the reward function and, hence, there is the opportunity to treat reward funct...More
Code:
Data:
Tags
Comments