Reinforced Imitation in Heterogeneous Action SpaceEI
arXiv: Learning, 2019.
Imitation learning is an effective alternative approach to learn a policy when the reward function is sparse. In this paper, we consider a challenging setting where an agent and an expert use different actions from each other. We assume that the agent has access to a sparse reward function and state-only expert observations. We propose a ...More