Reinforcement Learning from Imperfect Demonstrations
ICLR, Volume abs/1802.05313, 2018.
Robust real-world learning should benefit from both demonstrations and interaction with the environment. Current approaches to learning from demonstration and reward perform supervised learning on expert demonstration data and use reinforcement learning to further improve performance based on reward from the environment. These tasks have ...More
PPT (Upload PPT)