Semi-supervised reward learning for offline reinforcement learning
Abstract:
In offline reinforcement learning (RL), agents are trained using a logged dataset. Offline RL appears to be the most natural route for tackling real-life applications because, in domains such as healthcare and robotics, interactions with the environment are either expensive or unethical. Training agents usually requires reward functions, but unfortun...