Semi-supervised reward learning for offline reinforcement learning

Cited by: 0|Bibtex|Views15|Links

Abstract:

In offline reinforcement learning (RL) agents are trained using a logged dataset. It appears to be the most natural route to attack real-life applications because in domains such as healthcare and robotics interactions with the environment are either expensive or unethical. Training agents usually requires reward functions, but unfortun...More

Code:

Data:

Your rating :
0

 

Tags
Comments