Hyperparameter Selection for Offline Reinforcement Learning

Abstract:

Offline reinforcement learning (RL purely from logged data) is an important avenue for deploying RL techniques in real-world scenarios. However, existing hyperparameter selection methods for offline RL break the offline assumption by evaluating policies corresponding to each hyperparameter setting in the environment. This online executi...
