Benchmarks for Deep Off-Policy Evaluation

international conference on learning representations, 2021.

被引用0|浏览55
微博一下
A benchmark proposal for off-policy evaluation and policy selection.

摘要

Off-policy evaluation (OPE) holds the promise of being able to leverage large, offline datasets for both obtaining and selecting complex policies for decision making. The ability to perform evaluation offline is particularly important in many real-world domains, such as healthcare, recommender systems, or robotics, where online data colle...更多

代码

数据

下载 PDF 全文
引用
微博一下
您的评分 :
0

 

标签
评论