Adaptive Trade-Offs in Off-Policy Learning
AISTATS, pp. 34-44, 2019.
A great variety of off-policy learning algorithms exist in the literature, and new breakthroughs in this area continue to be made, improving theoretical understanding and yielding state-of-the-art reinforcement learning algorithms. In this paper, we take a unifying view of this space of algorithms, and consider their trade-offs of three...More
PPT (Upload PPT)