Meta-Q-Learning

    Cited by: 0|Bibtex|46|

    International Conference on Learning Representations, 2020.

    Keywords:
    meta reinforcement learning propensity estimation off-policy

    Abstract:

    This paper introduces Meta-Q-Learning (MQL), a new off-policy algorithm for meta-Reinforcement Learning (meta-RL). MQL builds upon three simple ideas. First, we show that Q-learning is competitive with state-of-the-art meta-RL algorithms if given access to a context variable that is a representation of the past trajectory. Second, a multi...More
    Your rating :
    0

     

    Tags
    Comments