Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term FutureEI

    Amanpreet Singh
    Amanpreet Singh
    Ahmed Touati
    Ahmed Touati
    Cited by: 0|Bibtex|51|

    arXiv: Machine Learning, 2019.

    Abstract:

    In model-based reinforcement learning, the agent interleaves between model learning and planning. These two components are inextricably intertwined. If the model is not able to provide sensible long-term prediction, the executed planner would exploit model flaws, which can yield catastrophic failures. This paper focuses on building a mode...More
    Your rating :
    0

     

    Tags
    Comments