Hyperbolic Discounting and Learning over Multiple Horizons

    William Fedus
    Carles Gelada

    arXiv: Machine Learning, 2019.

    Abstract:

    Reinforcement learning (RL) typically defines a discount factor as part of the Markov Decision Process. The discount factor values future rewards by an exponential scheme that leads to theoretical convergence guarantees of the Bellman equation. However, evidence from psychology, economics and neuroscience suggests that humans and animals ...
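
To make the contrast in the abstract concrete, the following is a minimal sketch of the two discounting schemes. The exponential weight `gamma ** t` is the scheme the abstract describes. The hyperbolic weight `1 / (1 + k * t)` is the standard textbook form and is an assumption here, since the abstract is truncated before it states the form the paper uses. The function names and the default values of `gamma` and `k` are hypothetical and chosen only for illustration.

```python
from typing import Callable, Sequence


def exponential_discount(t: int, gamma: float = 0.99) -> float:
    """Weight on a reward received t steps ahead under exponential discounting."""
    return gamma ** t


def hyperbolic_discount(t: int, k: float = 0.01) -> float:
    """Weight under the standard hyperbolic form 1 / (1 + k * t).

    Assumption: this textbook form is used for illustration only; it is
    not taken from the truncated abstract.
    """
    return 1.0 / (1.0 + k * t)


def discounted_return(rewards: Sequence[float],
                      weight_fn: Callable[[int], float]) -> float:
    """Sum of rewards, each weighted by weight_fn at its time index."""
    return sum(weight_fn(t) * r for t, r in enumerate(rewards))


if __name__ == "__main__":
    rewards = [1.0] * 200  # hypothetical constant reward stream
    print("exponential:", discounted_return(rewards, exponential_discount))
    print("hyperbolic: ", discounted_return(rewards, hyperbolic_discount))
```

Both weights equal 1 at `t = 0`. With these illustrative defaults the hyperbolic weight decays more slowly at long delays than the exponential one.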
