A Multi-Step Reinforcement Learning Algorithm

Applied Mechanics and Materials(2011)

引用 3|浏览6
暂无评分
摘要
Reinforcement learning (RL) is a state or action value based machine learning method which approximately solves large-scale Markov Decision Process (MDP) or Semi-Markov Decision Process (SMDP). A multi-step RL algorithm called Sarsa(lambda,k) is proposed, which is a compromised variation of Sarsa and Sarsa(lambda). It is equivalent to Sarsa if k is 1 and is equivalent to Sarsa(lambda) if k is infinite. Sarsa(lambda,k) adjust its performance by setting k value. Two forms of Sarsa(lambda,k), forward view Sarsa(lambda,k) and backward view Sarsa(lambda,k), are constructed and proved equivalent in off-line updating.
更多
查看译文
关键词
Reinforcement learning,Sarsa(lambda,k),Sarsa,Sarsa(lambda)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要