Regioned Episodic Reinforcement Learning

user-5fe1a78c4c775e6ec07359f9 (2021)

Abstract
Goal-oriented reinforcement learning algorithms are often good at exploration but not exploitation, while episodic algorithms excel at exploitation but not exploration. As a result, neither approach alone yields a sample-efficient algorithm in complex environments with high-dimensional state spaces and delayed rewards. Motivated by these observations and shortcomings, in this paper we introduce Regioned Episodic Reinforcement Learning (RERL), which combines the strengths of episodic and goal-oriented learning to produce a more sample-efficient and effective algorithm. RERL achieves this by decomposing the state space into several sub-space regions and constructing regions that lead to more effective exploration and high-value trajectories. Extensive experiments on various benchmark tasks show that RERL outperforms existing methods in terms of sample efficiency and final rewards.
Keywords
Reinforcement learning, State space, Episodic memory, Machine learning, Computer science, Artificial intelligence, Efficient algorithm, High dimensional
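
The abstract describes the mechanism only at a high level, so the following is a minimal, speculative Python sketch of the two ingredients it names: a decomposition of the state space into regions to drive exploration, and an episodic memory that retains high-value trajectories to drive exploitation. Every name here (EpisodicMemory, assign_region, the toy env interface with reset/step/sample_action) is hypothetical and not taken from the paper.

```python
import random
from collections import defaultdict


class EpisodicMemory:
    """Keeps the best return seen for each (region, state) key.

    Hypothetical structure: the paper does not specify its memory layout.
    States are assumed hashable (e.g., discrete) for this sketch.
    """

    def __init__(self):
        self.best_return = defaultdict(lambda: float("-inf"))

    def update(self, region_id, trajectory, episode_return):
        # Exploitation side: remember states along trajectories whose
        # return beats anything previously stored for that region.
        for state in trajectory:
            key = (region_id, state)
            if episode_return > self.best_return[key]:
                self.best_return[key] = episode_return


def assign_region(state, num_regions):
    # Hypothetical region decomposition: hash states into sub-spaces.
    return hash(state) % num_regions


def run_episode(env, memory, num_regions, max_steps=200):
    # Exploration side: sample a goal region, then roll out a random
    # policy (a placeholder for the paper's goal-oriented policy).
    goal_region = random.randrange(num_regions)
    state = env.reset()
    trajectory, episode_return = [state], 0.0
    for _ in range(max_steps):
        action = env.sample_action()
        state, reward, done = env.step(action)
        trajectory.append(state)
        episode_return += reward
        if done:
            break
    memory.update(goal_region, trajectory, episode_return)
    return episode_return
```

The split mirrors the abstract's claim: goal-oriented region selection supplies the exploration signal, while the episodic memory exploits high-value trajectories once found; how RERL actually constructs and updates its regions is not recoverable from the abstract alone.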