Scene Memory Transformer for Embodied Agents in Long Time Horizon Tasks

user-5ebe28134c775eda72abcdca(2019)

Abstract
Many robotic applications require a policy to perform tasks over a long time horizon in large environments. In such applications, decision making at any step can depend on states observed far in the past. Hence, being able to properly memorize past observations is crucial. In this work we bring recent advances in neural language understanding \cite{Vaswani2017AttentionIA} to robotics. We propose a novel memory-based policy, called Scene Memory Transformer (SMT). This model is generic, makes no assumptions about the concrete application, and can be efficiently trained with Reinforcement Learning over long episodes. On a range of challenging navigation tasks, SMT outperforms other established stateful models by a margin over long episodes. We show that the proposed model is robust to noise and can utilize long-term dependencies in its memory. Videos and supplementary …
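The idea of a policy that consults a memory of past observations can be illustrated with plain scaled dot-product attention, the mechanism introduced in the cited Transformer work. The following is a minimal sketch and not the paper's implementation: the function names `softmax` and `attend`, the two-dimensional embeddings, and the use of raw memory vectors as both keys and values (with no learned projections) are all assumptions made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, memory):
    """Scaled dot-product attention of one query over a list of memory vectors.

    Assumption: keys and values are the raw memory vectors themselves;
    a real model would apply learned projections first.
    """
    d = len(query)
    # Similarity of the query to every stored observation embedding.
    scores = [
        sum(q * k for q, k in zip(query, mem)) / math.sqrt(d)
        for mem in memory
    ]
    weights = softmax(scores)
    # Weighted average of the memory vectors, one component at a time.
    context = [
        sum(w * mem[i] for w, mem in zip(weights, memory))
        for i in range(d)
    ]
    return context, weights


# Hypothetical embeddings of three past observations and the current one.
memory = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
current = [1.0, 0.0]
context, weights = attend(current, memory)
```

Because every stored observation is scored against the current one directly, an observation from far in the past can receive as much weight as a recent one, which is the property the abstract refers to as utilizing long-term dependencies.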