SWIRL: A SequentialWindowed Inverse Reinforcement Learning Algorithm for Robot Tasks With Delayed Rewards.Sanjay Krishnan,Animesh Garg,Richard Liaw,Brijen Thananjeyan,Lauren Miller,Florian T. Pokorny,Ken GoldbergWAFR(2016)引用 29|浏览32暂无评分关键词reinforcement learning,robot tasks,delayed rewards,inverseAI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要