Robust Trajectory Selection For Rearrangement Planning As A Multi-Armed Bandit Problem

Michael C. Koval,Jennifer E. King,Nancy S. Pollard,Siddhartha S. Srinivasa

2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)（2015）

引用 44|浏览88

暂无评分

摘要

We present an algorithm for generating open-loop trajectories that solve the problem of rearrangement planning under uncertainty. We frame this as a selection problem where the goal is to choose the most robust trajectory from a finite set of candidates. We generate each candidate using a kinodynamic state space planner and evaluate it using noisy rollouts.Our key insight is we can formalize the selection problem as the "best arm" variant of the multi-armed bandit problem. We use the successive rejects algorithm to efficiently allocate rollouts between candidate trajectories given a rollout budget. We show that the successive rejects algorithm identifies the best candidate using fewer rollouts than a baseline algorithm in simulation. We also show that selecting a good candidate increases the likelihood of successful execution on a real robot.

查看译文

关键词

robust trajectory selection,rearrangement planning,multiarmed bandit problem,open-loop trajectories,kinodynamic state space planner,real robot

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要