Robust Trajectory Selection For Rearrangement Planning As A Multi-Armed Bandit Problem

2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)(2015)

引用 44|浏览88
暂无评分
摘要
We present an algorithm for generating open-loop trajectories that solve the problem of rearrangement planning under uncertainty. We frame this as a selection problem where the goal is to choose the most robust trajectory from a finite set of candidates. We generate each candidate using a kinodynamic state space planner and evaluate it using noisy rollouts.Our key insight is we can formalize the selection problem as the "best arm" variant of the multi-armed bandit problem. We use the successive rejects algorithm to efficiently allocate rollouts between candidate trajectories given a rollout budget. We show that the successive rejects algorithm identifies the best candidate using fewer rollouts than a baseline algorithm in simulation. We also show that selecting a good candidate increases the likelihood of successful execution on a real robot.
更多
查看译文
关键词
robust trajectory selection,rearrangement planning,multiarmed bandit problem,open-loop trajectories,kinodynamic state space planner,real robot
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要