Simultaneous active parameter estimation and control using sampling-based Bayesian reinforcement learning

Patrick Slade, Preston Culbertson, Zachary Sunberg, Mykel Kochenderfer

2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (2017)

Abstract

Robots performing manipulation tasks must operate under uncertainty about both their pose and the dynamics of the system. To remain robust to modeling error and shifts in payload dynamics, agents must perform estimation and control simultaneously. However, the actions that are best for estimation are often not the actions that are best for the control task, so agents must trade off exploration against exploitation. This work frames the problem as a Bayes-adaptive Markov decision process and solves it online using Monte Carlo tree search (MCTS) and an extended Kalman filter to handle Gaussian process noise and parameter uncertainty in a continuous space. MCTS selects control actions to reduce model uncertainty and reach the goal state nearly optimally. Certainty-equivalent model predictive control is used as a benchmark to compare performance in simulations with varying process noise and parameter uncertainty.
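
The tension between estimation and control described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes a hypothetical one-dimensional payload whose unknown parameter is theta = 1/mass, observed through the velocity change z = theta * u * dt plus Gaussian noise. Because this toy model is linear in theta, the extended Kalman filter reduces to a scalar Kalman update. All names and numeric values below are illustrative assumptions, and the measurements are generated without noise so the result is deterministic.

```python
def kalman_parameter_update(theta_mean, theta_var, u, dt, z, meas_var):
    """One scalar Kalman update of a Gaussian belief over theta = 1/mass.

    Assumed (hypothetical) measurement model:
        z = theta * u * dt + noise,  noise ~ N(0, meas_var)
    """
    h = u * dt                           # sensitivity of the measurement to theta
    innovation = z - h * theta_mean      # measured minus predicted velocity change
    s = h * theta_var * h + meas_var     # innovation variance
    k = theta_var * h / s                # Kalman gain
    new_mean = theta_mean + k * innovation
    new_var = (1.0 - k * h) * theta_var
    return new_mean, new_var


# Illustrative values only (assumptions, not taken from the paper).
prior_mean, prior_var = 1.0, 0.25   # initial belief over theta
dt, meas_var = 0.1, 0.01            # time step and measurement noise variance
true_theta = 0.5                    # ground truth used to simulate measurements

small_u, large_u = 0.1, 5.0         # a gentle action and an aggressive action

# Noise-free simulated measurements keep the example deterministic.
z_small = true_theta * small_u * dt
z_large = true_theta * large_u * dt

mean_small, var_small = kalman_parameter_update(
    prior_mean, prior_var, small_u, dt, z_small, meas_var)
mean_large, var_large = kalman_parameter_update(
    prior_mean, prior_var, large_u, dt, z_large, meas_var)

print("gentle action:     mean=%.4f var=%.4f" % (mean_small, var_small))
print("aggressive action: mean=%.4f var=%.4f" % (mean_large, var_large))
```

In this sketch the aggressive action shrinks the parameter variance far more than the gentle one, even though a large input may be a poor choice for reaching a goal state. A Bayes-adaptive planner has to weigh that information gain against task cost, whereas a certainty-equivalent controller plans with the current mean estimate and ignores the variance.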

Keywords

manipulation tasks, model predictive control, robots, process noise, model uncertainty, parameter uncertainty, extended Kalman filter, Monte Carlo tree search, Bayes-adaptive Markov decision process, optimal estimation actions, control tasks, payload dynamics, Bayesian reinforcement learning
