Long-Run Multi-Robot Planning under Uncertain Action Durations for Persistent Tasks.

IROS(2020)

引用 7|浏览10
暂无评分
摘要
This paper presents an approach for multi-robot long-term planning under uncertainty over the duration of actions. The proposed methodology takes advantage of generalized stochastic Petri nets with rewards (GSPNR) to model multi-robot problems. A GSPNR allows for unified modeling of action selection, uncertainty on the duration of action execution, and for goal specification through the use of transition rewards and rewards per time unit. Our approach relies on the interpretation of the GSPNR model as an equivalent embedded Markov reward automaton (MRA). We then build on a state-of-the-art method to compute the long-run average reward over MRAs, extending it to enable the extraction of the optimal policy. We provide an empirical evaluation of the proposed approach on a simulated multi-robot monitoring problem, evaluating its performance and scalability. The results show that the synthesized policy outperforms a policy obtained from an infinite horizon discounted reward formulation as well as a carefully hand-crafted policy.
更多
查看译文
关键词
long-run average reward,multirobot monitoring problem,reward formulation,uncertain action durations,persistent tasks,multirobot long-term planning,unified modeling,GSPNR model,long-run multirobot planning,generalized stochastic Petri nets with rewards,Markov reward automaton
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要