Improving Value Function Approximation in Factored POMDPs by Exploiting Model Structure

Autonomous Agents and Multi-Agent Systems (2015)

Abstract
Linear value function approximation in Markov decision processes (MDPs) has been studied extensively, but there are several challenges when applying such techniques to partially observable MDPs (POMDPs). Furthermore, the system designer often has to choose a set of basis functions. We propose an automatic method to derive a suitable set of basis functions by exploiting the structure of factored models. We experimentally show that our approximation can reduce the solution size by several orders of magnitude in large problems.
Keywords
POMDP, Value Function Approximation