Bottom-Up Learning of Hierarchical Models in a Class of Deterministic Pomdp Environments

Periodicals(2015)

引用 3|浏览17
暂无评分
摘要
AbstractAbstract The theory of partially observable Markov decision processes POMDPs is a useful tool for developing various intelligent agents, and learning hierarchical POMDP models is one of the key approaches for building such agents when the environments of the agents are unknown and large. To learn hierarchical models, bottom-up learning methods in which learning takes place in a layer-by-layer manner from the lowest to the highest layer are already extensively used in some research fields such as hidden Markov models and neural networks. However, little attention has been paid to bottom-up approaches for learning POMDP models. In this paper, we present a novel bottom-up learning algorithm for hierarchical POMDP models and prove that, by using this algorithm, a perfect model i.e., a model that can perfectly predict future observations can be learned at least in a class of deterministic POMDP environments
更多
查看译文
关键词
partially observable Markov decision processes, hierarchical models, bottom-up learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要