Reversible Markov decision processes and the Gaussian free field

Systems & Control Letters(2022)

引用 0|浏览0
暂无评分
摘要
A Markov decision problem is called reversible if the stationary controlled Markov chain is reversible under every stationary Markovian strategy. A natural application in which such problems arise is in the control of Metropolis–Hastings type dynamics. We characterize all discrete time reversible Markov decision processes with finite state and action spaces. We show that the policy iteration algorithm for finding an optimal policy can be significantly simplified in Markov decision problems of this type. We also highlight the relation between the finite time evolution of the accrual of reward and the Gaussian free field associated to the controlled Markov chain.
更多
查看译文
关键词
Biconnectedness,Gaussian free field,Markov decision process,Markov chain Monte Carlo,Reversibility
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要