THE EXPONENTIAL COST OPTIMALITY FOR FINITE HORIZON SEMI-MARKOV DECISION PROCESSES

KYBERNETIKA (2022)

Abstract
This paper considers an exponential cost optimality problem for finite horizon semi-Markov decision processes (SMDPs). The objective is to find a policy that minimizes the exponential cost over the full set of policies on a finite horizon. First, under the standard regular and compact-continuity conditions, we establish the optimality equation and, using the minimum nonnegative solution approach, prove that the value function is its unique solution and that an optimal policy exists. Second, we establish a new value iteration algorithm to calculate both the value function and an ε-optimal policy. Finally, we give a computable machine maintenance system to illustrate the convergence of the algorithm.
Keywords
semi-Markov decision processes, exponential cost, finite horizon, optimality equation, optimal policy
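
To make the value iteration idea in the abstract concrete, here is a minimal sketch for a much simpler setting: a discrete-time, finite-state, finite-action MDP with a fixed number of decision epochs. It is not the paper's SMDP algorithm, which has to handle random sojourn times and a continuous time horizon. The function name, the argument layout (P, c, horizon), and the zero terminal cost are all assumptions made for illustration. The recursion used is the standard multiplicative backward induction for exponential costs, V_t(s) = min_a exp(c(s, a)) * sum_s' P(s' | s, a) * V_{t+1}(s'), with V_T = 1.

```python
import math


def exponential_cost_value_iteration(P, c, horizon):
    """Backward induction for a discrete-time exponential-cost MDP.

    Illustrative simplification only, not the SMDP algorithm of the paper.
    P[s][a] is a list of transition probabilities over next states,
    c[s][a] is the one-step cost of action a in state s (hypothetical layout).
    Returns the value at the first epoch and the decision rule per epoch.
    """
    n_states = len(P)
    # Assumed terminal condition: no cost after the horizon, so exp(0) = 1.
    V = [1.0] * n_states
    policy = []
    for _ in range(horizon):
        new_V = []
        decision = []
        for s in range(n_states):
            best_action, best_value = None, math.inf
            for a in range(len(P[s])):
                # Expected continuation value, scaled by exp(one-step cost).
                expected_next = sum(p * v for p, v in zip(P[s][a], V))
                q = math.exp(c[s][a]) * expected_next
                if q < best_value:
                    best_action, best_value = a, q
            new_V.append(best_value)
            decision.append(best_action)
        V = new_V
        # Computed backward in time, so earlier epochs go to the front.
        policy.insert(0, decision)
    return V, policy
```

With one state and two actions of cost 1 and 2, the cheaper action is chosen at every epoch and the value over three epochs is exp(3), which gives a quick sanity check of the recursion.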