Discounted deterministic Markov decision processes and discounted all-pairs shortest paths

SODA: Symposium on Discrete Algorithms(2010)

引用 33|浏览29
暂无评分
摘要
We present two new algorithms for finding optimal strategies for discounted, infinite-horizon, Deterministic Markov Decision Processes (DMDP). The first one is an adaptation of an algorithm of Young, Tarjan and Orlin for finding minimum mean weight cycles. It runs in O(mn + n2 log n) time, where n is the number of vertices (or states) and m is the number of edges (or actions). The second one is an adaptation of a classical algorithm of Karp for finding minimum mean weight cycles. It runs in O(mn) time. The first algorithm has a slightly slower worst-case complexity, but is faster than the first algorithm in many situations. Both algorithms improve on a recent O(mn2)-time algorithm of Andersson and Vorobyov. We also present a randomized Õ(m1/2n2)-time algorithm for finding Discounted All-Pairs Shortest Paths (DAPSP), improving several previous algorithms.
更多
查看译文
关键词
classical algorithm,new algorithm,randomized o,discounted all-pairs shortest paths,n2 log n,optimal strategy,fastest algorithm,minimum mean weight cycle,shortest paths,recent o,deterministic markov decision processes,determinsitc markov decision processes,time algorithm,present algorithm,discounted deterministic markov decision,all-pairs shortest path,previous algorithm,minimum mean weight cycles,markov decision processes
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要