Playing Wordle Using an Online Rollout Algorithm for Deterministic POMDPs.

2023 IEEE Conference on Games (CoG)(2023)

引用 0|浏览1
暂无评分
摘要
In this paper, we consider an important class of Partially Observable Markov Decision Processes (POMDP) with unknown parameters, which contains the Wordle puzzle as a special case. For this class of POMDP, we develop a new on-line solution method, which is based on the rollout approach. Our method relies on the use of a base heuristic policy and guarantees cost improvement over that policy. When applied to Wordle, our algorithm solves the puzzle on-line. The performance is within 0.4% of the known optimal results, and is substantially better than that of the base heuristic policies we have tested.
更多
查看译文
关键词
POMDP,Wordle,Rollout,Dynamic Programming
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要