Q-Learning Design for Discrete-Time Stochastic Zero-Sum Games

2023 China Automation Congress (CAC)(2023)

引用 0|浏览0
暂无评分
摘要
This study explores the application of the model-free Q-Learning algorithm in discrete-time linear quadratic stochastic zero-sum games. The main contribution is two-fold: firstly, extending the zero-sum game concept to stochastic situation and resolving it through a model-based approach; secondly, introducing a model-free Q-Learning algorithm as an innovative method, differing from conventional policy iteration and value iteration. Detailed mathematical demonstrations are included, validating the model-free Q-Learning algorithm's convergence within this research. A numerical example is provided to demonstrate the algorithm's effectiveness.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要