订阅小程序
旧版功能

Beyond Exact Gradients: Convergence of Stochastic Soft-Max Policy Gradient Methods with Entropy Regularization

IEEE Transactions on Automatic Control(2025)

引用 8|浏览13
关键词
Reinforcement learning,policy gradient,stochastic approximation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要