Temporal Difference Learning with Multi-Step Returns for Intelligent Optimal Control of Dynamic Systems
Neurocomputing (2025)
Keywords
Adaptive dynamic programming, Convergence, Optimal control, Reinforcement learning, Temporal difference