An integrated design for intensified direct heuristic dynamic programming

ADPRL(2013)

引用 8|浏览5
暂无评分
摘要
There has been a growing interest in the study of adaptive/approximate dynamic programming (ADP) in recent years. The ADP technique provides a powerful tool to understand and improve the principled technologies of machine intelligence system. As one of the ADP algorithms based on adaptive critic neural networks (NNs), the direct heuristic dynamic programming (direct HDP) has demonstrated some successful applications in solving realistic engineering control problems. In this study, based on a three-network architecture in which the reinforcement signal is approximated by an additional NN, a novel integrated design method for intensified direct HDP is developed. The new design approach is implemented by using multiple PID neural networks (PIDNNs), which effectively takes into account structural knowledge of system states and control that are usually present in a physical system. By using a Lyapunov stability approach, a uniformly ultimately boundedness (UUB) result is proved for our PIDNNs-based intensified direct HDP learning controller. Furthermore, the learning and control performances of the proposed design is tested using the popular cart-pole example to illustrate the key ideas of this paper.
更多
查看译文
关键词
neural network,approximate dynamic programming,lyapunov stability approach,reinforcement signal,neurocontrollers,adp technique,integrated design method,machine intelligence system,three-network architecture,multiple pid neural networks,control system analysis,pid neural network,uniformly ultimately boundedness,intensified direct heuristic dynamic programming,uub,adaptive critic neural networks,direct heuristic dynamic programming,dynamic programming,stability,pidnns-based intensified direct hdp learning controller,lyapunov methods,three-term control,algorithm design and analysis,neural networks,learning artificial intelligence,convergence
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要