Optimized Backstepping Combined With Dynamic Surface Technique for Single-Input–Single-Output Nonlinear Strict-Feedback System

IEEE Transactions on Systems, Man, and Cybernetics: Systems(2024)

引用 0|浏览0
暂无评分
摘要
In this article, for the single-input–single-output (SISO) nonlinear strict-feedback system, optimized backstepping (OB) control combined with the dynamic surface (DS) technique is developed. OB is to make every subsystem control of backstepping as the optimized one so as to ensure the entire backstepping control being optimized. However, the original design of OB still needs to repeatedly calculate the derivative of virtual controls, as a result, it will inevitably cause the problem of “differential explosion.” In order to alleviate the phenomenon, the OB control is combined with the DS technique. Furthermore, OB control needs to conduct with reinforcement learning (RL) in every backstepping step, hence simplifying the algorithm of RL is very necessary and substantive for achieving the combination. In this work, because the optimized control derives both critic and actor training laws by utilizing a simple positive function instead of the square of approximation of Hamilton–Jacobi–Bellman (HJB) equation, it can obviously simplify the RL algorithm to compare with the traditional optimizing methods. Finally, the feasibility is illustrated via both theory and simulation.
更多
查看译文
关键词
Dynamic surface (DS),neural network (NN),nonlinear strict-feedback system,optimal control,reinforcement learning (RL)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要