LQR controller design for affine LPV systems using reinforcement learning

INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE(2024)

引用 0|浏览0
暂无评分
摘要
In this paper, a data-driven sub-optimal state feedback is designed for a continuous time linear parameter varying (LPV) system using reinforcement learning. Time-varying parameters lie in a poly top and the system matrix has an affine representation for the parameters. Two novels, on-policy and off-policy algorithms, are proposed using available data from vertex systems of polytop to minimise a performance index and admit a common Lyapunov function (CLF). A convex optimisation problem is derived for each iteration based on Lyapunov inequality. Algorithms yield stabilising feedback gain in each iteration and convergence to a common lyapunov function. We demonstrate the efficacy of the proposed method by simulation of two case studies.
更多
查看译文
关键词
Linear quadratic regulator (LQR), reinforcement learning, data-driven control, LPV systems, common lLyapunov function
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要