Stability guarantees for nonlinear discrete-time systems controlled by approximate value iteration

2019 IEEE 58th Conference on Decision and Control (CDC)(2019)

引用 3|浏览17
暂无评分
摘要
Value iteration is a method to generate optimal control inputs for generic nonlinear systems and cost functions. Its implementation typically leads to approximation errors, which may have a major impact on the closed-loop system performance. We talk in this case of approximate value iteration (AVI). In this paper, we investigate the stability of systems for which the inputs are obtained by AVI. We consider deterministic discrete-time nonlinear plants and a class of general, possibly discounted, costs. We model the closed-loop system as a family of systems parameterized by tunable parameters, which are used for the approximation of the value function at different iterations, the discount factor and the iteration step at which we stop running the algorithm. It is shown, under natural stabilizability and detectability properties as well as mild conditions on the approximation errors, that the family of closed-loop systems exhibit local practical stability properties. The analysis is based on the construction of a Lyapunov function given by the sum of the approximate value function and the Lyapunov-like function that characterizes the detectability of the system. By strengthening our conditions, asymptotic and exponential stability properties are guaranteed.
更多
查看译文
关键词
exponential stability properties,stability guarantees,nonlinear discrete-time systems,approximate value iteration,optimal control inputs,generic nonlinear systems,cost functions,approximation errors,closed-loop system performance,AVI,discrete-time nonlinear plants,iteration step,local practical stability properties,approximate value function,Lyapunov-like function,asymptotic stability properties
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要