您的位置: 首页 > 全球经管学术 > 顶刊追踪 > 顶尖期刊 > 管理科学与工程 > IEEE Transactions on Automatic Control > 2022 > 10期

Convergence of Dynamic Programming on the Semidefinite Cone for Discrete-Time Infinite-Horizon LQR

成果类型：

Article

署名作者：

Lee, Donghwan

署名单位：

Korea Advanced Institute of Science & Technology (KAIST)

刊物名称：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL

ISSN/ISSBN：

0018-9286

DOI：

10.1109/TAC.2022.3181752

发表日期：

2022

页码：

5661-5668

关键词：

convergence dynamic programming linear time-invariant (LTI) system optimal control Reinforcement Learning

摘要：

The goal of this article is to investigate new and simple convergence analysis of dynamic programming for the linear-quadratic regulator problem of discrete-time linear time-invariant systems. In particular, bounds on errors are given in terms of both matrix inequalities and matrix norm. Under a mild assumption on the initial parameter, we prove that the Q-value iteration exponentially converges to the optimal solution. Moreover, a global asymptotic convergence is also presented. These results are then extended to the policy iteration. We prove that in contrast to the Q-value iteration, the policy iteration always converges exponentially fast. An example is given to illustrate the results.