Structured Output Feedback Control for Linear Quadratic Regulator Using Policy Gradient Method

成果类型:
Article
署名作者:
Takakura, Shokichi; Sato, Kazuhiro
署名单位:
University of Tokyo
刊物名称:
IEEE TRANSACTIONS ON AUTOMATIC CONTROL
ISSN/ISSBN:
0018-9286
DOI:
10.1109/TAC.2023.3264176
发表日期:
2024
页码:
363-370
关键词:
Output feedback gradient methods CONVERGENCE linear programming Symmetric matrices estimation State feedback Data-driven control Gradient descent linear quadratic regulator (LQR) model free control Nonconvex Optimization reinforcement learning (RL)
摘要:
In this article, we consider the static output feedback control for linear quadratic regulator problems with structured constraints under the assumption that system parameters are unknown. To solve the problem in the model free setting, we propose the policy gradient algorithm based on the gradient projection method and show its global convergence to epsilon-stationary points. In addition, we introduce a variance reduction technique and show both theoretically and numerically that it significantly reduces the variance in the gradient estimation. We also show in the numerical experiments that the model free approach efficiently solves the problem.