Temporal Parallelization of the HJB Equation and Continuous-Time Linear Quadratic Control
成果类型:
Article
署名作者:
Sarkka, Simo; Garcia-Fernandez, Angel F.
署名单位:
Aalto University; University of Liverpool
刊物名称:
IEEE TRANSACTIONS ON AUTOMATIC CONTROL
ISSN/ISSBN:
0018-9286
DOI:
10.1109/TAC.2024.3518309
发表日期:
2025
页码:
3755-3770
关键词:
Optimal control
mathematical models
trajectory
dynamic programming
viscosity
time complexity
Symmetric matrices
Graphics processing units
COSTS
reviews
Continuous-time control
graphics processing unit (GPU)
Hamilton-Jacobi--Bellman (HJB) equation
linear quadratic tracking
multicore
parallel computing
摘要:
This article presents a mathematical formulation to perform temporal parallelization of continuous-time optimal control problems, which can be solved via the Hamilton-Jacobi-Bellman (HJB) equation. We divide the time interval of the control problem into subintervals, and define a control problem in each subinterval, conditioned on the start and end states, leading to conditional value functions for the subintervals. By defining an associative operator as the minimization of the sum of conditional value functions, we obtain the elements and associative operators for a parallel associative scan operation. This allows for solving the optimal control problem on the whole time interval in parallel in logarithmic time complexity in the number of subintervals. We derive the HJB-type of backward and forward equations for the conditional value functions and solve them in closed form for linear quadratic problems. We also discuss numerical methods for computing the conditional value functions. The computational advantages of the proposed parallel methods are demonstrated via simulations run on a multicore central processing unit and a graphics processing unit.