您的位置: 首页 > 全球经管学术 > 顶刊追踪 > 顶尖期刊 > 概率 > The Annals of Applied Probability > 2023 > 3期

A DYNAMIC ANALYTIC METHOD FOR RISK-AWARE CONTROLLED MARTINGALE PROBLEMS

成果类型：

Article

署名作者：

Isohatala, Jukka; Haskell, William B.

署名单位：

National University of Singapore; Purdue University System; Purdue University

刊物名称：

ANNALS OF APPLIED PROBABILITY

ISSN/ISSBN：

1050-5164

DOI：

10.1214/22-AAP1794

发表日期：

2023

页码：

1661-1700

关键词：

markov decision-processes jacobi-bellman equations time-average control diffusion-processes EXISTENCE PRINCIPLE

摘要：

We present a new, tractable method for solving risk-aware problems over finite and infinite, discounted time-horizons where the dynamics of the con-trolled process are described using the martingale method. Supposing gen-eral Polish state and action spaces, and using the martingale characteriza-tion, we state a risk-aware dynamic optimal control problem of minimizing risk of costs described by a generic risk function. From this, we construct an alternative formulation of the optimization problem that takes the form of a nonlinear programming problem, constrained by the dynamic, that is, time-dependent and linear Kolmogorov forward equation describing the time -dependent distribution of the state and running costs. This formulation is sim-ilar to the convex analytic method, in that the control problem is recast into a form where the objective is optimized over distributions representing the state space visitation frequencies. However, in our approach, the distributions are dynamic and also encode the cost distribution. As our main results, we prove the equivalence of the original martingale and dynamic analytic problems, in the sense that both have the same optimal values, and that the solution of either problem yields a solution of the other. Moreover, we find an optimal control process can be taken to be Markov in the controlled process state, running costs, and time. We further show that under additional assumptions the optimal value is attained. An example numeric problem is presented and solved.

来源URL：

访问原文