您的位置: 首页 > 全球经管学术 > 顶刊追踪 > 顶尖期刊 > 管理科学与工程 > Mathematics of Operations Research > 1992 > 4期

THE OPTIMAL REWARD OPERATOR IN NEGATIVE DYNAMIC-PROGRAMMING

成果类型：

Article

署名作者：

MAITRA, A; SUDDERTH, W

刊物名称：

MATHEMATICS OF OPERATIONS RESEARCH

ISSN/ISSBN：

0364-765X

DOI：

10.1287/moor.17.4.921

发表日期：

1992

页码：

921-931

关键词：

gambling problems

摘要：

We consider the negative dynamic programming model of Strauch [12] and prove that the optimal reward function can be obtained by a transfinite iteration of the optimal reward operator. We show that a player loses nothing by restricting himself to measurable policies, if the returns from nonmeasurable policies are evaluated by lower integrals.