ON THE 2ND OPTIMALITY EQUATION FOR SEMI-MARKOV DECISION-MODELS

成果类型:
Article
署名作者:
SCHAL, M
刊物名称:
MATHEMATICS OF OPERATIONS RESEARCH
ISSN/ISSBN:
0364-765X
DOI:
10.1287/moor.17.2.470
发表日期:
1992
页码:
470-486
关键词:
unbounded rewards optimal policies average chains
摘要:
For a semi-Markov decision model with average return, the validity of the second optimality equation is shown in the (nonmodified) form where the actions run through the set of all admissible actions rather than through the set of maximum points (conserving actions) for the first optimality equation. As a consequence the existence of a strongly optimal stationary policy is shown. The results seem to be known only for finite state finite action models whereas here countable state compact action models with unbounded rewards are considered.