MAXIMAL REWARDS AND EPSILON-OPTIMAL POLICIES IN CONTINUOUS TIME MARKOV DECISION CHAINS

Publication type:
Article
Author:
LEMBERSKY, MR
Affiliation:
Oregon State University
Journal:
ANNALS OF STATISTICS
ISSN/ISBN:
0090-5364
DOI:
10.1214/aos/1176342621
Publication date:
1974
Pages:
159-169
Keywords: