Optimization of the transient and steady-state behavior of discrete event systems
成果类型:
Article
署名作者:
Andradottir, S
署名单位:
University of Wisconsin System; University of Wisconsin Madison
刊物名称:
MANAGEMENT SCIENCE
ISSN/ISSBN:
0025-1909
DOI:
10.1287/mnsc.42.5.717
发表日期:
1996
页码:
717-737
关键词:
stochastic optimization
simulation
stochastic approximation
gradient estimation
Robbins-Monro algorithm
regenerative method
likelihood ratio method
Markov chains
Harris recurrence
摘要:
We present a general framework for applying simulation to optimize the behavior of discrete event systems. Our approach involves modeling the discrete event system under study as a general state space Markov chain whose distribution depends on the decision parameters. We then show how simulation and the likelihood ratio method can be used to evaluate the performance measure of interest and its gradient, and we present conditions that guarantee that the Robbins-Monro stochastic approximation algorithm will converge almost surely to the optimal values of the decision parameters. Both transient and steady-state performance measures are considered. For steady-state performance measures, we consider both the case when the Markov chain of interest is regenerative in the standard sense, as well as the case when this Markov chain is Harris recurrent, and thereby regenerative in a wider sense.
来源URL: