Convergence rate of linear two-time-scale stochastic approximation

成果类型:
Article
署名作者:
Konda, VR; Tsitsiklis, JN
署名单位:
Massachusetts Institute of Technology (MIT)
刊物名称:
ANNALS OF APPLIED PROBABILITY
ISSN/ISSBN:
1050-5164
DOI:
10.1214/105051604000000116
发表日期:
2004
页码:
796-819
关键词:
摘要:
We study the rate of convergence of linear two-time-scale stochastic approximation methods. We consider two-time-scale linear iterations driven by i.i.d. noise, prove some results on their asymptotic covariance and establish asymptotic normality. The well-known result [Polyak, B. T. (1990). Automat. Remote Contr 51 937-946; Ruppert, D. (1988). Technical Report 781, Cornell Univ.] on the optimality of Polyak-Ruppert averaging techniques specialized to linear stochastic approximation is established as a consequence of the general results in this paper.