您的位置: 首页 > 全球经管学术 > 顶刊追踪 > 顶尖期刊 > 管理科学与工程 > IEEE Transactions on Automatic Control > 2023 > 8期

Nonlinear Two-Time-Scale Stochastic Approximation: Convergence and Finite-Time Performance

成果类型：

Article

署名作者：

Doan, Thinh T. T.

署名单位：

Virginia Polytechnic Institute & State University

刊物名称：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL

ISSN/ISSBN：

0018-9286

DOI：

10.1109/TAC.2022.3210147

发表日期：

2023

页码：

4695-4705

关键词：

Reinforcement learning stochastic approximation two-time-scale stochastic approximation

摘要：

Two-time-scale stochastic approximation, a generalized version of the popular stochastic approximation, has found broad applications in many areas including stochastic control, optimization, and machine learning. Despite its popularity, theoretical guarantees of this method, especially its finite-time performance, are mostly achieved for the linear case while the results for the nonlinear counterpart are very sparse. Motivated by the classic control theory for singularly perturbed systems, we study in this article the asymptotic convergence and finite-time analysis of the nonlinear two-time-scale stochastic approximation. Under some fairly standard assumptions, we provide a formula that explicitly characterizes the rate of convergence of the main iterates to the desired solutions. In particular, we show that the mean square error generated by the method convergences to zero at a rate O(1/k(2/3)), where k is the number of iterations. The key idea in our analysis is to properly choose the two step sizes to characterize the coupling between the fast and slow time-scale iterates.