您的位置: 首页 > 全球经管学术 > 顶刊追踪 > 顶尖期刊 > 管理科学与工程 > IEEE Transactions on Automatic Control > 2022 > 6期

Decentralized Online Convex Optimization With Feedback Delays

成果类型：

Article

署名作者：

Cao, Xuanyu; Basar, Tamer

署名单位：

University of Illinois System; University of Illinois Urbana-Champaign

刊物名称：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL

ISSN/ISSBN：

0018-9286

DOI：

10.1109/TAC.2021.3092562

发表日期：

2022

页码：

2889-2904

关键词：

Delays optimization Heuristic algorithms Benchmark testing Convex functions sensors measurement Bandit feedback decentralized optimization feedback delays full information online convex optimization (OCO)

摘要：

In online decision making, feedback delays often arise due to the latency caused by computation and communication in practical systems. In this article, we study decentralized online convex optimization over a multiagent network in the presence of feedback delays. Each agent is associated with a time-varying local loss function, which is revealed to the agent sequentially with delays. The goal of every agent is to minimize the accumulated total loss function (the sum of the local loss functions of all agents) by choosing appropriate local actions. We first develop an online decentralized gradient descent algorithm for the delayed full information case, in which the delayed local loss function is fully disclosed to each agent. The (static) regret of each agent is upper bounded in terms of the delays for both convex and strongly convex loss functions. We show that, as long as the average delay among all agents is sublinear, so is the regret of every agent. Further, we extend the algorithm and analysis to the scenario of bandit feedback, where only the values of the delayed loss functions at two random points close to the local action are revealed. We demonstrate that the two-point bandit feedback does not entail regret degradation in order sense compared to the full information scenario. Finally, numerical results are presented to corroborate the efficacy of the proposed algorithms.