您的位置: 首页 > 全球经管学术 > 顶刊追踪 > 顶尖期刊 > 管理科学与工程 > IEEE Transactions on Automatic Control > 2021 > 1期

On Passivity, Reinforcement Learning, and Higher Order Learning in Multiagent Finite Games

成果类型：

Article

署名作者：

Gao, Bolin; Pavel, Lacra

署名单位：

University of Toronto

刊物名称：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL

ISSN/ISSBN：

0018-9286

DOI：

10.1109/TAC.2020.2978037

发表日期：

2021

页码：

121-136

关键词：

games CONVERGENCE Learning (artificial intelligence) Heuristic algorithms sociology statistics Radio frequency Agents and autonomous systems game theory Nonlinear systems Passivity Reinforcement Learning

摘要：

In this article, we propose a passivity-based methodology for the analysis and design of reinforcement learning dynamics and algorithms in multiagent finite games. Starting from a known, first-order reinforcement learning scheme, we show that convergence to a Nash distribution can be attained in a broader class of games than previously considered in the literature-namely, in games characterized by the monotonicity property of their (negative) payoff vectors. We further exploit passivity techniques to design a class of higher order learning schemes that preserve the convergence properties of their first-order counterparts. Moreover, we show that the higher order schemes improve upon the rate of convergence and can even achieve convergence where the first-order scheme fails. We demonstrate these properties through numerical simulations for several representative games.