Reinforcement learning in population games

成果类型:
Article
署名作者:
Lahkar, Ratul; Seymour, Robert M.
署名单位:
KREA University; IFMR - Graduate School of Business (GSB); University of London; University College London
刊物名称:
GAMES AND ECONOMIC BEHAVIOR
ISSN/ISSBN:
0899-8256
DOI:
10.1016/j.geb.2013.02.006
发表日期:
2013
页码:
10-38
关键词:
Reinforcement learning Continuity equation Replicator dynamics
摘要:
We study reinforcement learning in a population game. Agents in a population game revise mixed strategies using the Cross rule of reinforcement learning. The population state the probability distribution over the set of mixed strategies evolves according to the replicator continuity equation which, in its simplest form, is a partial differential equation. The replicator dynamic is a special case in which the initial population state is homogeneous, i.e. when all agents use the same mixed strategy. We apply the continuity dynamic to various classes of symmetric games. Using 3 x 3 coordination games, we show that equilibrium selection depends on the variance of the initial strategy distribution, or initial population heterogeneity. We give an example of a 2 x 2 game in which heterogeneity persists even as the mean population state converges to a mixed equilibrium. Finally, we apply the dynamic to negative definite and doubly symmetric games. (C) 2013 Elsevier Inc. All rights reserved.