您的位置: 首页 > 全球经管学术 > 顶刊追踪 > 顶尖期刊 > 经济学 > Games and Economic Behavior > 2008 > 1期

Regret minimization in repeated matrix games with variable stage duration

成果类型：

Article

署名作者：

Mannor, Shie; Shinikin, Nahum

署名单位：

Technion Israel Institute of Technology; McGill University

刊物名称：

GAMES AND ECONOMIC BEHAVIOR

ISSN/ISSBN：

0899-8256

DOI：

10.1016/j.geb.2007.07.006

发表日期：

2008

页码：

227-258

关键词：

no-regret strategies Regret minimization Hannan consistency best-response envelope repeated matrix games variable duration games approachability calibrated play

摘要：

Regret minimization in repeated matrix games has been extensively studied ever since Hannan's seminal paper [Hannan, J., 1957. Approximation to Bayes risk in repeated play. In: Dresher, M., Tucker, AW., Wolfe, P. (Eds.), Contributions to the Theory of Games, vol. III. Ann. of Math. Stud., vol. 39, Princeton Univ. Press, Princeton, NJ, pp. 97-193]. Several classes of no-regret strategies now exist; such strategies secure a long-term average payoff as high as could be obtained by the fixed action that is best, in hindsight, against the observed action sequence of the opponent. We consider an extension of this framework to repeated games with variable stage duration, where the duration of each stage may depend on actions of both players, and the performance measure of interest is the average payoff per unit time. We start by showing that no-regret strategies, in the above sense, do not exist in general. Consequently, we consider two classes of adaptive strategies, one based on Blackwell's approachability theorem and the other on calibrated play, and examine their performance guarantees. We. further provide sufficient conditions for existence of no-regret strategies in this model. (c) 2007 Elsevier Inc. All rights reserved.

来源URL：

访问原文