|
ABSTRACT
We study a general sub-class of concave games which we call socially concave games. We show that if each player follows any no-external regret minimization procedure then the dynamics will converge in the sense that both the average action vector will converge to a Nash equilibrium and that the utility of each player will converge to her utility in that Nash equilibrium. We show that many natural games are indeed socially concave games. Specifically, we show that linear Cournot competition and linear resource allocation games are socially-concave games, and therefore our convergence result applies to them. In addition, we show that a simple best response dynamics might diverge for linear resource allocation games, and is known to diverge for linear Cournot competition. For the TCP congestion games we show that "near" the equilibrium the games are socially-concave, and using our general methodology we show the convergence of a specific regret minimization dynamics.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
|
 |
3
|
Avrim Blum , MohammadTaghi Hajiaghayi , Katrina Ligett , Aaron Roth, Regret minimization and the price of total anarchy, Proceedings of the 40th annual ACM symposium on Theory of computing, May 17-20, 2008, Victoria, British Columbia, Canada
[doi> 10.1145/1374376.1374430]
|
| |
4
|
|
| |
5
|
|
| |
6
|
|
| |
7
|
A. A. Cournot. Recherches sur les principes mathÈmatiques de la theorie des richesses.(english translation: Researches into the mathematical principles of the theory of wealth). 1838.
|
| |
8
|
|
| |
9
|
D. Foster and S. M. Kakade. Deterministic calibration and nash equilibrium. In COLT, pages 33--48, 2004.
|
| |
10
|
D. Foster and R. Vohra. Regret in the on-line decision problem. Games and Economic Behavior, 21:40--55, 1997.
|
| |
11
|
D. Foster and H. P. Young. Learning, hypothesis testing, and Nash equilibrium. Games and Economic Behavior, 45:73--96, 2003.
|
| |
12
|
D. Foster and H. P. Young. Regret testing: Learning to play Nash equilibrium without knowing you have an opponent. Theoretical Economics, 1:341--367, 2006.
|
| |
13
|
Y. Freund and R. Schapire. Adaptive game playing using multiplicative weights. Games and Economic Behavior, 29:79--103, 1999.
|
| |
14
|
Drew Fudenberg and David K. Levine. The theory of learning in games. MIT press, 1998.
|
| |
15
|
F. Germano and G. Lugosi. Global Nash convergence of Foster and Young's regret testing. Games and Economic Behavior, 60:135--154, 2007.
|
| |
16
|
B. Hajek and G. Gopalakrishnan. Do greedy autonomous systems make for a sensible internet?, 2002. presented at the Conference on Stochastic Networks, Stanford University.
|
 |
17
|
|
| |
18
|
S. Hart and A. Mas-Colell. A simple adaptive procedure leading to correlated equilibrium. Econometrica, 68:1127--1150, 2000.
|
| |
19
|
S. Hart and A. Mas-Colell. Stochastic uncoupled dynamics and Nash equilibrium. Games and Economice Behavior, 57(2):286--303, 2006.
|
| |
20
|
|
| |
21
|
|
| |
22
|
|
| |
23
|
|
| |
24
|
F. P. Kelly. Charging and rate control for elastic traffic. European Transactions on Telecommunications, page 33ñ37, 1997.
|
| |
25
|
A. Mas-Colell, J. Green, and M. D. Whinston. Microeconomic Theory. Oxford University Press, 1995.
|
| |
26
|
J. B. Rosen. Existence and uniqueness of equilibrium points for concave n-person games. Econometrica, 33(3):520--534, 1965.
|
| |
27
|
R. D. Theocharis. On the stability of the cournot solution on the oligopoly problem. The Review of Economic Studies., 27(2):133--134, 1960.
|
| |
28
|
H. P. Young. Strategic Learning and Its Limits. Oxford University Press, 2004.
|
| |
29
|
M. Zinkevich. Online convex programming and generalized infinitesimal gradient ascent. In ICML, pages 928--936, 2003.
|
| |
30
|
M. Zinkevich. Theoretical guarantees for algorithms in multi-agent settings. Technical Report CMU-CS-04-161,, Carnegie Mellon University, 2004.
|
|