| Synthesis of strategies from interaction traces |
| Full text |
Pdf
(715 KB)
|
Source
|
International Conference on Autonomous Agents
archive
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2
table of contents
Estoril, Portugal
SESSION: Economic paradigms
table of contents
Pages 855-862
Year of Publication: 2008
ISBN:978-0-9817381-1-6
|
|
Authors
|
|
Tsz-Chiu Au
|
University of Maryland, College Park, MD
|
|
Sarit Kraus
|
Bar-Ilan University, Ramat Gan, Israel
|
|
Dana Nau
|
University of Maryland, College Park, MD
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 5, Downloads (12 Months): 34, Citation Count: 0
|
|
|
ABSTRACT
We describe how to take a set of interaction traces produced by different pairs of players in a two-player repeated game, and combine them into a composite strategy. We provide an algorithm that, in polynomial time, can generate the best such composite strategy. We describe how to incorporate the composite strategy into an existing agent, as an enhancement of the agent's original strategy. We provide experimental results using interaction traces from 126 agents (most of them written by students as class projects) for the Iterated Prisoner's Dilemma, Iterated Chicken Game, and Iterated Battle of the Sexes. We compared each agent with the enhanced version of that agent produced by our algorithm. The enhancements improved the agents' scores by about 5% in the IPD, 11% in the ICG, and 26% in the IBS, and improved their rank by about 12% in the IPD, 38% in the ICG, and 33% in the IBS.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
R. Axelrod. The Evolution of Cooperation. Basic Books, 1984.
|
| |
2
|
|
| |
3
|
D. Billings, N. Burch, A. Davidson, R. Holte, and J. Schaeffer. Approximating game-theoretic optimal strategies for full-scale poker. In IJCAI, pages 661--668, 2003.
|
| |
4
|
|
| |
5
|
M. Deutsch. The Resolution of Conflict: Constructive and Destructive Processes. Yale University Press, 1973.
|
| |
6
|
C. Drummond. Accelerating reinforcement learning by composing solutions of automatically identified subtasks. JAIR, 16:59--104, 2002.
|
| |
7
|
D. Egnor. Iocaine powder explained. ICGA Journal, 23(1):33--35, 2000.
|
| |
8
|
G. Kendall, X. Yao, and S. Y. Chong. The Iterated Prisoner's Dilemma: 20 Years On. World Scientific, 2007.
|
| |
9
|
|
| |
10
|
J. Li. How to design a strategy to win an IPD tournament. In G. Kendall, X. Yao, and S. Y. Chong, editors, The Iterated Prisoner's Dilemma: 20 Years On, pages 89--104. World Scientific, 2007.
|
| |
11
|
R. D. Luce and H. Raiffa. Games and Decisions: Introduction and Critical Survey. Wiley, 1957.
|
| |
12
|
|
| |
13
|
A. Parker, D. Nau, and V. Subrahmanian. Overconfidence or paranoia? search in imperfect-information games. In AAAI, July 2006.
|
| |
14
|
|
| |
15
|
|
| |
16
|
|
|