ACM Home Page
Please provide us with feedback. Feedback
Theory of moves learners: towards non-myopic equilibria
Full text PdfPdf (354 KB)
Source International Conference on Autonomous Agents archive
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems table of contents
The Netherlands
SESSION: Papers: learning table of contents
Pages: 74 - 80  
Year of Publication: 2005
ISBN:1-59593-093-0
Authors
Arjita Ghosh  University of Tulsa
Sandip Sen  University of Tulsa
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 23,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1082473.1082485
What is a DOI?

ABSTRACT

In contrast to classical game theoretic analysis of simultaneous and sequential play in bimatrix games, Steven Brams has proposed an alternative framework called the Theory of Moves (TOM) where players can choose their initial actions and then, in alternating turns, decide to shift or not from its current action. A backward induction process is used to determine a non-myopic action and equilibrium is reached when an agent, on its turn to move, decides to not change its current action. Brams claims that the TOM framework captures the dynamics of a wide range of real-life non-cooperative negotiations ranging over political, historical, and religious disputes. We believe that his analysis is weakened by the assumption that a player has perfect knowledge of the opponent's payoff. We present a learning approach by which TOM players can learn to converge to Non-Myopic Equilibria (NME) without prior knowledge of its opponent's preferences and by inducing them from past choices made by the opponent. We present experimental results from all structurally distinct 2-by-2 games without a common preferred outcome showing convergence of our proposed learning player to NMEs. We also discuss the relation between equilibriums in sequential games and NMEs of TOM.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
B. Banerjee, S. Sen, and J. Peng. Fast concurrent reinforcement learners. In Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, pages 825--830, 2001.
 
2
 
3
S. J. Brams. Theory of Moves. Cambridge University Press, Cambridge: UK, 1994.
 
4
 
5
V. Conitzer and T. Sandholm. AWESOME: A general multiagent learning algorithm that converges in self-play. In Twentieth International Conference on Machine Learning, pages 83--90, San Francisco, CA, 2003: Morgan Kaufmann.
 
6
 
7
 
8
M. L. Littman. Markov games as a framework for multi-agent reinforcement learning. In Proceedings of the Eleventh International Conference on Machine Learning, pages 157--163, San Mateo, CA, 1994. Morgan Kaufmann.
 
9
 
10
 
11
R. B. Myerson. Game Theory: Analysis of Conflict. Harvard University Press, 1991.
 
12
J. F. Nash. Non-cooperative games. Annals of Mathematics, 54:286 -- 295, 1951.
 
13
A. Rapoport and M. Guyer. A taxonomy of 2x2 games. General Systems, 11:203--214, 1966.


Collaborative Colleagues:
Arjita Ghosh: colleagues
Sandip Sen: colleagues