ACM Home Page
Please provide us with feedback. Feedback
Learning to cooperate in a continuous tragedy of the commons
Full text PdfPdf (235 KB)
Source
International Conference on Autonomous Agents archive
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2 table of contents
Budapest, Hungary
SESSION: Interactions table of contents
Pages 1185-1186  
Year of Publication: 2009
ISBN:978-0-9817381-7-8
Authors
Steven de Jong  Maastricht Univ., Maastricht, Netherlands
Karl Tuyls  Eindhoven University of Technology, Eindhoven, Netherlands
Sponsors
: The Foundation for Intelligent Physical Agents
Microsoft Research : Microsoft Research
: Whitestein Technologies
: European Office of Aerospace Research and Development, Air Force Office of Scientific Research, United States Air Force Research Laboratory
: Drexel University
: Wiley -- Blackwell Ltd
Publisher
Bibliometrics
Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

In previous work, we discussed that social dilemmas are often present in multi-agent systems [3]. Social dilemmas are problems in which we can only find a good solution if we consider the benefit of others in addition to our own benefit. Altruistic punishment has been identified as an important mechanism to enforce this consideration. However, as the punishment is altruistic, deciding whether to punish essentially entails a second-order social dilemma. We developed a methodology that allowed individually learning agents to reach satisfactory solutions in a social dilemma with a continuous strategy space, called the Ultimatum Game [2]. We extended this methodology to thousands of agents, using social networks [4]. Moreover, we devoted attention to the tragedy of the commons, a social dilemma typically exemplified by the Public Goods Game (PGG) [1]. In this game, which is played repeatedly, every agent i (out of n) has to decide on an investment μi ε [0, C]. The summed investment is multiplied by a factor 1 < r < n, and equally distributed over all agents. Agent i's individual benefit (or reward) is maximized by μi = 0, whereas the group gains the most by collectively playing μi = C. Altruistic punishment (i.e., reducing an other agent's reward by an amount e, with a cost c < e to the punisher) allows agents to force others to invest a higher amount, but performing such punishment is clearly not individually rational. In earlier work, we restricted ourselves to a small number of strategies and/or agents in this game [1].


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
S. de Jong and K. Tuyls. Learning to cooperate in public-goods interactions. 2008. Presented at the EUMAS'08 Workshop, Bath, UK, December 18--19.
 
2
 
3
 
4
S. de Jong, S. Uyttendaele, and K. Tuyls. Learning to Reach Agreement in a Continuous Ultimatum Game. Journal of Artificial Intelligence Research, 33:551--574, 2008.
 
5
E. Fehr and K. Schmidt. A Theory of Fairness, Competition and Cooperation. Quart. J. of Economics, 114:817--868, 1999.
 
6
F. C. Santos, J. M. Pacheco, and T. Lenaerts. Cooperation Prevails When Individuals Adjust Their Social Ties. PLoS Comput. Biol., 2(10): 1284--1291, 2006.
 
7

Collaborative Colleagues:
Steven de Jong: colleagues
Karl Tuyls: colleagues