ACM Home Page
Please provide us with feedback. Feedback
Learning from actions not taken: a multiagent learning algorithm
Full text PdfPdf (171 KB)
Source
International Conference on Autonomous Agents archive
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2 table of contents
Budapest, Hungary
SESSION: Interactions table of contents
Pages 1277-1278  
Year of Publication: 2009
ISBN:978-0-9817381-7-8
Authors
Newsha Khani  Oregon State University
Kagan Tumer  Oregon State University
Sponsors
: The Foundation for Intelligent Physical Agents
Microsoft Research : Microsoft Research
: Whitestein Technologies
: European Office of Aerospace Research and Development, Air Force Office of Scientific Research, United States Air Force Research Laboratory
: Drexel University
: Wiley -- Blackwell Ltd
Publisher
Bibliometrics
Downloads (6 Weeks): 17,   Downloads (12 Months): 26,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

Learning in multiagent systems is generally slow because the agent has to extract its correct policy through not only through its interaction with the environment, but also from its interactions with other learning agents. In this paper, we present an approach that significantly improves the learning speed in multiagent systems by allowing an agent to up-date its estimate of the rewards for all its available actions, not just the action that was taken. Our results show that the rewards on such "actions not taken" are beneficial early in training, particularly when agent teams are leveraged to estimate those rewards.



Collaborative Colleagues:
Newsha Khani: colleagues
Kagan Tumer: colleagues