ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Integrating organizational control into multi-agent learning
Full text PdfPdf (557 KB)
Source
International Conference on Autonomous Agents archive
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2 table of contents
Budapest, Hungary
SESSION: Multi-agent learning table of contents
Pages: 757-764  
Year of Publication: 2009
ISBN:978-0-9817381-7-8
Authors
Chongjie Zhang  University of Massachusetts, Amherst, MA
Sherief Abdallah  British University in Dubai, Dubai, United Arab Emirates
Victor Lesser  University of Massachusetts, Amherst, MA
Sponsors
: The Foundation for Intelligent Physical Agents
Microsoft Research : Microsoft Research
: Whitestein Technologies
: European Office of Aerospace Research and Development, Air Force Office of Scientific Research, United States Air Force Research Laboratory
: Drexel University
: Wiley -- Blackwell Ltd
Publisher
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 53,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

Multi-Agent Reinforcement Learning (MARL) algorithms suffer from slow convergence and even divergence, especially in large-scale systems. In this work, we develop an organization-based control framework to speed up the convergence of MARL algorithms in a network of agents. Our framework defines a multi-level organizational structure for automated supervision and a communication protocol for exchanging information between lower-level agents and higher-level supervising agents. The abstracted states of lower-level agents travel upwards so that higher-level supervising agents generate a broader view of the state of the network. This broader view is used in creating supervisory information which is passed down the hierarchy. The supervisory policy adaptation then integrates supervisory information into existing MARL algorithms, guiding agents' exploration of their state-action space. The generality of our framework is verified by its applications on different domains (distributed task allocation and network routing) with different MARL algorithms. Experimental results show that our framework improves both the speed and likelihood of MARL convergence.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
R. A. C. Bianchi, C. H. C. Ribeiro, and A. H. R. Costa. Heuristic selection of actions in multiagent reinforcement learning. In IJCAI'07, Hyderabad, India, 2007.
 
4
J. A. Boyan and M. L. Littman. Packet routing in dynamically changing networks: A reinforcement learning approach. In NIPS'94, volume 6, pages 671--678, 1994.
5
 
6
 
7
L. Peshkin and V. Savova. Reinforcement learning for adaptive routing. In International Joint Conference on Neural Networks (IJCNN), 2002.
 
8
M. T. Rosenstein and A. G. Barto. Supervised actor-critic reinforcement learning. In J. Si, A. Barto, W. Powell, and D. Wunsch, editors, Learning and Approximate Dynamic Programming: Scaling Up to the Real World, pages 359--380. John Wiley and Sons, 2004.
 
9
H. A. Simon. Nearly-decomposable systems. In The Sciences of the Artificial, pages 99--103, 1969.
 
10
11
 
12
P. Tangamchit, J. Dolan, and P. Khosla. Learning-based task allocation in decentralized multirobot systems. In DARS'00, pages 381--390, 2000.
 
13
 
14
C. Zhang, V. Lesser, and S. Abdallah. Self-organization for dynamically supervising distributed learning. In University of Massachusetts Amherst Computer Science Technical Report UM-CS-2009-007, 2009.
15
 
16
M. Zinkevich. Online convex programming and generalized infinitesimal gradient ascent. In ICML'03, pages 928--936, 2003.

Collaborative Colleagues:
Chongjie Zhang: colleagues
Sherief Abdallah: colleagues
Victor Lesser: colleagues