ACM Home Page
Please provide us with feedback. Feedback
Solving multiagent assignment Markov decision processes
Full text PdfPdf (319 KB)
Source
International Conference on Autonomous Agents archive
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1 table of contents
Budapest, Hungary
SESSION: Coordination/DCOP/resource allocation table of contents
Pages 681-688  
Year of Publication: 2009
ISBN:978-0-9817381-6-1
Authors
Scott Proper  Oregon State University, Corvallis, OR
Prasad Tadepalli  Oregon State University, Corvallis, OR
Sponsors
: The Foundation for Intelligent Physical Agents
Microsoft Research : Microsoft Research
: Wiley - Blackwell Ltd
: Whitestein Technologies
: European Office of Aerospace Research and Development, Air Force Office of Scientific Research, United States Air Force Research Laboratory
: Drexel University
Publisher
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 33,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

We consider the setting of multiple collaborative agents trying to complete a set of tasks as assigned by a centralized controller. We propose a scalable method called "Assignment-based decomposition" which is based on decomposing the problem of action selection into an upper assignment level and a lower task execution level. The assignment problem is solved by search, while the task execution is solved through coordinated reinforcement learning. We show that this decomposition of the overall problem into two levels scales well and outperforms the state-of-the-art approaches including pure assignment-level search or pure coordinated reinforcement learning. We also show how this approach enables transfer learning from domains with few agents to domains with many agents.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
R. Becker, S. Zilberstein, and C. V. Goldman. Solving transition independent decentralized markov decision processes. JAIR, 22:423--455, 2004.
 
2
D. Dolgov and E. Durfee. Optimal resource allocation and policy formulation in loosely-coupled markov decision processes. In AAMAS '04, pages 315--324, June 2004.
 
3
C. V. Goldman and S. Zilberstein. Decentralized control of control of cooperative systems: Categorization and complexity analysis. JAIR, 22:143--174, 2004.
 
4
C. Guestrin, D. Koller, and R. Parr. Multiagent planning with factored MDPs. In T. G. Dietterich, S. Becker, and Z. Ghahramani, editors, NIPS '01, pages 1523--1530. MIT Press, 2001.
 
5
 
6
7
 
8
9
 
10
 
11
 
12
S. Proper and P. Tadepalli. Scaling model-based average-reward reinforcement learning for product delivery. In ECML '06, pages 735--742, 2006.
 
13
 
14
 
15
16

Collaborative Colleagues:
Scott Proper: colleagues
Prasad Tadepalli: colleagues