| Solving multiagent assignment Markov decision processes |
| Full text |
Pdf
(319 KB)
|
Source
|
International Conference on Autonomous Agents
archive
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
table of contents
Budapest, Hungary
SESSION: Coordination/DCOP/resource allocation
table of contents
Pages 681-688
Year of Publication: 2009
ISBN:978-0-9817381-6-1
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 11, Downloads (12 Months): 33, Citation Count: 0
|
|
|
ABSTRACT
We consider the setting of multiple collaborative agents trying to complete a set of tasks as assigned by a centralized controller. We propose a scalable method called "Assignment-based decomposition" which is based on decomposing the problem of action selection into an upper assignment level and a lower task execution level. The assignment problem is solved by search, while the task execution is solved through coordinated reinforcement learning. We show that this decomposition of the overall problem into two levels scales well and outperforms the state-of-the-art approaches including pure assignment-level search or pure coordinated reinforcement learning. We also show how this approach enables transfer learning from domains with few agents to domains with many agents.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
R. Becker, S. Zilberstein, and C. V. Goldman. Solving transition independent decentralized markov decision processes. JAIR, 22:423--455, 2004.
|
| |
2
|
D. Dolgov and E. Durfee. Optimal resource allocation and policy formulation in loosely-coupled markov decision processes. In AAMAS '04, pages 315--324, June 2004.
|
| |
3
|
C. V. Goldman and S. Zilberstein. Decentralized control of control of cooperative systems: Categorization and complexity analysis. JAIR, 22:143--174, 2004.
|
| |
4
|
C. Guestrin, D. Koller, and R. Parr. Multiagent planning with factored MDPs. In T. G. Dietterich, S. Becker, and Z. Ghahramani, editors, NIPS '01, pages 1523--1530. MIT Press, 2001.
|
| |
5
|
|
| |
6
|
Carlos Guestrin , Shobha Venkataraman , Daphne Koller, Context-specific multiagent coordination and planning with factored MDPs, Eighteenth national conference on Artificial intelligence, p.253-259, July 28-August 01, 2002, Edmonton, Alberta, Canada
|
 |
7
|
|
| |
8
|
|
 |
9
|
|
| |
10
|
Nicolas Meuleau , Milos Hauskrecht , Kee-Eung Kim , Leonid Peshkin , Leslie Pack Kaelbling , Thomas Dean , Craig Boutilier, Solving very large weakly coupled Markov decision processes, Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence, p.165-172, July 1998, Madison, Wisconsin, United States
|
| |
11
|
|
| |
12
|
S. Proper and P. Tadepalli. Scaling model-based average-reward reinforcement learning for product delivery. In ECML '06, pages 735--742, 2006.
|
| |
13
|
|
| |
14
|
|
| |
15
|
|
 |
16
|
|
|