ACM Home Page
Please provide us with feedback. Feedback
Planning with continuous resources for agent teams
Full text PdfPdf (482 KB)
Source
International Conference on Autonomous Agents archive
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2 table of contents
Budapest, Hungary
SESSION: Agent reasoning/deliberation/decision mechanisms table of contents
Pages 1089-1096  
Year of Publication: 2009
ISBN:978-0-9817381-7-8
Authors
Janusz Marecki  IBM T.J. Watson Research Center, Yorktown Heights, NY
Milind Tambe  University of Southern California, Los Angeles, CA
Sponsors
: The Foundation for Intelligent Physical Agents
Microsoft Research : Microsoft Research
: Whitestein Technologies
: European Office of Aerospace Research and Development, Air Force Office of Scientific Research, United States Air Force Research Laboratory
: Drexel University
: Wiley -- Blackwell Ltd
Publisher
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 32,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

Many problems of multiagent planning under uncertainty require distributed reasoning with continuous resources and resource limits. Decentralized Markov Decision Problems (Dec-MDPs) are well-suited to address such problems, but unfortunately, prior Dec-MDP approaches either discretize resources at the expense of speed and quality guarantees, or avoid discretization only by limiting agents' action choices or interactions (e.g. assumption of transition independence). To address these shortcomings, this paper proposes M-DPFP, a novel algorithm for planning with continuous resources for agent teams, with three key features: (i) it maintains the agent team interaction graph to identify and prune the suboptimal policies and to allow the agents to be transition dependent, (ii) it operates in a continuous space of probability functions to provide the error bound on the solution quality and finally (iii) it focuses the search for policies on the most relevant parts of this search space to allow for a systematic trade-off of solution quality for speed. Our experiments show that M-DPFP finds high quality solutions and exhibits superior performance when compared with a discretization-based approach. We also show that M-DPFP is applicable to solving problems that are beyond the scope of existing approaches.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
R. Becker, S. Zilberstein, V. Lesser, and C. V. Goldman. Transition-Independent Dec-MDPs. In AAMAS, 2003.
 
3
E. Benazera. Solving decentralized continuous Markov decision problems with structured reward. In KI, 2007.
 
4
 
5
 
6
K. Decker and V. Lesser. Designing a Family of Coordination Algorithms. ICMAS-95, January 1995.
 
7
 
8
L. Li and M. Littman. Lazy approximation for solving continuous finite-horizon MDPs. In AAAI, 2005.
 
9
J. Marecki, S. Koenig, and M. Tambe. A fast analytical algorithm for solving MDPs with real-valued resources. In IJCAI, 2007.
10
 
11
J. Marecki and M. Tambe. Towards faster planning with continuous resources in stochastic domains. In AAAI, 2008.
 
12
Mausam, E. Benazera, R. I. Brafman, N. Meuleau, and E. A. Hansen. Planning with continuous resources in stochastic domains. In IJCAI, 2005.
 
13
D. Musliner, E. Durfee, J. Wu, D. Dolgov, R. Goldman, and M. Boddy. Coordinated plan management using multiagent MDPs. In AAAI Spring Symposium, 2006.
 
14
R. Nair, M. Tambe, M. Yokoo, D. Pynadath, and S. Marsella. Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings. In IJCAI, 2003.
 
15
R. Nair, P. Varakantham, M. Tambe, and M. Yokoo. Networked distributed POMDPs: A synergy of distributed constraint optimization and POMDPs. In IJCAI, 2005.
 
16
17
 
18

Collaborative Colleagues:
Janusz Marecki: colleagues
Milind Tambe: colleagues