ACM Home Page
Please provide us with feedback. Feedback
A heuristic approach for solving decentralized-POMDP: assessment on the pursuit problem
Full text PdfPdf (496 KB)
Source Symposium on Applied Computing archive
Proceedings of the 2002 ACM symposium on Applied computing table of contents
Madrid, Spain
SESSION: Agents, interactions, mobility and systems table of contents
Pages: 57 - 62  
Year of Publication: 2002
ISBN:1-58113-445-2
Authors
Iadine Chades  MAIA Team, LORIA, B.P. 239 -54506, Vandoeuvre-Les-Nancy, France
Bruno Scherrer  MAIA Team, LORIA, B.P. 239 -54506, Vandoeuvre-Les-Nancy, France
François Charpillet  MAIA Team, LORIA, B.P. 239 -54506, Vandoeuvre-Les-Nancy, France
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 35,   Citation Count: 7
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/508791.508804
What is a DOI?

ABSTRACT

Defining the behaviour of a set of situated agents, such that a collaborative problem can be solved is a key issue in multi-agent systems. In this paper, we formulate this problem from the decision theoretic perspective using the framework of Decentralized Partially Observable Markov Decision Processes (DEC-POMDP). Formulating the coordination problem in this way provides a formal foundation for study of cooperation activities. But, as it has been recently shown solving DEC-POMDP is NEXP-complete and thus it is not a realistic approach for the design of agent cooperation policies. However, we demonstrate in this paper that it is not completely desperate. Indeed, we propose an heuristic approach for solving DEC-POMDP when agents are memory-less and when the global reward function can be broken up into a sum of local reward functions. We demonstrate experimentally on an example (the so-called pursuit problem) that this heuristic is efficient within a few iteration steps.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
E. Bonabeau, M. Dorigo, and G. Theraulaz. Inspiration for optimization from social insect behaviour, 2000.
 
3
 
4
 
5
M. Puterman. Markov decision processes, 1994.
 
6
 
7
P. Xuan, V. Lesser, and S. Zilberstein. Formal modeling of communication decisions in cooperative multi-agent systems. In The Second Workshop on Game Theoric and Decision Theoric Agents, 2000.

CITED BY  7

Collaborative Colleagues:
Iadine Chades: colleagues
Bruno Scherrer: colleagues
François Charpillet: colleagues