A heuristic approach for solving decentralized-POMDP: assessment on the pursuit problem
Source
Symposium on Applied Computing
Proceedings of the 2002 ACM symposium on Applied computing
Madrid, Spain
SESSION: Agents, interactions, mobility and systems
Pages: 57 - 62
Year of Publication: 2002
ISBN: 1-58113-445-2
Authors
Iadine Chades, MAIA Team, LORIA, B.P. 239 - 54506, Vandoeuvre-Les-Nancy, France
Bruno Scherrer, MAIA Team, LORIA, B.P. 239 - 54506, Vandoeuvre-Les-Nancy, France
François Charpillet, MAIA Team, LORIA, B.P. 239 - 54506, Vandoeuvre-Les-Nancy, France
Bibliometrics
Downloads (6 Weeks): 4, Downloads (12 Months): 31, Citation Count: 7
ABSTRACT
Defining the behaviour of a set of situated agents so that a collaborative problem can be solved is a key issue in multi-agent systems. In this paper, we formulate this problem from a decision-theoretic perspective using the framework of Decentralized Partially Observable Markov Decision Processes (DEC-POMDP). Formulating the coordination problem in this way provides a formal foundation for the study of cooperative activities. However, as has recently been shown, solving a DEC-POMDP is NEXP-complete, so solving it exactly is not a realistic approach for designing agent cooperation policies. We demonstrate in this paper that the situation is not hopeless: we propose a heuristic approach for solving DEC-POMDPs when agents are memory-less and when the global reward function can be broken up into a sum of local reward functions. We demonstrate experimentally on an example (the so-called pursuit problem) that this heuristic is efficient within a few iteration steps.
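The two structural assumptions named in the abstract (memory-less agents and a global reward that is a sum of local rewards) can be illustrated with a minimal Python sketch. This is not the authors' heuristic or their pursuit setup: the one-dimensional world, the observation labels, the `chase` policy, and the distance-based local reward are all hypothetical choices made only to show the shape of the two assumptions.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical toy state: (pursuer 0 position, pursuer 1 position, prey position)
State = Tuple[int, int, int]
# A memory-less policy maps the current observation directly to an action,
# with no dependence on past observations or actions.
Policy = Dict[str, int]


def observe(state: State, i: int) -> str:
    """Hypothetical local observation of agent i: where the prey lies relative to it."""
    diff = state[2] - state[i]
    if diff < 0:
        return "prey_left"
    if diff > 0:
        return "prey_right"
    return "prey_here"


def joint_action(policies: List[Policy], state: State) -> List[int]:
    """Each agent picks its action from its own current observation only."""
    return [policy[observe(state, i)] for i, policy in enumerate(policies)]


def make_local_reward(i: int) -> Callable[[State, int], float]:
    """Hypothetical local reward: negative distance to the prey after agent i moves."""
    def reward(state: State, action: int) -> float:
        return -abs(state[i] + action - state[2])
    return reward


def global_reward(local_rewards: List[Callable[[State, int], float]],
                  state: State, actions: List[int]) -> float:
    """Global reward decomposed as a sum of per-agent local rewards."""
    return sum(r(state, a) for r, a in zip(local_rewards, actions))


# Assumed example policy: step one cell toward the prey.
chase: Policy = {"prey_left": -1, "prey_right": 1, "prey_here": 0}

state: State = (0, 5, 3)
actions = joint_action([chase, chase], state)
total = global_reward([make_local_reward(0), make_local_reward(1)], state, actions)
```

Because each policy is just a lookup from observation to action, and the reward splits per agent, one agent's table can be evaluated or adjusted in terms of its own local reward term; how the paper's heuristic actually exploits this is described in the full text, not here.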
CITED BY 7
R. Nair, M. Tambe, M. Yokoo, D. Pynadath, S. Marsella, Taming decentralized POMDPs: towards efficient policy computation for multiagent settings, Proceedings of the 18th international joint conference on Artificial intelligence, p.705-711, August 09-15, 2003, Acapulco, Mexico