|
ABSTRACT
Just as POMDPs have been used to reason explicitly about uncertainty in single-agent systems, there has been recent interest in using multi-agent POMDPs to coordinate teams of agents in the presence of uncertainty. Although multi-agent POMDPs are known to be highly intractable, communication at every time step transforms a multi-agent POMDP into a more tractable single-agent POMDP. In this paper, we present an approach that generates "centralized" policies for multi-agent POMDPs at plan-time by assuming the presence of free communication, and at run-time, handles the problem of limited communication resources by reasoning about the use of communication as needed for effective execution. This approach trades off the need to do some computation at execution-time for the ability to generate policies more tractably at plan-time. In our algorithm, each agent, at run-time, models the distribution of possible joint beliefs. Joint actions are selected over this distribution, ensuring that agents remain synchronized. Communication is used to integrate local observations into the team belief only when those observations would improve team performance. We show, both through a detailed example and with experimental results, that our approach allows for effective decentralized execution while avoiding unnecessary instances of communication.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
| |
3
|
A. R. Cassandra. POMDP solver software. http://www.cassandra.org/pomdp/code/index.shtml.
|
| |
4
|
Rosemary Emery-Montemerlo , Geoff Gordon , Jeff Schneider , Sebastian Thrun, Approximate Solutions for Partially Observable Stochastic Games with Common Payoffs, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, p.136-143, July 19-23, 2004, New York, New York
[doi> 10.1109/AAMAS.2004.67]
|
| |
5
|
D. Goldberg, V. Cicirello, M. B. Dias, R. Simmons, S. Smith, T. Smith, and A. Stentz. A distributed layered architecture for mobile robot coordination: Application to space exploration. In International NASA Workshop on Planning and Scheduling for Space, 2002.
|
| |
6
|
C. V. Goldman and S. Zilberstein. Decentralized control of cooperative systems: Categorization and complexity analysis. Journal of AI Research, 2004.
|
| |
7
|
E. A. Hansen, D. S. Bernstein, and S. Zilberstein. Dynamic programming for partially observable stochastic games. In National Conference on Artificial Intelligence, 2004.
|
| |
8
|
|
| |
9
|
M. L. Littman, A. R. Cassandra, and L. P. Kaelbling. Learning policies for partially observable environments: Scaling up. In International Conference on Machine Learning, 1995.
|
| |
10
|
R. Nair, D. Pynadath, M. Yokoo, M. Tambe, and S. Marsella. Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings. In International Joint Conference on Artificial Intelligence, 2003.
|
| |
11
|
|
| |
12
|
R. Nair, M. Tambe, and S. Marsella. Team formation for reformation in multiagent domains like RoboCupRescue. In RoboCup-2002 International Symposium, 2003.
|
| |
13
|
|
| |
14
|
|
| |
15
|
|
| |
16
|
D. V. Pynadath and M. Tambe. The communicative multiagent team decision problem: Analyzing teamwork theories and models. Journal of AI Research, 2002.
|
| |
17
|
M. Roth, D. Vail, and M. Veloso. A real-time world model for multi-robot teams with high-latency communication. In International Conference on Intelligent Robots and Systems, 2003.
|
| |
18
|
S. Thrun. Monte Carlo POMDPs. In Neural Information Processing Systems, 2000.
|
 |
19
|
|
| |
20
|
P. Xuan, V. Lesser, and S. Zilberstein. Formal modeling of communication decisions in cooperative multiagent systems. In Workshop on Game-Theoretic and Decision-Theoretic Agents, 2000.
|
CITED BY 8
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Giuseppe P. Settembre , Paul Scerri , Alessandro Farinelli , Katia Sycara , Daniele Nardi, A decentralized approach to cooperative situation assessment in multi-robot systems, Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems, May 12-16, 2008, Estoril, Portugal
|
|
|
|
|
|
|
|
|
|
|