ACM Home Page
Please provide us with feedback. Feedback
Reasoning about joint beliefs for execution-time communication decisions
Full text PdfPdf (422 KB)
Source International Conference on Autonomous Agents archive
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems table of contents
The Netherlands
SESSION: Papers: argumentation and dialog table of contents
Pages: 786 - 793  
Year of Publication: 2005
ISBN:1-59593-093-0
Authors
Maayan Roth  Carnegie Mellon University, Pittsburgh, PA
Reid Simmons  Carnegie Mellon University, Pittsburgh, PA
Manuela Veloso  Carnegie Mellon University, Pittsburgh, PA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 42,   Citation Count: 8
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1082473.1082593
What is a DOI?

ABSTRACT

Just as POMDPs have been used to reason explicitly about uncertainty in single-agent systems, there has been recent interest in using multi-agent POMDPs to coordinate teams of agents in the presence of uncertainty. Although multi-agent POMDPs are known to be highly intractable, communication at every time step transforms a multi-agent POMDP into a more tractable single-agent POMDP. In this paper, we present an approach that generates "centralized" policies for multi-agent POMDPs at plan-time by assuming the presence of free communication, and at run-time, handles the problem of limited communication resources by reasoning about the use of communication as needed for effective execution. This approach trades off the need to do some computation at execution-time for the ability to generate policies more tractably at plan-time. In our algorithm, each agent, at run-time, models the distribution of possible joint beliefs. Joint actions are selected over this distribution, ensuring that agents remain synchronized. Communication is used to integrate local observations into the team belief only when those observations would improve team performance. We show, both through a detailed example and with experimental results, that our approach allows for effective decentralized execution while avoiding unnecessary instances of communication.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
A. R. Cassandra. POMDP solver software. http://www.cassandra.org/pomdp/code/index.shtml.
 
4
 
5
D. Goldberg, V. Cicirello, M. B. Dias, R. Simmons, S. Smith, T. Smith, and A. Stentz. A distributed layered architecture for mobile robot coordination: Application to space exploration. In International NASA Workshop on Planning and Scheduling for Space, 2002.
 
6
C. V. Goldman and S. Zilberstein. Decentralized control of cooperative systems: Categorization and complexity analysis. Journal of AI Research, 2004.
 
7
E. A. Hansen, D. S. Bernstein, and S. Zilberstein. Dynamic programming for partially observable stochastic games. In National Conference on Artificial Intelligence, 2004.
 
8
 
9
M. L. Littman, A. R. Cassandra, and L. P. Kaelbling. Learning policies for partially observable environments: Scaling up. In International Conference on Machine Learning, 1995.
 
10
R. Nair, D. Pynadath, M. Yokoo, M. Tambe, and S. Marsella. Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings. In International Joint Conference on Artificial Intelligence, 2003.
 
11
 
12
R. Nair, M. Tambe, and S. Marsella. Team formation for reformation in multiagent domains like RoboCupRescue. In RoboCup-2002 International Symposium, 2003.
 
13
 
14
 
15
 
16
D. V. Pynadath and M. Tambe. The communicative multiagent team decision problem: Analyzing teamwork theories and models. Journal of AI Research, 2002.
 
17
M. Roth, D. Vail, and M. Veloso. A real-time world model for multi-robot teams with high-latency communication. In International Conference on Intelligent Robots and Systems, 2003.
 
18
S. Thrun. Monte Carlo POMDPs. In Neural Information Processing Systems, 2000.
19
 
20
P. Xuan, V. Lesser, and S. Zilberstein. Formal modeling of communication decisions in cooperative multiagent systems. In Workshop on Game-Theoretic and Decision-Theoretic Agents, 2000.

CITED BY  8

Collaborative Colleagues:
Maayan Roth: colleagues
Reid Simmons: colleagues
Manuela Veloso: colleagues