|
ABSTRACT
In multi-agent cooperation, agents share a common goal, which is evaluated through a global utility function. However, an agent typically cannot observe the global state of an uncertain environment, and therefore they must communicate with each other in order to share the information needed for deciding which actions to take. We argue that, when communication incurs a cost (due to resource consumption, for example), whether to communicate or not also becomes a decision to make. Hence, communication decision becomes part of the overall agent decision problem. In order to explicitly address this problem, we present a multi-agent extension to Markov decision processes in which communication can be modeled as an explicit action that incurs a cost. This framework provides a foundation for a quantified study of agent coordination policies and provides both motivation and insight to the design of heuristic approaches. An example problem is studied under this framework. From this example we can see the impact communication policies have on the overall agent policies, and what implications we can find toward the design of agent coordination policies.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
M. Aicardi, F. Davoli, and R. Minciardi. Decentralized optimal control of markov chains with a common past information set. IEEE Transactions on Automatic Control, AC-32:1028-1031, 1987.
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
| |
5
|
|
| |
6
|
E. Hansen, A. Barto, and S. Zilberstein. Reinforcement learning for mixed open-loop and closed-loop control. In Proceedings of the Ninth Neural Information Processing Systems Conference, December 1996.
|
| |
7
|
E. A. Hansen and S. Zilberstein. Monitoring the progress of anytime problem-solving. In Proceedings of the 13th National Conference onArtificial Intelligence, pages 1229-1234, 1996.
|
| |
8
|
Y. C. Ho and T. S. Chang. Another look at the nonclassical information problem. IEEE Transactions on Automatic Control, AC-25:537-540, 1980.
|
| |
9
|
K. Hsu and S. I. Marcus. Decentralized control of finite state markov processes. IEEE Transactions on Automatic Control, AC-27:426-431, 1982.
|
| |
10
|
M. L. Littman. Markov games as a framework for multi-agent reinforcement learning. In Proc. 11th International Conf. on Machine Learning, pages 157-163, 1994.
|
| |
11
|
|
| |
12
|
|
| |
13
|
N. R. Sandell, P. Varaiya, M. Athans, and M. Safonov. Survey of decentralized control methods for large scale systems. IEEE Transactions on Automatic Control, AC-23:108-128, 1978.
|
| |
14
|
J. N. Tsitsiklis and M. Athans. On the complexity of decentralized decision making and detection problems. IEEE Transactions on Automatic Control, AC-30:440-446, 1985.
|
| |
15
|
H. S. Witsenhausen. A counterexample in stochastic optimum control. SIAM Journal on Control, 6(1):138-147, 1968.
|
| |
16
|
T. Yoshikawa. Decomposition of dynamic team decision problems. IEEE Transactions on Automatic Control, AC-23:443-445, 1978.
|
CITED BY 40
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Yang Xu , Paul Scerri , Bin Yu , Steven Okamoto , Michael Lewis , Katia Sycara, An integrated token-based algorithm for scalable coordination, Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems, July 25-29, 2005, The Netherlands
|
|
|
|
|
|
Raz Lin , Daphna Dor-Shifer , Saar Rosenberg , Sarit Kraus , David Sarne, Towards the fourth generation of cellular networks: improving performance using distributed negotiation, Proceedings of the 9th ACM international symposium on Modeling analysis and simulation of wireless and mobile systems, October 02-06, 2006, Terromolinos, Spain
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Rosemary Emery-Montemerlo , Geoff Gordon , Jeff Schneider , Sebastian Thrun, Approximate Solutions for Partially Observable Stochastic Games with Common Payoffs, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, p.136-143, July 19-23, 2004, New York, New York
|
|
|
Paul Scerri , Yang Xu , Elizabeth Liao , Justin Lai , Katia Sycara, Scaling Teamwork to Very Large Teams, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, p.888-895, July 19-23, 2004, New York, New York
|
|
|
|
|
|
|
|
|
Maxim Peysakhov , Donovan Artz , Evan Sultanik , William Regli, Network Awareness for Mobile Agents on Ad Hoc Networks, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, p.368-376, July 19-23, 2004, New York, New York
|
|
|
V. Lesser , K. Decker , T. Wagner , N. Carver , A. Garvey , B. Horling , D. Neiman , R. Podorozhny , M. Nagendra Prasad , A. Raja , R. Vincent , P. Xuan , X. Q. Zhang, Evolution of the GPGP/TÆMS Domain-Independent Coordination Framework, Autonomous Agents and Multi-Agent Systems, v.9 n.1-2, p.87-143, July-September 2004
|
|
|
|
|
|
|
|
|
Raz Lin , Daphna Dor-Shifer , Sarit Kraus , David Sarne, Local negotiation in cellular networks: from theory to practice, Proceedings of the 18th conference on Innovative applications of artificial intelligence, p.1801-1807, July 16-20, 2006, Boston, Massachusetts
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
R. Nair , M. Tambe , M. Yokoo , D. Pynadath , S. Marsella, Taming decentralized POMDPs: towards efficient policy computation for multiagent settings, Proceedings of the 18th international joint conference on Artificial intelligence, p.705-711, August 09-15, 2003, Acapulco, Mexico
|
INDEX TERMS
Primary Classification:
I.
Computing Methodologies
I.2
ARTIFICIAL INTELLIGENCE
I.2.11
Distributed Artificial Intelligence
Subjects:
Multiagent systems
Additional Classification:
H.
Information Systems
H.4
INFORMATION SYSTEMS APPLICATIONS
H.4.2
Types of Systems
Subjects:
Decision support (e.g., MIS)
H.5
INFORMATION INTERFACES AND PRESENTATION (I.7)
I.
Computing Methodologies
I.2
ARTIFICIAL INTELLIGENCE
I.2.8
Problem Solving, Control Methods, and Search
Subjects:
Heuristic methods
General Terms:
Algorithms,
Design,
Experimentation,
Management,
Measurement,
Performance,
Theory
Keywords:
MDP,
coordinating multiple agents,
multi-agent communication/collaboration
|