| Graphical models for online solutions to interactive POMDPs |
| Full text |
Pdf
(394 KB)
|
Source
|
International Conference on Autonomous Agents
archive
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
table of contents
Honolulu, Hawaii
SESSION: Multiagent planning: full papers
table of contents
Article No. 217
Year of Publication: 2007
ISBN:978-81-904262-7-5
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 10, Downloads (12 Months): 52, Citation Count: 2
|
|
|
ABSTRACT
We develop a new graphical representation for interactive partially observable Markov decision processes (I-POMDPs) that is significantly more transparent and semantically clear than the previous representation. These graphical models called interactive dynamic influence diagrams (I-DIDs) seek to explicitly model the structure that is often present in real-world problems by decomposing the situation into chance and decision variables, and the dependencies between the variables. I-DIDs generalize DIDs, which may be viewed as graphical representations of POMDPs, to multiagent settings in the same way that I-POMDPs generalize POMDPs. I-DIDs may be used to compute the policy of an agent online as the agent acts and observes in a setting that is populated by other interacting agents. Using several examples, we show how I-DIDs may be applied and demonstrate their usefulness.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
R. J. Aumann. Interactive epistemology i: Knowledge. International Journal of Game Theory, 28:263--300, 1999.
|
| |
2
|
|
| |
3
|
A. Brandenburger and E. Dekel. Hierarchies of beliefs and common knowledge. Journal of Economic Theory, 59:189--198, 1993.
|
| |
4
|
C. Camerer. Behavioral Game Theory: Experiments in Strategic Interaction. Princeton University Press, 2003.
|
| |
5
|
E. Fehr and S. Gachter. Cooperation and punishment in public goods experiments. American Economic Review, 90(4):980--994, 2000.
|
| |
6
|
D. Fudenberg and D. K. Levine. The Theory of Learning in Games. MIT Press, 1998.
|
| |
7
|
D. Fudenberg and J. Tirole. Game Theory. MIT Press, 1991.
|
 |
8
|
|
| |
9
|
P. Gmytrasiewicz and P. Doshi. A framework for sequential planning in multiagent settings. JAIR, 24:49--79, 2005.
|
| |
10
|
|
| |
11
|
|
| |
12
|
|
| |
13
|
|
| |
14
|
D. Koller and B. Milch. Multi-agent influence diagrams for representing and solving games. In IJCAI, pages 1027--1034, 2001.
|
 |
15
|
|
 |
16
|
|
| |
17
|
|
| |
18
|
|
| |
19
|
|
CITED BY 2
|
|
Yifeng Zeng , Prashant Doshi , Qiongyu Chen, Approximate solutions of interactive dynamic influence diagrams using model clustering, Proceedings of the 22nd national conference on Artificial intelligence, p.782-787, July 22-26, 2007, Vancouver, British Columbia, Canada
|
|
|
|
|