ACM Home Page
Please provide us with feedback. Feedback
Improved approximation of interactive dynamic influence diagrams using discriminative model updates
Full text PdfPdf (432 KB)
Source
International Conference on Autonomous Agents archive
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2 table of contents
Budapest, Hungary
SESSION: Planning/search table of contents
Pages 907-914  
Year of Publication: 2009
ISBN:978-0-9817381-7-8
Authors
Prashant Doshi  University of Georgia, Athens, GA
Yifeng Zeng  Aalborg University, Aalborg, Denmark
Sponsors
: The Foundation for Intelligent Physical Agents
Microsoft Research : Microsoft Research
: Whitestein Technologies
: European Office of Aerospace Research and Development, Air Force Office of Scientific Research, United States Air Force Research Laboratory
: Drexel University
: Wiley -- Blackwell Ltd
Publisher
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 15,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

Interactive dynamic influence diagrams (I-DIDs) are graphical models for sequential decision making in uncertain settings shared by other agents. Algorithms for solving I-DIDs face the challenge of an exponentially growing space of candidate models ascribed to other agents, over time. We formalize the concept of a minimal model set, which facilitates qualitative comparisons between different approximation techniques. We then present a new approximation technique that minimizes the space of candidate models by discriminating between model updates. We empirically demonstrate that our approach improves significantly in performance on the previous clustering based approximation technique.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
P. Gmytrasiewicz and P. Doshi. A framework for sequential planning in multiagent settings. JAIR, 24:49--79, 2005.
 
4
D. Koller and B. Milch. Multi-agent IDs for representing and solving games. In IJCAI, pages 1027--1034, 2001.
 
5
J. Pineau, G. Gordon, and S. Thrun. Anytime point-based value iteration for large pomdps. JAIR, 27:335--380, 2006.
 
6
D. Pynadath and S. Marsella. Minimal mental models. In AAAI, pages 1038--1044, 2007.
7
 
8
S. Seuken and S. Zilberstein. Improved memory bounded dynamic programming for decentralized pomdps. In UAI, pages 2009--2015, 2007.
 
9
R. Smallwood and E. Sondik. The optimal control of partially observable markov decision processes over a finite horizon. OR, 21:1071--1088, 1973.
 
10
 
11
J. A. Tatman and R. D. Shachter. Dynamic programming and influence diagrams. IEEE Trans. SMC, 20(2):365--379, 1990.

Collaborative Colleagues:
Prashant Doshi: colleagues
Yifeng Zeng: colleagues