|
ABSTRACT
State estimation consists of updating an agent's belief given executed actions and observed evidence to date. In single agent environments, the state estimation can be formalized using the Bayes filter. Exact estimation can be performed in simple cases, but approximate techniques, like particle filtering, have been used in more realistic cases. This paper extends the particle filter to multiagent settings resulting in the interactive particle filter. The main difficulty we tackle is that to fully represent an agent's beliefs in such environments, one has to specify probability distributions over the physical state and over the beliefs of other agents. This leads to interactive hierarchical belief systems first developed in game theory. Since the update of such beliefs proceeds recursively, the interactive particle filter samples and propagates on all levels of the belief hierarchy. We present algorithms, discuss some of their properties, and illustrate the performance of our implementation using simple examples.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
R. J. Aumann and A. Heifetz. Handbook of Game Theory with Economic Applications, volume 3. Elsevier Science, 2002.
|
| |
2
|
P. Battigalli and G. Bonanno. Recent results on belief, knowledge, and the epistemic foundations of game theory. Research in Economics, 53:149--225, 1999.
|
| |
3
|
A. Brandenburger and E. Dekel. Hierarchies of beliefs and common knowledge. Journal of Economic Theory, 59:189--198, 1993.
|
| |
4
|
A. Doucet, N. D. Freitas, and N. Gordon. Sequential Monte Carlo Methods in Practice. Springer Verlag, 2001.
|
| |
5
|
|
| |
6
|
|
| |
7
|
|
| |
8
|
P. Gmytrasiewicz and P. Doshi. A framework for sequential planning in multiagent settings. Journal of AI Research, 23, 2005.
|
| |
9
|
N. Gordon, D. Salmond, and A. Smith. Novel approach to non-linear/non-gaussian bayesian state estimation. IEEE Proceedings-F, 140(2):107--113, 1993.
|
| |
10
|
J. C. Harsanyi. Games with incomplete information played by 'bayesian' players. Mgmt. Science, 14(3):159--182, 1967.
|
| |
11
|
A. Heifetz and D. Samet. Topology-free typology of beliefs. Journal of Economic Theory, 82:324--341, 1998.
|
| |
12
|
|
| |
13
|
|
| |
14
|
J. Mertens and S. Zamir. Formulation of bayesian analysis for games with incomplete information. International Journal of Game Theory, 14:1--29, 1985.
|
| |
15
|
|
| |
16
|
R. Smallwood and E. Sondik. The optimal control of partially observable markov decision processes over a finite horizon. Operations Research, 21:1071--1088, 1973.
|
| |
17
|
S. Thrun. Monte carlo pomdps. In NIPS 12, pages 1064--1070, 2000.
|
|