ABSTRACT
In this paper we introduce Dynamics Based Control (DBC), an approach to planning and control of an agent in stochastic environments. Unlike existing approaches, which seek to optimize expected rewards (e.g., in Partially Observable Markov Decision Problems (POMDPs)), DBC optimizes system behavior towards specified system dynamics. We show that a recently developed planning and control approach, Extended Markov Tracking (EMT), is an instantiation of DBC. EMT employs greedy action selection to provide an efficient control algorithm in Markovian environments. We exploit this efficiency in a set of experiments that apply multi-target EMT to a class of area-sweeping problems (searching for moving targets). We show that such problems can be naturally defined and efficiently solved using the DBC framework and its EMT instantiation.