|
ABSTRACT
We present an extension of the Dynamics Based Control (DBC) paradigm to environment models based on Predictive State Representations (PSRs). We show an approximate greedy version of the DBC for PSR model, EMT-PSR, and demonstrate how this algorithm can be applied to solve several control problems. We then provide some classifications and requirements of PSR environment models that are necessary for the EMT-PSR algorithm to operate.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
L. Chrisman. Reinforcement learning with perceptual aliasing: The perceptual distinctions approach. In Proceedings of the Tenth National Conference on Artificial Intelligence, pages 183--188, 1992.
|
| |
2
|
|
| |
3
|
M. L. Littman. Markov games as a framework for multi-agent reinforcement learning. In The 11th Int. Conference on Machine Learning, pages 157--163, 1994.
|
| |
4
|
|
| |
5
|
M. L. Littman, T. L. Dean, and L. P. Kaelbling. On the complexity of solving Markov decision problems. In Proceedings of the 11th Annual Conference on Uncertainty in Artificial Intelligence (UAI-95), pages 394--402, 1995.
|
| |
6
|
M. L. Littman, R. S. Sutton, and S. Singh. Predictive representation of state. In Advances in Neural Information Processing Systems (NIPS-14), pages 1555--1561, 2001.
|
| |
7
|
|
| |
8
|
R. A. McCallum. Overcoming incomplete perception with utile distinction memory. In Proceedings of the 10th International Conference on Machine Learning, 1993.
|
| |
9
|
K. P. Murphy. A survey of POMDP solution techniques. Technical report, University of California at Berkeley, 2000.
|
| |
10
|
R. Parr and S. Russell. Approximating optimal policies for partially observable stochastic domains. In The 14th Int. Joint Conference on Artificial Intelligence, 1995.
|
| |
11
|
M. L. Puterman. Markov Decision Processes. Wiley Series in Probability and Mathematical Statistics: Applied Probability and Statistics. Wiley-Interscience, New York, 1994.
|
| |
12
|
Z. Rabinovich and J. S. Rosenschein. Extended Markov Tracking with an application to control. In The Workshop on Agent Tracking: Modeling Other Agents from Observations, at AAMAS'04, pages 95--100, New-York, July 2004.
|
 |
13
|
|
 |
14
|
|
 |
15
|
|
| |
16
|
|
| |
17
|
|
|