ACM Home Page
Please provide us with feedback. Feedback
Dynamics based control with PSRs
Full text PdfPdf (396 KB)
Source
International Conference on Autonomous Agents archive
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1 table of contents
Estoril, Portugal
SESSION: Agent and multi-agent learning table of contents
Pages 387-394  
Year of Publication: 2008
ISBN:978-0-9817381-0-9
Authors
Ariel Adam  The Hebrew University of Jerusalem
Zinovi Rabinovich  Southampton University
Jeffrey S. Rosenschein  The Hebrew University of Jerusalem
Sponsors
ACM: Association for Computing Machinery
AAAI : Association for the Advancement of Artifical Intelligence
Publisher
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 34,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

We present an extension of the Dynamics Based Control (DBC) paradigm to environment models based on Predictive State Representations (PSRs). We show an approximate greedy version of the DBC for PSR model, EMT-PSR, and demonstrate how this algorithm can be applied to solve several control problems. We then provide some classifications and requirements of PSR environment models that are necessary for the EMT-PSR algorithm to operate.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
L. Chrisman. Reinforcement learning with perceptual aliasing: The perceptual distinctions approach. In Proceedings of the Tenth National Conference on Artificial Intelligence, pages 183--188, 1992.
 
2
 
3
M. L. Littman. Markov games as a framework for multi-agent reinforcement learning. In The 11th Int. Conference on Machine Learning, pages 157--163, 1994.
 
4
 
5
M. L. Littman, T. L. Dean, and L. P. Kaelbling. On the complexity of solving Markov decision problems. In Proceedings of the 11th Annual Conference on Uncertainty in Artificial Intelligence (UAI-95), pages 394--402, 1995.
 
6
M. L. Littman, R. S. Sutton, and S. Singh. Predictive representation of state. In Advances in Neural Information Processing Systems (NIPS-14), pages 1555--1561, 2001.
 
7
 
8
R. A. McCallum. Overcoming incomplete perception with utile distinction memory. In Proceedings of the 10th International Conference on Machine Learning, 1993.
 
9
K. P. Murphy. A survey of POMDP solution techniques. Technical report, University of California at Berkeley, 2000.
 
10
R. Parr and S. Russell. Approximating optimal policies for partially observable stochastic domains. In The 14th Int. Joint Conference on Artificial Intelligence, 1995.
 
11
M. L. Puterman. Markov Decision Processes. Wiley Series in Probability and Mathematical Statistics: Applied Probability and Statistics. Wiley-Interscience, New York, 1994.
 
12
Z. Rabinovich and J. S. Rosenschein. Extended Markov Tracking with an application to control. In The Workshop on Agent Tracking: Modeling Other Agents from Observations, at AAMAS'04, pages 95--100, New-York, July 2004.
13
14
15
 
16
 
17

Collaborative Colleagues:
Ariel Adam: colleagues
Zinovi Rabinovich: colleagues
Jeffrey S. Rosenschein: colleagues