ACM Home Page
Please provide us with feedback. Feedback
Efficiently learning linear-linear exponential family predictive representations of state
Full text PdfPdf (331 KB)
Source ICML; Vol. 307 archive
Proceedings of the 25th international conference on Machine learning table of contents
Helsinki, Finland
Pages 1176-1183  
Year of Publication: 2008
ISBN:978-1-60558-205-4
Authors
David Wingate  University of Michigan, Ann Arbor, MI
Satinder Singh  University of Michigan, Ann Arbor, MI
Sponsors
: Yahoo!
: Xerox
IBM : IBM
: NSF
Microsoft Research : Microsoft Research
: Machine Learning Journal/Springer
: Pascal
: University of Helsinki
: Federation of Finnish Learned Societies
: Intel Corporation
: Google
: Helsinki Institute for Information Technology
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 15,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1390156.1390304
What is a DOI?

ABSTRACT

Exponential Family PSR (EFPSR) models capture stochastic dynamical systems by representing state as the parameters of an exponential family distribution over a shortterm window of future observations. They are appealing from a learning perspective because they are fully observed (meaning expressions for maximum likelihood do not involve hidden quantities), but are still expressive enough to both capture existing models and predict new models. While maximum-likelihood learning algorithms for EFPSRs exist, they are not computationally feasible. We present a new, computationally efficient, learning algorithm based on an approximate likelihood function. The algorithm can be interpreted as attempting to induce stationary distributions of observations, features and states which match their empirically observed counterparts. The approximate likelihood, and the idea of matching stationary distributions, may apply to other models.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Brand, M. (2006). Fast low-rank modifications of the thin singular value decomposition. Linear Algebra and its Applications, 415, 20--30.
 
2
 
3
Littman, M. L., Sutton, R. S., & Singh, S. (2002). Predictive representations of state. Neural Information Processing Systems (NIPS) (pp. 1555--1561).
 
4
Peters, J., Vijayakumar, S., & Schaal, S. (2005). Natural actor-critic. European Conference on Machine Learning (ECML) (pp. 280--291).
 
5
Rudary, M. R., Singh, S., & Wingate, D. (2005). Predictive linear-Gaussian models of stochastic dynamical systems. Uncertainty in Artificial Intelligence (pp. 501--508).
 
6

Collaborative Colleagues:
David Wingate: colleagues
Satinder Singh: colleagues