| Efficiently learning linear-linear exponential family predictive representations of state |
| Full text |
Pdf
(331 KB)
|
| Source
|
ICML; Vol. 307
archive
Proceedings of the 25th international conference on Machine learning
table of contents
Helsinki, Finland
Pages 1176-1183
Year of Publication: 2008
ISBN:978-1-60558-205-4
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 4, Downloads (12 Months): 15, Citation Count: 0
|
|
|
ABSTRACT
Exponential Family PSR (EFPSR) models capture stochastic dynamical systems by representing state as the parameters of an exponential family distribution over a shortterm window of future observations. They are appealing from a learning perspective because they are fully observed (meaning expressions for maximum likelihood do not involve hidden quantities), but are still expressive enough to both capture existing models and predict new models. While maximum-likelihood learning algorithms for EFPSRs exist, they are not computationally feasible. We present a new, computationally efficient, learning algorithm based on an approximate likelihood function. The algorithm can be interpreted as attempting to induce stationary distributions of observations, features and states which match their empirically observed counterparts. The approximate likelihood, and the idea of matching stationary distributions, may apply to other models.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Brand, M. (2006). Fast low-rank modifications of the thin singular value decomposition. Linear Algebra and its Applications, 415, 20--30.
|
| |
2
|
|
| |
3
|
Littman, M. L., Sutton, R. S., & Singh, S. (2002). Predictive representations of state. Neural Information Processing Systems (NIPS) (pp. 1555--1561).
|
| |
4
|
Peters, J., Vijayakumar, S., & Schaal, S. (2005). Natural actor-critic. European Conference on Machine Learning (ECML) (pp. 280--291).
|
| |
5
|
Rudary, M. R., Singh, S., & Wingate, D. (2005). Predictive linear-Gaussian models of stochastic dynamical systems. Uncertainty in Artificial Intelligence (pp. 501--508).
|
| |
6
|
|
|