ACM Home Page
Please provide us with feedback. Feedback
Learning agents for uncertain environments (extended abstract)
Full text PdfPdf (440 KB)
Source Annual Workshop on Computational Learning Theory archive
Proceedings of the eleventh annual conference on Computational learning theory table of contents
Madison, Wisconsin, United States
Pages: 101 - 103  
Year of Publication: 1998
ISBN:1-58113-057-0
Author
Stuart Russell  Computer Science Division, University of California, Berkeley, CA
Sponsors
University of Wisconsin : University of Wisconsin
UC @ Santa Cruz : UC @ Santa Cruz
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
SIGART: ACM Special Interest Group on Artificial Intelligence
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 29,   Citation Count: 2
Additional Information:

references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/279943.279964
What is a DOI?

REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Astrom, K. J. (1965). Optimal control of Markov decision processes with incomplete state estimation. J. Math. Anal. Applic., 10, 174-205.
 
2
 
3
 
4
Binder, J., Murphy, K., & Russell, S. (1997b). Space-efficient inference in dynamic probabilistic networks. In Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence (IJCAI-97) Nagoya, Japan. Morgan Kaufmann.
 
5
Boyen, X., & Koller, D. (1998). Tractable inference for complex stochastic processes. In Proc. 14th Annual Conference on Uncertainty in AI (UAD. to appear.
 
6
 
7
Doya, K., & Sejnowski, T. (1995). A novel reinforcement model of birdsong vocalization learning. In Tesauro, G., Touretzky, D., & Leen, T (Eds.), Advances in Neural Information Processing Systems, Vol. 8, pp. 101-8 Denver, CO. M1T Press.
 
8
Farley, C. T., & Taylor, C. R. (1991). A mechanical trigger for the trot-gallop transition in horses. Science, 253(5017), 306- 308.
 
9
Friedman, N., Murphy, K., & Russell, S. (1998). Leaming the structure of dynamic probabilistic networks. In Uncertainty in Artificial Intelligence : Proceedings of the Fourteenth Conference Madison, Wisconsin. Morgan Kaufmann.
 
10
Hoyt, D., & Taylor, C. (1981). Gait and the energetics of locomotion in horses. Nature, 292,239-240.
 
11
Kaelbling, L. P., Littman, M. L., & Moore, A. W. (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4,237-285.
 
12
Kanazawa, K., Koller, D., & Russell, S. (1995). Stochastic simulation algorithms for dynamic probabilistic networks. In Eleventh Conference, pp. 346-351 Montreal, Canada. Morgan Kaufmann.
 
13
Keeney, R. L., & Raiffa, H. (1976). Decisions with Multiple Objectives: Preferences and Value Tradeoffs. Wiley, New York.
 
14
McCallum, A. R. (1993). Overcoming incomplete perception with utile distinction memory. In Proceedings of the Tenth International Conference on Machine Learning, pp. 190-196 Amherst, Massachusetts. Morgan Kaufmann.
 
15
Montague, P. R., Dayan, P., Person, C., & Sejnowski, T. J. (1995). Bee foraging in uncertain environments using predictive hebbian learning. Nature, 377, 725-728.
 
16
Parr, R., & Russell, S. (1995). Approximating optimal policies for partially observable stochastic domains. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence (IJCAI-95) Montreal, Canada. Morgan Kaufmann.
 
17
 
18
 
19
Rust, J. (1994). Do people behave according to bellman's principal of optimality?. Submitted to Journal of Economic Perspecfives.
 
20
Sargent, T. J. (1978). Estimation of dynamic labor demand schedules under rational expectations. Journal of Political Economy, 86(6), 1009-1044.
 
21
 
22
 
23