Learning curve bounds for a Markov decision process with undiscounted rewards
Source Annual Workshop on Computational Learning Theory archive
Proceedings of the ninth annual conference on Computational learning theory table of contents
Desenzano del Garda, Italy
Pages: 147 - 156  
Year of Publication: 1996
ISBN:0-89791-811-8
Authors
Lawrence K. Saul  Center for Biological and Computational Learning, Massachusetts Institute of Technology, 79 Amherst Street, E10-243, Cambridge, MA
Satinder P. Singh  Harlequin Inc., One Cambridge Center, Cambridge, MA and Center for Biological and Computational Learning, Massachusetts Institute of Technology, 79 Amherst Street, E10-243, Cambridge, MA
Sponsors
Università degli Studi di Milano
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
SIGART: ACM Special Interest Group on Artificial Intelligence
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 1,   Downloads (12 Months): 13,   Citation Count: 2
DOI: http://doi.acm.org/10.1145/238061.238084

REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
J. A. Bucklew. Large deviation techniques in decision, simulation, and estimation. John Wiley & Sons, New York, 1990.
4
5
 
6
K. Huang. Statistical Mechanics. John Wiley & Sons, New York, NY, 1987.
 
7
M. Marcus and H. Minc. A survey of matrix theory and matrix inequalities. Dover, New York, 1992.
8
 
9
H. S. Seung, H. Sompolinsky, and N. Tishby. Statistical mechanics of learning from examples. Physical Review A 45: 6056-6091, 1992.
 
10
 
11
 
12
T. Watkin, A. Rau, and M. Biehl. The statistical mechanics of learning a rule. Reviews of Modern Physics 65:499-556, 1993.
 
13

