ACM Home Page
Please provide us with feedback. Feedback
Reinforcement learning and mistake bounded algorithms
Full text PdfPdf (813 KB)
Source Annual Workshop on Computational Learning Theory archive
Proceedings of the twelfth annual conference on Computational learning theory table of contents
Santa Cruz, California, United States
Pages: 183 - 192  
Year of Publication: 1999
ISBN:1-58113-167-4
Author
Yishay Mansour  AT&T Labs and Tel-Aviv University
Sponsors
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
SIGART: ACM Special Interest Group on Artificial Intelligence
Univ. of California, : University of California at Santa Cruz
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 11,   Citation Count: 1
Additional Information:

references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/307400.307437
What is a DOI?

REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
ACBFS95
 
AS92
N. Alon and J. Spencer. The Probabilistic Method. Wiley, 1992.
 
Bel57
 
Ber87
 
Ber95
 
BT96
 
CLR90
 
How60
R. Howard. Dynamic Programming and Markov Processes. MIT Press, 1960.
 
KMN99
M. Kearns, Y. Mansour, and A. Ng. Approximate planning in large pomdps via reusable trajectories.
 
KV94
 
Lit88
 
Lit96
 
PT87
 
SB98
 
Tes94
 
Vap82