Reinforcement learning and mistake bounded algorithms
Source: Annual Workshop on Computational Learning Theory
Proceedings of the twelfth annual conference on Computational learning theory
Santa Cruz, California, United States
Pages: 183-192
Year of Publication: 1999
ISBN: 1-58113-167-4