| Learning curve bounds for a Markov decision process with undiscounted rewards |
| Full text |
Pdf
(1.03 MB)
|
| Source
|
Annual Workshop on Computational Learning Theory
archive
Proceedings of the ninth annual conference on Computational learning theory
table of contents
Desenzano del Garda, Italy
Pages: 147 - 156
Year of Publication: 1996
ISBN:0-89791-811-8
|
|
Authors
|
|
Lawrence K. Saul
|
Center for Biological and Computational Learning, Massachusetts Institute of Technology, 79 Amherst Street, E10-243, Cambridge, MA
|
|
Satinder P. Singh
|
Harlequin Inc., One Cambridge Center, Cambridge, MA and Center for Biological and Computational Learning, Massachusetts Institute of Technology, 79 Amherst Street, E10-243, Cambridge, MA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 0, Downloads (12 Months): 10, Citation Count: 2
|
|
|
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
J. A. Bucklew. Large deviatwn techniques zn deciszon, simulation, and est~matzon. John Wiley & Sons, New York, 1990.
|
 |
4
|
|
 |
5
|
David Haussler , H. Sebastian Seung , Michael Kearns , Naftali Tishby, Rigorous learning curve bounds from statistical mechanics, Proceedings of the seventh annual conference on Computational learning theory, p.76-87, July 12-15, 1994, New Brunswick, New Jersey, United States
[doi> 10.1145/180139.181018]
|
| |
6
|
K. Huang. Statistical Mechanics. John Wiley &; Sons, New York, NY, 1987.
|
| |
7
|
M. Marcus and H. Mine. A survey of matrix theory and matrix inequalities. Dover, New York, 1992.
|
 |
8
|
|
| |
9
|
H. S. Seung, H. Sompolinsky, and N. Tishby. Statistical mechanics of learning from examples. Physical Review A 45: 6056-6091, 1992.
|
| |
10
|
|
| |
11
|
|
| |
12
|
T. Watkin, A. Rau, and 5I. Biehl. The statistical mechanics of learning a rule. Reviews of Modern Physzcs 65:499-556, 1993.
|
| |
13
|
|
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE Design Automation Conference on
Gwo-Dong Chen
, Daniel D. Gajski
|