| Analysis of two gradient-based algorithms for on-line regression |
| Full text |
Pdf
(1.26 MB)
|
| Source
|
Annual Workshop on Computational Learning Theory
archive
Proceedings of the tenth annual conference on Computational learning theory
table of contents
Nashville, Tennessee, United States
Pages: 163 - 170
Year of Publication: 1997
ISBN:0-89791-891-6
|
|
Author
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 2, Downloads (12 Months): 15, Citation Count: 1
|
|
|
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
N. Cesa-Bianchi, PM. Long, and M.K. Warmuth. Worst-case quadratic loss bounds for prediction using linear functions and gradient descent. IEEE Transactions on Neural Networks, 7(3):604-619, 1996.
|
| |
2
|
|
| |
3
|
M. Feder, N. Merhav, and M. Gutman. Universal prediction of individual sequences. IEEE Trans. on Information Theory, 38:1258-1270,1992.
|
 |
4
|
|
| |
5
|
D.P. Helmbold, J. Kivinen, and M.K. Warmuth. Worstcase loss bounds for sigmoided neurons. In Advances in Neural Information Processing Systems 8. MIT Press, 1996.
|
| |
6
|
W. Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58: 13-30, 1963.
|
| |
7
|
|
| |
8
|
|
| |
9
|
|
| |
10
|
|
| |
11
|
|
| |
12
|
D. Pollard. Convergence of Stochastic Processes. Springer Verlag, 1984.
|
| |
13
|
|
| |
14
|
K. Yamanishi. A decision-theoretic extension of stochastic complexity and its application to learning. Submitted for publication, 1995.
|
|