ACM Home Page
Please provide us with feedback. Feedback
Cross-validation for binary classification by real-valued functions: theoretical analysis
Full text PdfPdf (1.49 MB)
Source Annual Workshop on Computational Learning Theory archive
Proceedings of the eleventh annual conference on Computational learning theory table of contents
Madison, Wisconsin, United States
Pages: 218 - 229  
Year of Publication: 1998
ISBN:1-58113-057-0
Authors
Martin Anthony  Department of Mathematics, London School of Economics, Houghton Street, London WC2A 2AE, U.K.
Sean B. Holden  Department of Computer Science, University College London, Gower Street, London WC1E 6BT, U.K.
Sponsors
University of Wisconsin : University of Wisconsin
UC @ Santa Cruz : UC @ Santa Cruz
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
SIGART: ACM Special Interest Group on Artificial Intelligence
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 16,   Citation Count: 2
Additional Information:

references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/279943.279987
What is a DOI?

REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
N. Alon, S. Ben-David, N. Cesa-Bianchi, and D. Haussler. Scale-sensitive dimensions, uniform convergence, and learnability. In Proceedings of the Symposium on Foundations of Computer Science. IEEE Press, 1993.
 
2
 
3
 
4
 
5
 
6
 
7
P. Bartlett. The sample complexity of pattern classification with neural networks: the size of the weights is more important that the size of the network. Report, Department of Systems Engineering, Australian National University, May 1997.
 
8
P. Bartlett. For valid generalisation, the size of the weights is more important than the size of the network. In Advances in Neural Information Processing Systems, 9. Morgan Kaufmann, 1996.
9
 
10
B. Cheng and D. M. Titterington. Neural networks: a review from a statistical perspective. Statistical Science, 9(1):2-54, 1994.
 
11
 
12
R. O. Duda and P. E. Hart. Pattern Classification and Scene Analysis. John Wiley, 1973.
 
13
14
 
15
S. B. Holden. Cross-validation and the PAC learning model. Research Note RN/96/64, Department of Computer Science, University College London, December 1996.
 
16
S. B. Holden. On Algorithmic Stability and the Analysis of the Cross-Validation and Holdout Estimates. Research Note RN/97/73, Department of Computer Science, University College London.
 
17
18
19
 
20
M. J. Kearns and R. E. Schapire. Efficient distributionfree learning of probabilistic concepts. In Proc. of the 31st Symposium on the Foundations of Comp. Sci., pages 382-391. IEEE Computer Society Press, Los Alamitos, CA, 1990.
 
21
D. Pollard. Convergence of Stochastic Processes. Springer-Vefiag, 1984.
 
22
N. Sauer. On the density of families of sets. Journal of Combinatorial Theory (A), 13:145-147, 1972.
 
23
J. Shawe-Taylor, P. Bartlett. R.C. Williamson, M. Anthony. Structural risk minimisation over datadependent hierarchies. To appear, IEEE Transactions on Information Theory.
 
24
G.T. Toussaint. Bibliography on estimation of misctassification. IEEE Transactions on Information Theory, 20(4):472-479, 1974.
25
 
26
 
27
V. N. Vapnik and A. Y. Chervonenkis. On the uniform convergence of relative frequencies of events to their probabilities. Theory of Probab. and its Applications, 16(2):264-280, 1971.


Collaborative Colleagues:
Martin Anthony: colleagues
Sean B. Holden: colleagues