ACM Home Page
Please provide us with feedback. Feedback
Learning with a slowly changing distribution
Full text PdfPdf (805 KB)
Source Annual Workshop on Computational Learning Theory archive
Proceedings of the fifth annual workshop on Computational learning theory table of contents
Pittsburgh, Pennsylvania, United States
Pages: 243 - 252  
Year of Publication: 1992
ISBN:0-89791-497-X
Author
Peter L. Bartlett  Department of Electrical Engineering, University of Queensland, Queensland 4072, AUSTRALIA
Sponsors
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
SIGART: ACM Special Interest Group on Artificial Intelligence
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 0,   Downloads (12 Months): 11,   Citation Count: 11
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/130385.130412
What is a DOI?

ABSTRACT

In this paper, we consider the problem of learning a subset of a domain from randomly chosen examples when the probability distribution of the examples changes slowly but continually throughout the learning process. We give upper and lower bounds on the best achievable probability of misclassification after a given number of examples. If d is the VC-dimension of the target function class, t is the number of examples, and &Ugr; is the amount by which the distribution is allowed to change (measured by the largest change in the probability of a subset of the domain), the upper bound decreases as d/t initially, and settles to O(d2/3&Ugr;1/2) for large t. These bounds give necessary and sufficient conditions on &Ugr;, the rate of change of the distribution of examples, to ensure that some learning algorithm can produce an acceptably small probability of misclassification. We also consider the case of learning a near-optimal subset of the domain when the examples and their labels are generated by a joint probability distribution on the example and label spaces. We give an upper bound on &Ugr; that ensures learning is possible from a finite number of examples.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
ABST90
M. Anthony, N. Biggs, and J. Shawe.Taylor. Learnability and formal concept analysis. Technical Report CSD-TR-624, UCL, 1990.
 
AST90
M. Anthony and J. Shawe-Taylor. A result of Vapnik with applications. Technical port CSD-TR-628, UCL, 1990.
BEHW89
 
Hal50
P.R. Halmos. Measure Theory. Van Nostrand, 1950.
 
HKLW88
 
HL9l
 
HLW90
 
Kra88
A.H. Kramer. Learning despite distribution drift. In Proceedings of the Connectionist Models Summer School, pages 201-210. Morgan Kaufmann, San Mateo, CA, 1988.
 
Kul67
S. Kullbaek. A lower bound for discrimination information in terms of variation. IEEE Transactions on Information Theory, IT-13:126-127, 1967.
 
Ren61
A. Renyi. On measures of entropy and information. In Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, volume 1, pages 547-561. University of California Press, 1961.
Val84
 
Vap82
 
VC71
V.N. Vapnik and A. Ya. Chervonenkis. On the uniform convergence of relative frequencies of events to their probabilities. Theory of Probability and its Applications, XVI(2):264-280, 1971.

CITED BY  11
 
 
 
 


Peer to Peer - Readers of this Article have also read: