|
ABSTRACT
An algorithm is a weak learner if with some small probability it
outputs a hypothesis with error slightly below 50%. This paper presents
sufficient conditions for weak learning.
Our main result requires a “consistency oracle” for the
concept class
F
which decides for a given set of examples whether
there is a concept in
F
consistent with the examples. We show that such an
oracle can be used to construct a computationally efficient weak
learning algorithm for
F
if
F
is learnable at all. We consider
consistency oracles which are allowed to give wrong answers and
discusses how the number of incorrect answers effects the oracle's use
in computationally efficient weak learning algorihms.
We also define “weak Occam algorithms” which, when
given a set of m examples, select a
consistent hypothesis from some class of
2m-(1/p(m))
possible hypotheses. We show that these weak Occam algorithms are also
weak learners. In contrast, we show that an Occam style algorithm which
selects a consistent hypothesis from a class of
2m+1-2
hypotheses is not a weak learner.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
AHW87
|
N. Alon , D. Haussler , E. Welzl, Partitioning and geometric embedding of range spaces of finite Vapnik-Chervonenkis dimension, Proceedings of the third annual symposium on Computational geometry, p.331-340, June 08-10, 1987, Waterloo, Ontario, Canada
[doi> 10.1145/41958.41994]
|
| |
BEHW87
|
|
 |
BEHW89
|
|
| |
Bon72
|
J. A. Bondy. Induced subsets. Journal of Combtnatorzal Theory, 12( B):21)1-202, 1972.
|
| |
Fre90
|
|
| |
HKLW91
|
|
| |
HKS91
|
David Haussler , Michael Kearns , Robert Schapire, Bounds on the sample complexity of Bayesian learning using information theory and the VC dimension, Proceedings of the fourth annual workshop on Computational learning theory, p.61-74, August 05-07, 1991, Santa Cruz, California, United States
|
| |
HLW
|
|
| |
HW92
|
D. Helrnbold and M. K. Warmuth. On weak learning. In Proceedings of the Third NEC Research Symposium on Computational Learning and Cognition, SIAM, 3600 University City Science Center, Philadelphia, PA 19104-2688, May 1992.
|
 |
KV89
|
|
| |
Sau72
|
N. Sauer. On the density of families of sets. Journal of Combinatorial Theory (Series A), 13:145-147, 1972.
|
| |
Sch90
|
|
 |
Val84
|
|
| |
VC71
|
V. N. Vapnik and A. Ya. Chervonenlkis. On the uniform convergence of relative frequencies of events to their probabilities. Theory of Pro babtltty and its Application, 16(2):264-280, 1971.
|
| |
Vov90
|
|
CITED BY 2
|
|
Nicolò Cesa-Bianchi , Yoav Freund , David P. Helmbold , David Haussler , Robert E. Schapire , Manfred K. Warmuth, How to use expert advice, Proceedings of the twenty-fifth annual ACM symposium on Theory of computing, p.382-391, May 16-18, 1993, San Diego, California, United States
|
|
|
|
|