|
ABSTRACT
We exhibit efficient algorithms for agnostic PAC-learning with rectangles, unions of two rectangles, and unions of k intervals as hypotheses. These hypothesis classes are of some interest from the point of view of applied machine learning, because empirical studies show that hypotheses of this simple type (in just one or two of the attributes) provide good prediction rules for various real-world classification problems. In addition, optimal hypotheses of this type may provide valuable heuristic insight into the structure of a real world classification problem.
The algorithms that are introduced in this paper make it feasible to compute optimal hypotheses of this type for a training set of several hundred examples. We also exhibit an approximation algorithm that can compute nearly optimal hypotheses for much larger datasets.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
AL
|
|
| |
BN
|
|
 |
D
|
|
| |
DG a
|
D.P. Dobkin, D. Gunopulos, Computing the rectangle discrepancy (video), 3rd Annual Video Review of Computational Geometry
|
| |
DG b
|
D.P. Dobkin, D. Gunopulos, Computing the rectangle discrepancy, Tech Report 443-94, Princeton University
|
| |
H
|
|
| |
HSV
|
K.U. Hoeffgen, H. U. Simon, K. S. Van Horn, Robust trainability of single neurons, preprint (1993)
|
| |
Ho
|
|
 |
K
|
|
| |
KL
|
|
| |
KS
|
M.J. Kearns, R. E. Schapire, Efficient distribution-free learning of probabilisiic concepts, Proc. of the 31st Annual IEEE Symp. on Foundations of Computer Science, 1990, 382 - 391
|
 |
KSS
|
Michael J. Kearns , Robert E. Schapire , Linda M. Sellie, Toward efficient agnostic learning, Proceedings of the fifth annual workshop on Computational learning theory, p.341-352, July 27-29, 1992, Pittsburgh, Pennsylvania, United States
[doi> 10.1145/130385.130424]
|
| |
M
|
|
| |
Me
|
|
| |
PS
|
|
| |
Q
|
|
| |
T
|
M. Talagrand, Sharper bounds for empirical processes, to appear in Annals of Probability and its Applications
|
 |
V 84
|
|
| |
V 85
|
L.G. Valiant, Learning disjunctions of conjunctions, Proc. of the 9th Intern. Joint Conf. on Art. Int., 1985, 560 - 566
|
| |
WGT
|
|
| |
WK 90
|
S.M. Weiss, I. Kapouleas, An empirical comparison of paltern recognition, neural nets, and machine learning classification methods, Proc. of the 11th Int. Joint Conf. on Art. Int. 1990, Morgan Kaufmann, 781 - 787
|
| |
WK 91
|
S.M. Weiss, C. A. Kulikowski, Computer Systems thai Learn, 1991, Morgan Kaufmann
|
REVIEW
"Sally Goldman : Reviewer"
Most work in computational learning theory assumes the learner
knows a set of classification rules (or hypotheses), one of which
correctly classifies most examples of interest. While no such knowledge
is available for most real-life problems,
more...
|