|
ABSTRACT
A memory-based learning system is an extended memory management system that decomposes the input space either statically or dynamically into subregions for the purpose of storing and retrieving functional information. The main generalization techniques employed by memory-based learning systems are the nearest-neighbor search, space decomposition techniques, and clustering. Research on memory-based learning is still in its early stage. In particular, there are very few rigorous theoretical results regarding memory requirement, sample size, expected performance, and computational complexity. In this paper, we propose a model for memory-based learning and use it to analyze several methods— &egr;-covering, hashing, clustering, tree-structured clustering, and receptive-fields— for learning smooth functions. The sample size and system complexity are derived for each method. Our model is built upon the generalized PAC learning model of Haussler and is closely related to the method of vector quantization in data compression. Our main result is that we can build memory-based learning systems using new clustering algorithms [LiVb] to PAC-learn in polynomial time using only polynomial storage in typical situations.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
Alba
|
|
| |
Albb
|
J. S. Albus, "A New Approach to Manipulator Control: The Cerebellar Model Articulation Controller (CMAC)," Journal of Dynamic Systems, Measurement, and Control(September 1975), 220-227.
|
| |
Albc
|
J. S. Albus, "Data Storage in the Cerebellar Model Articulation Controller (CMAC)," Journal of Dynamic Systems, Measurement, and Control (September 1975), 228-233.
|
| |
CAW
|
J. L. Carter and M. N. Wegman, "Universal Classes of Hash Functions," Journal of Computer System and Science 18 (April 1979), 143- 154.
|
| |
Chv
|
V. Chv~tal, "A Greedy Heuristic for the Set- Covering Problem," Mathematics of Operations Research 4(1979), 233-235.
|
| |
Dan
|
G. Dantzig, "Programming of Interdependent Activities, II, Mathematical Models," in Activity Analysis of Production and Allocation, John Wiley &; Sons, Inc, New York, 1951, 19-32.
|
| |
DeW
|
|
| |
Dev
|
|
| |
Duda
|
R. M. Dudley, "Central Limit Theorems for Empirical Measures," Annals of Probability 6 (1978), 899-929.
|
| |
Dudb
|
R. M. Dudley, "A Course on Empirical Processes,'' in Lecture Note in Mathematics 1097, 1984.
|
| |
Fri
|
J. H. Friedman, Multivariate Adaptive Regression Splines, Technical Report 102, Standford University, Lab for Computational Statistics, 1988.
|
| |
GaJ
|
|
| |
Ger
|
A. Gersho, "On the Structure of Vector Quantizers," IEEE Transactions on Information Theory 28 (March 1982), 157-166.
|
| |
GeG
|
|
| |
Gra
|
R. M. Gray, "Vector Quantization," IEEE ASSP Magazine (April 1984), 4-29.
|
| |
Haua
|
D. Haussler, "Generalizing the PAC Model: Sample Size Bounds from Metric Dimension- Based Uniform Convergence Results," in Proceedings of the 30th Annual IEEE Symposium on Foundations of Computer Science, 1989, 40-45.
|
| |
Haub
|
D. Haussler, "Decision Theoretic Generalizations of the PAC Model for Neural Nets and Other Learning Applications," UCSC-CRL-89- 30, September 1989 (Revised December 1990, revised again January 1992).
|
| |
HKL
|
David Haussler , Michael Kearns , Nick Littlestone , Manfred K. Warmuth, Equivalence of models for polynomial learnability, Proceedings of the first annual workshop on Computational learning theory, p.42-55, August 03-05, 1988, MIT, Cambridge, Massachusetts, United States
|
| |
HAL
|
|
| |
Joh
|
D. S. Johnson, "Approximation Algorithms for Combinatorial Problems," Journal of Computer and System Sciences 9 (1974), 256-278.
|
| |
KaH
|
O. Kariv and S. L. Hakimi, "An Algorithmic Approach to Network Location Problems. II: The p-Medians," SIAM Journal on Applied Mathematics (1979), 539-560.
|
| |
Kar
|
|
| |
Kha
|
L. G. Khachiyan, "A Polynomial Algorithm in Linear Programming," Soviet Math. Doklady 20 (1979), 191-194.
|
| |
LiVa
|
J.-H. Lin and J. S. Vitter, "Nearly Optimal Vector Quantization via Linear Programming," in Proceedings of the IEEE Data Compression Conference, Snowbird, Utah, March 1992, 22- 31.
|
 |
LiVb
|
|
| |
Lov
|
L. Lov~sz, "On the Ratio of Optimal Integral and Fractional Covers," Discrete Mathematics 13 (1975), a83-390.
|
| |
MeS
|
N. Megiddo and K. J. Supowit, "On the Complexity of Some Common Geometric Location Problems," SIAM Journal on Computing 13 (1984), 182-196.
|
| |
Mil
|
W. T. Miller, "Sensor-Based Control of Robotic Manipulators Using a General Learning Algorithms," IEEE Journal of Robotics and Automation 3 (April 1987), 157-165.
|
| |
MGK
|
|
| |
Moo
|
|
| |
MoD
|
J. Moody and C. Darken, "Learning with Localized Receptive Fields," in Proceedings of the 1988 Connectionist Models Summer School, Morgan Kaufmann Publisher , 1988, 133-143.
|
| |
Mor
|
A. W. Moore, Acquisition of Dynamic Control Knowledge for Robotic Manipulator, Manuscript, 1989.
|
| |
Pap
|
C. H. Papadimitriou, "Worst-case and Probabilistic Analysis of a Geometric Location Problem,'' SIAM Journal on Computing 10(1981), 542-557.
|
| |
PoGa
|
|
| |
PoGb
|
|
| |
Pola
|
D. Pollard, Convergence of Stochastic Processes, Springer-Verlag New York Inc, 1984.
|
| |
Polb
|
D. Pollard, Empirical Processes: Theory and Applications, NSF-CBMS Regional Conference Series in Probability and Statistics Volume 2, 1990.
|
| |
RaA
|
M. V. Ramakrishna and V. Awasthi, Manuscript, December 1991.
|
| |
Ris
|
|
| |
Sau
|
N. Sauer, "On the Density of Families of Sets," Journal of Combinatorial Theory (.4) 13 (1972), 145-147.
|
| |
Sie
|
A. Siegel, Coalesced Hashing is Computably Good, Manuscript, 1991.
|
| |
Vap
|
|
| |
VaC
|
V. N. Vapnik and A. Ya. Chervonenkis, "On the Uniform Convergence of Relative Frequencies of Events to their Probabilities," Theory of Probability and its Applications (1971), 264- 280.
|
| |
ViC
|
|
|