| Nearest hyperdisk methods for high-dimensional classification |
| Full text |
Pdf
(1.42 MB)
|
| Source
|
ICML; Vol. 307
archive
Proceedings of the 25th international conference on Machine learning
table of contents
Helsinki, Finland
Pages 120-127
Year of Publication: 2008
ISBN:978-1-60558-205-4
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 6, Downloads (12 Months): 38, Citation Count: 0
|
|
|
ABSTRACT
In high-dimensional classification problems it is infeasible to include enough training samples to cover the class regions densely. Irregularities in the resulting sparse sample distributions cause local classifiers such as Nearest Neighbors (NN) and kernel methods to have irregular decision boundaries. One solution is to "fill in the holes" by building a convex model of the region spanned by the training samples of each class and classifying examples based on their distances to these approximate models. Methods of this kind based on affine and convex hulls and bounding hyperspheres have already been studied. Here we propose a method based on the bounding hyperdisk of each class - the intersection of the affine hull and the smallest bounding hypersphere of its training samples. We argue that in many cases hyperdisks are preferable to affine and convex hulls and hyperspheres: they bound the classes more tightly than affine hulls or hyperspheres while avoiding much of the sample overfitting and computational complexity that is inherent in high-dimensional convex hulls. We show that the hyperdisk method can be kernelized to provide nonlinear classifiers based on non-Euclidean distance metrics. Experiments on several classification problems show promising results.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Cevikalp, H., Larlus, D., Douze, M., & Jurie, F. (2007). Local subspace classifiers: Linear and nonlinear approaches. IEEE Workshop on Machine Learning for Signal Processing. Thessaloniki, Greece.
|
| |
3
|
|
| |
4
|
Cevikalp, H., Triggs, B., Jurie, F., & Polikar, R. (2008). Margin-based discriminant dimensionality reduction for visual recognition. IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
|
| |
5
|
Gulmezoglu, M. B., Dzhafarov, V., & Barkana, A. (2001). The common vector approach and its relation to principal component analysis. IEEE Trans. Speech Audio Proc., 9, 655--662.
|
| |
6
|
Hinton, G. E., Dayan, P., & Revow, M. (1997). Modeling the manifolds of images of handwritten digits. IEEE Transactions on Neural Networks, 18, 65--74.
|
| |
7
|
Laaksonen, J. (1997). Subspace classifiers in recognition of handwritten digits. Doctoral dissertation, Helsinki University of Technology.
|
| |
8
|
|
| |
9
|
|
| |
10
|
Nalbantov, G. I., Groenen, P. J. F., & Bioch, J. C. (2007). Nearest convex hull classification (Technical Report). Econometric Institute and Erasmus Research Institute of Management.
|
| |
11
|
Roweis, S. T., & Saul, L. K. (2000). Nonlinear dimensionality reduction by locally linear embedding. Science, 290, 2323--2326.
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
| |
15
|
|
| |
16
|
Vincent, P., & Bengio, Y. (2001). K-local hyperplane and convex distance nearest neighbor algorithms. NIPS.
|
| |
17
|
Wang, J., Neskovic, P., & Cooper, L. N. (2005). Pattern classification via single spheres. Discovery Science (pp. 241--252).
|
|