|
ABSTRACT
This paper discusses visual methods that can be used to understand and interpret the results of classification using support vector machines (SVM) on data with continuous real-valued variables. SVM induction algorithms build pattern classifiers by identifying a maximal margin separating hyperplane from training examples in high dimensional pattern spaces or spaces induced by suitable nonlinear kernel transformations over pattern spaces. SVM have been demonstrated to be quite effective in a number of practical pattern classification tasks. Since the separating hyperplane is defined in terms of more than two variables it is necessary to use visual techniques that can navigate the viewer through high-dimensional spaces. We demonstrate the use of projection-based tour methods to gain useful insights into SVM classifiers with linear kernels on 8-dimensional data.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
M. Brown, W. Grundy, D. Lin, N. Christianini, C. Sugnet, T. Furey, M. Ares Jr., and D. Haussler. Knowledge based analysis of microarray gene expression data using support vector machines. TR UCSC CRL-99-09, CRL, Santa Cruz, CA., 1999.
|
| |
3
|
A. Buja, D. Asimov, C. Hurley, and J. A. McDonald. Elements of a Viewing Pipeline for Data Analysis. In W. S. Cleveland and M. E. McGill editors, Dynamic Graphics for Statistics, pages 277-308. Wadsworth, Monterey, CA, 1988.
|
| |
4
|
A. Buja, D. Cook, D. Asimov, and C. Hurley. Dynamic Projections in High-Dimensional Visualization: Theory and Computational Methods. TR, AT&T Labs, Florham Park, N J, 1997.
|
| |
5
|
D. Caragea, A. Silvescu, and V. Honavar. Agents that learn from distributed dynamic data sources. In Proc. of the Workshop on Learning Agents, Agents 2000/ECML 2000, Barcelona, Spain, 53-61, 2000.
|
| |
6
|
D. Caragea, A. Silvescu, and V. Honavar. Analysis and synthesis of agents that learn from distributed dynamic data sources. In S. E. A. Wermter, Ed., Neural Network Architectures Inspired by Neuroscience. Springer-Verlag, 2001.
|
| |
7
|
D. Cook and A. Buja. Manual Controls For High-Dimensional Data Projections. Journal of Computational and Graphical Statistics, 6(4):464-480, 1997. Also see www.public, iastate, edu/ ~dicook/researchlpapers/manip. html.
|
| |
8
|
D. Cook, A. Buja, J. Cabrera, and C. Hurley. Grand Tour and Projection Pursuit. Journal of Comp. and Graphical Statistics, 4(3):155-172, 1995.
|
| |
9
|
|
| |
10
|
D. J. Donnell, A. Buja, and W. Stuetzle. Analysis of Additive Dependencies using Smallest Additive Principle Components. Annals of Statistics, 22, 1994.
|
| |
11
|
M. Forina, C. Armanino, S. Lanteri, and E. Tiscornia. Classification of olive oils from their fatty acid composition. In H. Martens and H. Russwurm Jr., editors, Food Research and Data Analysis, pages 189-214. Applied Science Publishers, London, 1983.
|
| |
12
|
G. Furnas and A. Buja. Prosection Views: Dimensional Inference Through Sections and Projections. Journal of Computational and Graphical Statistics, 3(4):323-385, 1994.
|
| |
13
|
|
| |
14
|
T. Joachims. Making Large-Scale SVM Learning Practical. MIT Press, 1999.
|
| |
15
|
|
| |
16
|
wgg. public, iastate, edu/~dicook/JSS/paper
|
| |
17
|
Limn: www.public.iastate.edu/ dicook/Limn
|
CITED BY
|
|
Justin Talbot , Bongshin Lee , Ashish Kapoor , Desney S. Tan, EnsembleMatrix: interactive visualization to support machine learning with multiple classifiers, Proceedings of the 27th international conference on Human factors in computing systems, April 04-09, 2009, Boston, MA, USA
|
|