ACM Home Page
Please provide us with feedback. Feedback
Nomograms for visualizing support vector machines
Full text PdfPdf (427 KB)
Source International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining table of contents
Chicago, Illinois, USA
SESSION: Research track paper table of contents
Pages: 108 - 117  
Year of Publication: 2005
ISBN:1-59593-135-X
Authors
Aleks Jakulin  Jožef Stefan Institute, Ljubljana, Slovenia
Martin Možina  University of Ljubljana, Slovenia
Janez Demšar  University of Ljubljana, Slovenia
Ivan Bratko  University of Ljubljana, Slovenia
Blaž Zupan  University of Ljubljana, Slovenia
Sponsors
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 70,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1081870.1081886
What is a DOI?

ABSTRACT

We propose a simple yet potentially very effective way of visualizing trained support vector machines. Nomograms are an established model visualization technique that can graphically encode the complete model on a single page. The dimensionality of the visualization does not depend on the number of attributes, but merely on the properties of the kernel. To represent the effect of each predictive feature on the log odds ratio scale as required for the nomograms, we employ logistic regression to convert the distance from the separating hyperplane into a probability. Case studies on selected data sets show that for a technique thought to be a black-box, nomograms can clearly expose its internal structure. By providing an easy-to-interpret visualization the analysts can gain insight and study the effects of predictive factors.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
G. W. Brier. Verification of forecasts expressed in terms of probability. Weather Rev, 78:1--3, 1950.
 
3
C.-C. Chang and C.-J. Lin. LIBSVM: a library for support vector machines, 2005. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm.
 
4
J. Demšar and B. Zupan. Orange: From experimental machine learning to interactive data mining, 2004. White Paper (http://www.ailab.si/orange) Faculty of Computer and Information Science, University of Ljubljana, Slovenia.
 
5
 
6
T. L. Hankins. Blood, dirt, and nomograms: A particular history of graphs. ISIS: Journal of the History of Science in Society, 90:50--80, 1999.
 
7
F. E. Harrell. Regression modeling strategies: with applications to linear models, logistic regression, and survival analysis. Springer, New York, 2001.
 
8
D. Harrison and D. L. Rubinfeld. Hedonic prices and the demand for clean air. J Environ Economics & Management, 5:81--102, 1978.
 
9
T. Hastie and R. Tibshirani. Generalized Additive Models. Chapman and Hall, London, 1990.
 
10
S. Hettich and S. D. Bay. The UCI KDD archive. Irvine, CA: University of California, Department of Information and Computer Science, 1999.
 
11
D. W. Hosmer and S. Lemeshow. Applied Logistic Regression. John Wiley & Sons, New York, 2000.
 
12
A. Jakulin. Extensions to the Orange data mining framework, January 2002. http://www.ailab.si/aleks/orng/.
 
13
A. Jakulin and I. Bratko. Analyzing attribute dependencies. In N. Lavrač, D. Gamberger, H. Blockeel, and L. Todorovski, editors, Proc. of Principles of Knowledge Discovery in Data (PKDD), volume 2838 of LNAI, pages 229--240. Springer-Verlag, September 2003.
14
 
15
M. W. Kattan, J. A. Eastham, A. M. Stapleton, T. M. Wheeler, and P. T. Scardino. A preoperative nomogram for disease recurrence following radical prostatectomy for prostate cancer. J Natl Cancer Inst, 90(10):766--71, 1998.
16
 
17
J. Lubsen, J. Pool, and E. van der Does. A practical device for the application of a diagnostic or prognostic function. Methods of Information in Medicine, 17:127--129, 1978.
 
18
A. J. Miller. Algorithm AS 274: Least squares routines to supplement those of Gentleman. Appl. Statist., 41(2):458--478, 1992.
 
19
 
20
K. Pelckmans, J.A.K. Suykens, and B. De Moor. Building sparse representations and structure determination on LS-SVM substrates. Neurocomputing, 64:137--159, March 2005.
 
21
J. C. Platt. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. In Advances in Large Margin Classifiers. MIT Press, 1999.
 
22
B. Schölkopf and A. J. Smola. Learning with Kernels. MIT Press, 2002.
 
23


Collaborative Colleagues:
Aleks Jakulin: colleagues
Martin Možina: colleagues
Janez Demšar: colleagues
Ivan Bratko: colleagues
Blaž Zupan: colleagues