ACM Home Page
Please provide us with feedback. Feedback
Multi-classification by categorical features via clustering
Full text PdfPdf (364 KB)
Source ICML; Vol. 307 archive
Proceedings of the 25th international conference on Machine learning table of contents
Helsinki, Finland
Pages 920-927  
Year of Publication: 2008
ISBN:978-1-60558-205-4
Authors
Yevgeny Seldin  The Hebrew University of Jerusalem, Israel
Naftali Tishby  The Hebrew University of Jerusalem, Israel
Sponsors
: Yahoo!
: Xerox
IBM : IBM
: NSF
Microsoft Research : Microsoft Research
: Machine Learning Journal/Springer
: Pascal
: University of Helsinki
: Federation of Finnish Learned Societies
: Intel Corporation
: Google
: Helsinki Institute for Information Technology
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 37,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1390156.1390272
What is a DOI?

ABSTRACT

We derive a generalization bound for multi-classification schemes based on grid clustering in categorical parameter product spaces. Grid clustering partitions the parameter space in the form of a Cartesian product of partitions for each of the parameters. The derived bound provides a means to evaluate clustering solutions in terms of the generalization power of a built-on classifier. For classification based on a single feature the bound serves to find a globally optimal classification rule. Comparison of the generalization power of individual features can then be used for feature ranking. Our experiments show that in this role the bound is much more precise than mutual information or normalized correlation indices.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Asuncion, A., & Newman, D. (2007). UCI machine learning repository. http://www.ics.uci.edu/~mlearn/MLRepository.html.
 
2
Blanchard, G., & Fleuret, F. (2007). Occam's hammer. COLT.
 
3
 
4
 
5
Maurer, A. (2004). A note on the PAC-Bayesian theorem. www.arxiv.org.
 
6
 
7
 
8
Sabato, S., & Shalev-Shwartz, S. (2007). Prediction by categorical features: Generalization properties and application to feature ranking. COLT.
 
9
Seldin, Y., Slonim, N., & Tishby, N. (2007). Information bottleneck for non co-occurrence data. NIPS.
 
10
Shamir, O., Sabato, S., & Tishby, N. (2008). Learning and generalization with the information bottleneck method. Preprint.
 
11

Collaborative Colleagues:
Yevgeny Seldin: colleagues
Naftali Tishby: colleagues