ACM Home Page
Please provide us with feedback. Feedback
Supervised clustering with support vector machines
Full text PdfPdf (898 KB)
Source ACM International Conference Proceeding Series; Vol. 119 archive
Proceedings of the 22nd international conference on Machine learning table of contents
Bonn, Germany
Pages: 217 - 224  
Year of Publication: 2005
ISBN:1-59593-180-5
Authors
Thomas Finley  Cornell University, Ithaca, NY
Thorsten Joachims  Cornell University, Ithaca, NY
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 66,   Citation Count: 8
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1102351.1102379
What is a DOI?

ABSTRACT

Supervised clustering is the problem of training a clustering algorithm to produce desirable clusterings: given sets of items and complete clusterings over these sets, we learn how to cluster future sets of items. Example applications include noun-phrase coreference clustering, and clustering news articles by whether they refer to the same topic. In this paper we present an SVM algorithm that trains a clustering algorithm by adapting the item-pair similarity measure. The algorithm may optimize a variety of different clustering functions to a variety of clustering performance measures. We empirically evaluate the algorithm for noun-phrase and news article clustering.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
3
4
 
5
Cohen, W., & Richman, J. (2001). Learning to match and cluster entity names. ACM SIGIR workshop on Mathematical/Formal Methods in IR.
 
6
De Bie, T., Momma, M., & Cristianini, N. (2003). Efficiently learning the metric using side-information. ALT2003 (pp. 175--189). Sapporo, Japan: Springer.
 
7
Demaine, E., & Immorlica, N. (2003). Correlation clustering with partial information. RANDOM-APPROX 2003 (pp. 1--13). Princeton, New Jersey.
 
8
Joachims, T. (2003). Learning to align sequences: A maximum-margin approach (Technical Report).
 
9
 
10
11
 
12
McCallum, A., & Wellner, B. (2003). Toward conditional models of identity uncertainty with application to proper noun coreference. IIWeb (pp. 79--84).
 
13
MUC-6 (1995). Proceedings of the Sixth Message Understanding Conference (MUC-6). San Francisco, California: Morgan Kaufmann.
 
14
 
15
Rand, W. M. (1971). Objective criteria for the evaluation of clustering methods. Journal of the American Statistical Association, 66, 846--850.
 
16
 
17
18
 
19
Taskar, B., Guestrin, C., & Koller, D. (2003). Max-margin markov networks. NIPS.
20
 
21
 
22

CITED BY  8
Collaborative Colleagues:
Thomas Finley: colleagues
Thorsten Joachims: colleagues