ACM Home Page
Please provide us with feedback. Feedback
Approximation schemes for clustering problems
Full text PdfPdf (250 KB)
Source Annual ACM Symposium on Theory of Computing archive
Proceedings of the thirty-fifth annual ACM symposium on Theory of computing table of contents
San Diego, CA, USA
SESSION: Session 1B table of contents
Pages: 50 - 58  
Year of Publication: 2003
ISBN:1-58113-674-9
Authors
W. Fernandez de la Vega  Université Paris-Sud, France
Marek Karpinski  University of Bonn
Claire Kenyon  Ecole Polytechnique, France
Yuval Rabani  Technion—IIT, Haifa, Israel.
Sponsors
ACM: Association for Computing Machinery
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 62,   Citation Count: 18
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/780542.780550
What is a DOI?

ABSTRACT

Let k be a fixed integer. We consider the problem of partitioning an input set of points endowed with a distance function into k clusters. We give polynomial time approximation schemes for the following three clustering problems: Metric k-Clustering, l 22 k-Clustering, and l22 k-Median. In the k-Clustering problem, the objective is to minimize the sum of all intra-cluster distances. In the k-Median problem, the goal is to minimize the sum of distances from points in a cluster to the (best choice of) cluster center. In metric instances, the input distance function is a metric. In l 22 instances, the points are in R d and the distance between two points x,y is measured by x−y 22 (notice that (R d, ⋅ 22 is not a metric space). For the first two problems, our results are the first polynomial time approximation schemes. For the third problem, the running time of our algorithms is a vast improvement over previous work.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
 
4
5
6
7
 
8
 
9
B. Carl and I. Stephani. Entropy, Compactness and the Approximation of Operators. Cambridge University Press, 1990.
 
10
11
 
12
S. Deerwester, S. T. Dumais, T. K. Landauer, G. W. Furnas, and R. A. Harshman. Indexing by latent semantic analysis. Journal of the Society for Information Science, 41(6):391--407, 1990.
 
13
 
14
 
15
 
16
 
17
18
 
19
 
20
 
21
 
22
 
23
V. Kann, S. Khanna, J. Lagergren, and A. Panconesi. On the hardness of approximating Max k-Cut and its dual. In Proc. of the 4th Israeli Symp. on Theory of Computing and Systems (ISTCS), 1996. Also in Chicago Journal of Theoretical Computer Science, 1997.
24
 
25
26
27
28
 
29
R. Shamir and R. Sharan. Algorithmic approaches to clustering gene expression data. In T. Jiang, T. Smith, Y. Xu, M.Q. Zhang eds., Current Topics in Computational Biology, MIT Press, to appear.
 
30

CITED BY  19

Collaborative Colleagues:
W. Fernandez de la Vega: colleagues
Marek Karpinski: colleagues
Claire Kenyon: colleagues
Yuval Rabani: colleagues