ACM Home Page
Please provide us with feedback. Feedback
Optimal algorithms for approximate clustering
Full text PdfPdf (1.08 MB)
Source Annual ACM Symposium on Theory of Computing archive
Proceedings of the twentieth annual ACM symposium on Theory of computing table of contents
Chicago, Illinois, United States
Pages: 434 - 444  
Year of Publication: 1988
ISBN:0-89791-264-0
Authors
Tomás Feder  Computer Science Department, Stanford University, Stanford, CA
Daniel Greene  Xerox Palo Alto Research Center, Palo Alto, CA
Sponsor
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 24,   Downloads (12 Months): 166,   Citation Count: 54
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/62212.62255
What is a DOI?

ABSTRACT

In a clustering problem, the aim is to partition a given set of n points in d-dimensional space into k groups, called clusters, so that points within each cluster are near each other. Two objective functions frequently used to measure the performance of a clustering algorithm are, for any L4 metric, (a) the maximum distance between pairs of points in the same cluster, and (b) the maximum distance between points in each cluster and a chosen cluster center; we refer to either measure as the cluster size. We show that one cannot approximate the optimal cluster size for a fixed number of clusters within a factor close to 2 in polynomial time, for two or more dimensions, unless P=NP. We also present an algorithm that achieves this factor of 2 in time &Ogr;(n log k), and show that this running time is optimal in the algebraic decision tree model. For a fixed cluster size, on the other hand, we give a polynomial time approximation scheme that estimates the optimal number of clusters under the second measure of cluster size within factors arbitrarily close to 1. Our approach is extended to provide approximation algorithms for the restricted centers, suppliers, and weighted suppliers problems that run in optimal &Ogr;(n log k) time and achieve optimal or nearly optimal approximation bounds.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
Ba83
Brenda S. Baker, "Approximation algorithms for NP-complete problems on planar graphs." Proc. 24th IEEE FOCS (1983), pp. 265 273.
Be83
 
Br77
P. Bruckcr, "On the complexity of clustering problems," in Optimization and Operations Research (R. Hcnn, B. Kortc, W. Olctti eds.), Springer-Verlag, Berlin, Heidelberg, New York (1977), pp. 45-54.
 
Eq87
William Equitz, "Fast algorithms for vector quantization picture coding," Proc. International Conference on Acoustics, Speech, an(t Signal Processing (1987), I)P- 725 728.
 
FPT81
R.J. Fowler, M.S. Paterson and S.L. Tanimoto, "Optimal packing and covering in the plane are NP-complete," Inf. Proc. Letters, Vol. 12, No. 3 (1981), pp. 133 137.
 
GJ77
M.R. Garey and D.S. Johnson, "The rectilinear Steiner tree problem is NP- complete," Siam J. Appl. Math. 32 (1977), pp. 826-834.
 
GJ79
 
Go85
Te6filo F. Gonz~ez, "Clustering to minimize the maximum interclustcr distance," Theoretical Computer Science 38, North- Holland (1985), pp. 293 306.
 
Gr84
Robert M. Gray, "Vector quantization," IEEE ASSP Magazim:, April (1984), pp. 4-29.
 
HS85
D.S. Hochbaum and D.B. Shmoys, "A best possible heuristic for the k-center problem," Mathematics of Operations Research, Vol. 10, No. 2 (1985), the. 180 184.
HS86
 
HN79
W.L. Hsu and G.L. Nemhauser, "Easy and hard bottleneck location problems," Discr. Appl. Math. 1 (1979), pp. 209 215.
 
MS84
N. Megiddo and K.J. Supowit, "On the complexity of some common geometric location problems," Siam J. Comput., Vol. 13, No. 1 (1984), pp. 182 196.
 
Me88
Stuart G. Mentzer, "Lower bounds on metric k-center problems," Manuscript, 1988.
 
PS85
 
Va86
Pravin M. Vaidya, "An optimal algorithm for the all-nearest-neighbors problent," Proc. 27th IEEE FOCS (1986), pp. 117 122.
 
Vs87
Stephen Vavasis, personal communication.

CITED BY  54

Collaborative Colleagues:
Tomás Feder: colleagues
Daniel Greene: colleagues