ACM Home Page
Please provide us with feedback. Feedback
Coresets in dynamic geometric data streams
Full text PdfPdf (168 KB)
Source Annual ACM Symposium on Theory of Computing archive
Proceedings of the thirty-seventh annual ACM symposium on Theory of computing table of contents
Baltimore, MD, USA
SESSION: Session 5A table of contents
Pages: 209 - 217  
Year of Publication: 2005
ISBN:1-58113-960-8
Authors
Gereon Frahling  University of Paderborn, Germany
Christian Sohler  University of Paderborn, Germany
Sponsors
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 65,   Citation Count: 9
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1060590.1060622
What is a DOI?

ABSTRACT

A dynamic geometric data stream consists of a sequence of m insert/delete operations of points from the discrete space 1,…,Δd [26]. We develop streaming (1 + ε)-approximation algorithms for k-median, k-means, MaxCut, maximum weighted matching (MaxWM), maximum travelling salesperson (MaxTSP), maximum spanning tree (MaxST), and average distance over dynamic geometric data streams. Our algorithms maintain a small weighted set of points(a coreset) that approximates with probability 2/3 the current point set with respect to the considered problem during the m insert/delete operations of the data stream. They use poly (ε-1, log m, log Δ) space and update time per insert/delete operation for constant k and dimension dHaving a coreset one only needs a fast approximation algorithm for the weighted problem to compute a solution quickly. In fact, even an exponential algorithm is sometimes feasible as its running time may still be polynomial in n. For example one can compute in poly(log n, exp(O((1+log (1⁄ε)⁄ε)d-1))) time a solution to k-median and k-means [21] where n is the size of the current point set and k and d are constants. Finding an implicit solution to MaxCut can be done in poly(log n, exp((1⁄ε)O(1))) time. For MaxST and average distance we require poly(log n, ε-1) time and for MaxWM we require O(n3) time to do this.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
3
4
 
5
6
 
7
T. Chan, B. Sadjad. Geometric Optimization Problems Over Sliding Windows. Proc 15th Annual International Symposium on Algorithms and Computation (ISAAC), pp. 246--258, 2004.
8
 
9
G. Cormode and S. Muthukrishnan. Radial histograms for spatial streams. DIMACS Technical Report 2003-11, 2003.
 
10
G. Cormode and S. Muthukrishnan. Improved Data Stream Summaries: The Count-Min Sketch and its Applications. Proc 6th Latin American Theoretical Informatics (LATIN), pp. 29--38, 2004.
 
11
G. Cormode and S. Muthukrishnan and I. Rozenbaum. Summarizing and Mining Inverse Distributions on Data Streams via Dynamic Sampling. DIMACS Technical Report 2005-11, 2005.
 
12
J. Feigenbaum, S. Kannan, and J. Zhang. Computing diameter in the streaming and sliding window models. Technical Report YALEU/DCS/TR-1245, Yale University, 2002.
 
13
 
14
15
 
16
17
 
18
19
20
21
22
 
23
P. Indyk. Personal communication, 2004.
 
24
 
25
26
 
27
 
28
 
29
 
30
R. Prim. Shortest Connection Networks and some Generalizations. Bell Systems Technical Journal, 36:1389--1401, 1957.
 
31
32

CITED BY  9

Collaborative Colleagues:
Gereon Frahling: colleagues
Christian Sohler: colleagues