|
ABSTRACT
We investigate the importance of space when solving problems based on graph distance in the streaming model. In this model, the input graph is presented as a stream of edges in an arbitrary order. The main computational restriction of the model is that we have limited space and therefore cannot store all the streamed data; we are forced to make space-efficient summaries of the data as we go along. For a graph of n vertices and m edges, we show that testing many graph properties, including connectivity (ergo any reasonable decision problem about distances) and bipartiteness, requires Ω(n) bits of space. Given this, we then investigate how the power of the model increases as we relax our space restriction. Our main result is an efficient randomized algorithm that constructs a (2t + 1)-spanner in one pass. With high probability, it uses O(t .n1+1/t log2n) bits of space and processes each edge in the stream in O(t2·n1/t log n) time. We find approximations to diameter and girth via the constructed spanner. For t = Ω(log n/log log n), the space requirement of the algorithm is O(n .polylog n), and the per-edge processing time is O(polylog n). We also show a corresponding lower bound of t for the approximation ratio achievable when the space restriction is O(t.n1+1/t log2n).We then consider the scenario in which we are allowed multiple passes over the input stream. Here, we investigate whether allowing these extra passes will compensate for a given space restriction. We show that finding vertices at distance d from a particular vertex will always take d passes, for all d ∈ {1,...,t/2}, when the space restriction is o(n1+1/t). For girth, we show the existence of a direct trade-off between space and passes in the form of a lower bound on the product of the space requirement and number of passes. Finally, we conclude with two general techniques for speeding up the per-edge computation time of streaming algorithms while increasing the space by at most a log factor.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
Ziv Bar-Yosseff , Ravi Kumar , D. Sivakumar, Reductions in streaming algorithms, with an application to counting triangles in graphs, Proceedings of the thirteenth annual ACM-SIAM symposium on Discrete algorithms, p.623-632, January 06-08, 2002, San Francisco, California
|
| |
5
|
S. Baswana and S. Sen. A simple linear time algorithm for computing a (2k -- 1)-spanner of o(n1+1/k) size in weighted graphs. In Proc. 30th International Colloq. on Automata, Languages and Computing, pages 284--296, 2003.
|
| |
6
|
|
| |
7
|
|
 |
8
|
|
 |
9
|
Michael Elkin , Jian Zhang, Efficient algorithms for constructing (1+,ε, β)-spanners in the distributed and streaming models, Proceedings of the twenty-third annual ACM symposium on Principles of distributed computing, July 25-28, 2004, St. John's, Newfoundland, Canada
[doi> 10.1145/1011767.1011791]
|
 |
10
|
|
| |
11
|
J. Feigenbaum, S. Kannan, A. McGregor, S. Suri, and J. Zhang. On graph problems in a semi-streaming model. In Proc. of 31st International Colloquium on Automata, Languages and Programming, LNCS 3142, pages 531--543, 2004.
|
| |
12
|
|
 |
13
|
Anna C. Gilbert , Sudipto Guha , Piotr Indyk , Yannis Kotidis , S. Muthukrishnan , Martin J. Strauss, Fast, small-space algorithms for approximate histogram maintenance, Proceedings of the thiry-fourth annual ACM symposium on Theory of computing, May 19-21, 2002, Montreal, Quebec, Canada
[doi> 10.1145/509907.509966]
|
| |
14
|
|
 |
15
|
|
| |
16
|
|
| |
17
|
M. R. Henzinger, P. Raghavan, and S. Rajagopalan. Computing on data streams. Technical Report 1998--001, DEC Systems Research Center, 1998.
|
| |
18
|
|
| |
19
|
|
| |
20
|
F. Lazebnik, V. Ustimenko, and A. Woldar. A new series of dense graphs of high girth. Bulletin of the AMS, 32(1):73--79, 1995.
|
| |
21
|
J. Munro and M. Paterson. Selection and sorting with limited storage. Theoretical Computer Science, 12:315--323, 1980.
|
| |
22
|
S. Muthukrishnan. Data streams: Algorithms and applications. 2003. http://athos.rutgers.edu/~muthu/stream-1-1.ps.
|
 |
23
|
|
| |
24
|
|
 |
25
|
|
|