|
ABSTRACT
Despite extensive study over the last four decades and numerous applications, no I/O-efficient algorithm is known for the union-find problem. In this paper we present an I/O-efficient algorithm for the batched (off-line) version of the union-find problem. Given any sequence of N union and find operations, where each union operation joins two distinct sets, our algorithm uses O(sort(N)) = O(N/BlogM/BN/B) I/Os, where M is the memory size and B is the disk block size. This bound is asymptotically optimal in the worst case. If there are union operations that join a set with itself, our algorithm uses O(sort(N) + mst(N)) I/Os, where mst(N) is the number of I/Os needed to compute the minimum spanning tree of a graph with N edges. We also describe a simple and practical O(sort(N)log(N/M))-I/O algorithm for this problem, which we have implemented.We are interested in the union-find problem because of its applications in terrain analysis. A terrain can be abstracted as a height function defined over R2, and many problems that deal with such functions require a union-find data structure. With the emergence of modern mapping technologies, huge amount of elevation data is being generated that is too large to fit in memory, thus I/O-efficient algorithms are needed to process this data efficiently. In this paper, we study two terrain analysis problems that benefit from a union-find data structure: (i) computing topological persistence and (ii) constructing the contour tree. These structures have important applications such as terrain modeling, flow analysis, topological feature extraction, etc. We give the first O(sort(N))-I/O algorithms for these two problems, assuming that the input terrain is represented as a triangular mesh with N vertices.Finally, we report some preliminary experimental results, showing that our algorithms give order-of-magnitude improvement over previous methods on large data sets that do not fit in memory.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
P. K. Agarwal, L. Arge, and K. Yi. I/O-efficient construction of constrained Delaunay triangulations. In Proc. 13rd European Sympos. Algorithms, pages 355--366, 2005.
|
 |
2
|
Pankaj K. Agarwal , Herbert Edelsbrunner , John Harer , Yusu Wang, Extreme elevation on a 2-manifold, Proceedings of the twentieth annual symposium on Computational geometry, June 08-11, 2004, Brooklyn, New York, USA
[doi> 10.1145/997817.997871]
|
 |
3
|
|
| |
4
|
|
| |
5
|
|
| |
6
|
Lars Arge , Jeffrey S. Chase , Patrick Halpin , Laura Toma , Jeffrey S. Vitter , Dean Urban , Rajiv Wickremesinghe, Efficient Flow Computation on Massive Grid Terrain Datasets, Geoinformatica, v.7 n.4, p.283-313, December 2003
[doi> 10.1023/A:1025526421410]
|
| |
7
|
|
| |
8
|
|
| |
9
|
|
| |
10
|
P. T. Bremer, V. Pascucci, H. Edelsbrunner, and B. Hamann. A topological hierarchy for functions on triangulated surfaces. IEEE Trans. Vis. Comput. Graphics, 10:385--396, 2004.
|
| |
11
|
|
| |
12
|
Yi-Jen Chiang , Michael T. Goodrich , Edward F. Grove , Roberto Tamassia , Darren Erik Vengroff , Jeffrey Scott Vitter, External-memory graph algorithms, Proceedings of the sixth annual ACM-SIAM symposium on Discrete algorithms, p.139-149, January 22-24, 1995, San Francisco, California, United States
|
| |
13
|
|
| |
14
|
R. Dementiev, P. Sanders, D. Schultes, and J. Sibeyn. Engineering an external memory minimum spanning tree algorithm. In Proc. 3rd IFIP Intl. Conf. on Theoretical Computer Science, pages 195--208, 2004.
|
 |
15
|
|
| |
16
|
|
| |
17
|
Environmental Systems Research Inc. ARC/INFO Professional GIS, 1997. Version 7.1.2.
|
| |
18
|
F. W. Fredman and D. E. Willard. Trans-dichotomous algorithms for minimum spanning trees and shortest paths. In Proc. 31st IEEE Sympos. Found. Comput. Sci. pages 718--725, 1990.
|
 |
19
|
|
| |
20
|
H. Gabow and R. Tarjan. A linear time algorithm for a special case of disjoint set union. J. Comput. Syst. Sci., 30:209--221, 1985.
|
 |
21
|
|
| |
22
|
M. T. Goodrich, J.-J. Tsay, D. E. Vengroff, and J. S. Vitter. External-memory computational geometry. In Proc. 34th IEEE Sympos. Found. Comput. Sci., pages 714--723, 1993.
|
| |
23
|
GRASS Development Team. GRASS GIS homepage. http://www.baylor.edu/grass/.
|
| |
24
|
S. Jenson and J. Domingue. Extracting topographic structure from digital elevation data for geographic information system analysis. Photogrammetric Engineering and Remote Sensing, 54(11):1593--1600, 1988.
|
| |
25
|
B. M. E. Moret and H. D. Shapiro. An empirical analysis of algorithms for constructing a minimum spanning tree. In Proc. Workshop Algorithms Data Struct., pages 400--411, 1991.
|
| |
26
|
North Carolina Flood Mapping Program. http://www.ncfloodmaps.com.
|
| |
27
|
J. F. O'Callaghan and D. M. Mark. The extraction of drainage networks from digital elevation data. Computer Vision, Graphics and Image Processing, 28, 1984.
|
| |
28
|
|
 |
29
|
|
 |
30
|
|
| |
31
|
R. E. Tarjan. A class of algorithms that require nonlinear time to maintain disjoint sets. J. Comput. and Sys. Sci., 18:110--127, 1979.
|
 |
32
|
|
| |
33
|
|
 |
34
|
Marc van Kreveld , René van Oostrum , Chandrajit Bajaj , Valerio Pascucci , Dan Schikore, Contour trees and small seed sets for isosurface traversal, Proceedings of the thirteenth annual symposium on Computational geometry, p.212-220, June 04-06, 1997, Nice, France
[doi> 10.1145/262839.269238]
|
 |
35
|
|
| |
36
|
Afra J. Zomorodian , M. J. Ablowitz , S. H. Davis , E. J. Hinch , A. Iserles , J. Ockendon , P. J. Olver, Topology for Computing (Cambridge Monographs on Applied and Computational Mathematics), Cambridge University Press, New York, NY, 2005
|
CITED BY 3
|
|
Pankaj K. Agarwal , Lars Arge , Thomas Mølhave , Bardia Sadri, I/o-efficient efficient algorithms for computing contours on a terrain, Proceedings of the twenty-fourth annual symposium on Computational geometry, June 09-11, 2008, College Park, MD, USA
|
|
|
|
|
|
Andrew Danner , Thomas Mølhave , Ke Yi , Pankaj K. Agarwal , Lars Arge , Helena Mitasova, TerraStream: from elevation data to watershed hierarchies, Proceedings of the 15th annual ACM international symposium on Advances in geographic information systems, November 07-09, 2007, Seattle, Washington
|
|