ACM Home Page
Please provide us with feedback. Feedback
I/O-efficient batched union-find and its applications to terrain analysis
Full text PdfPdf (626 KB)
Source Annual Symposium on Computational Geometry archive
Proceedings of the twenty-second annual symposium on Computational geometry table of contents
Sedona, Arizona, USA
SESSION: Session 6 (tuesday, june 6th--10:45 am-12:00 pm) table of contents
Pages: 167 - 176  
Year of Publication: 2006
ISBN:1-59593-340-9
Authors
Pankaj K. Agarwal  Duke University, Durham, NC
Lars Arge  University of Aarhus, Aarhus, Denmark and Duke University, Durham, NC
Ke Yi  Duke University, Durham, NC
Sponsors
ACM: Association for Computing Machinery
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 39,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1137856.1137884
What is a DOI?

ABSTRACT

Despite extensive study over the last four decades and numerous applications, no I/O-efficient algorithm is known for the union-find problem. In this paper we present an I/O-efficient algorithm for the batched (off-line) version of the union-find problem. Given any sequence of N union and find operations, where each union operation joins two distinct sets, our algorithm uses O(sort(N)) = O(N/BlogM/BN/B) I/Os, where M is the memory size and B is the disk block size. This bound is asymptotically optimal in the worst case. If there are union operations that join a set with itself, our algorithm uses O(sort(N) + mst(N)) I/Os, where mst(N) is the number of I/Os needed to compute the minimum spanning tree of a graph with N edges. We also describe a simple and practical O(sort(N)log(N/M))-I/O algorithm for this problem, which we have implemented.We are interested in the union-find problem because of its applications in terrain analysis. A terrain can be abstracted as a height function defined over R2, and many problems that deal with such functions require a union-find data structure. With the emergence of modern mapping technologies, huge amount of elevation data is being generated that is too large to fit in memory, thus I/O-efficient algorithms are needed to process this data efficiently. In this paper, we study two terrain analysis problems that benefit from a union-find data structure: (i) computing topological persistence and (ii) constructing the contour tree. These structures have important applications such as terrain modeling, flow analysis, topological feature extraction, etc. We give the first O(sort(N))-I/O algorithms for these two problems, assuming that the input terrain is represented as a triangular mesh with N vertices.Finally, we report some preliminary experimental results, showing that our algorithms give order-of-magnitude improvement over previous methods on large data sets that do not fit in memory.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
P. K. Agarwal, L. Arge, and K. Yi. I/O-efficient construction of constrained Delaunay triangulations. In Proc. 13rd European Sympos. Algorithms, pages 355--366, 2005.
2
3
 
4
 
5
 
6
 
7
 
8
 
9
 
10
P. T. Bremer, V. Pascucci, H. Edelsbrunner, and B. Hamann. A topological hierarchy for functions on triangulated surfaces. IEEE Trans. Vis. Comput. Graphics, 10:385--396, 2004.
 
11
 
12
 
13
 
14
R. Dementiev, P. Sanders, D. Schultes, and J. Sibeyn. Engineering an external memory minimum spanning tree algorithm. In Proc. 3rd IFIP Intl. Conf. on Theoretical Computer Science, pages 195--208, 2004.
15
 
16
 
17
Environmental Systems Research Inc. ARC/INFO Professional GIS, 1997. Version 7.1.2.
 
18
F. W. Fredman and D. E. Willard. Trans-dichotomous algorithms for minimum spanning trees and shortest paths. In Proc. 31st IEEE Sympos. Found. Comput. Sci. pages 718--725, 1990.
19
 
20
H. Gabow and R. Tarjan. A linear time algorithm for a special case of disjoint set union. J. Comput. Syst. Sci., 30:209--221, 1985.
21
 
22
M. T. Goodrich, J.-J. Tsay, D. E. Vengroff, and J. S. Vitter. External-memory computational geometry. In Proc. 34th IEEE Sympos. Found. Comput. Sci., pages 714--723, 1993.
 
23
GRASS Development Team. GRASS GIS homepage. http://www.baylor.edu/grass/.
 
24
S. Jenson and J. Domingue. Extracting topographic structure from digital elevation data for geographic information system analysis. Photogrammetric Engineering and Remote Sensing, 54(11):1593--1600, 1988.
 
25
B. M. E. Moret and H. D. Shapiro. An empirical analysis of algorithms for constructing a minimum spanning tree. In Proc. Workshop Algorithms Data Struct., pages 400--411, 1991.
 
26
North Carolina Flood Mapping Program. http://www.ncfloodmaps.com.
 
27
J. F. O'Callaghan and D. M. Mark. The extraction of drainage networks from digital elevation data. Computer Vision, Graphics and Image Processing, 28, 1984.
 
28
29
30
 
31
R. E. Tarjan. A class of algorithms that require nonlinear time to maintain disjoint sets. J. Comput. and Sys. Sci., 18:110--127, 1979.
32
 
33
34
35
 
36


Collaborative Colleagues:
Pankaj K. Agarwal: colleagues
Lars Arge: colleagues
Ke Yi: colleagues