|
ABSTRACT
In this paper we present a parallel formulation of a multilevel k-way graph partitioning algorithm. The multilevel k-way partitioning algorithm reduces the size of the graph by collapsing vertices and edges (coarsening phase), finds a k-way partition of the smaller graph, and then it constructs a k-way partition for the original graph by projecting and refining the partition to successively finer graphs (uncoarsening phase). A key innovative feature of our parallel formulation is that it utilizes graph coloring to effectively parallelize both the coarsenin and the refinement during the uncoarsening phase. Our algorithm is able to achieve a high degree of concurrency, while maintaining the high quality partitions produced by the serial algorithm. We test our scheme on a large number of graphs from finite element methods, and transportation domains. Our parallel formulation on Cray T3D, produces high quality 128-way partitions on 128 processors in a little over two seconds, for graphs with a million vertices. Thus our parallel algorithm makes it possible to perform dynamic graph partition in adaptive computations without compromising quality.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
Stephen T. Barnard and Horst Simon. A parallel implementation of multilevel recursive spectral bisection for application to adaptive unstructured meshes. In Proceedings of the seventh SIAM conference on Parallel Processing for Scientific Computing, pages 627-632, 1995.
|
| |
3
|
Stephen T. Barnard and Horst D. Simon. A fast multilevel implementation of recursive spectral bisection for partitioning unstructured problems. In Proceedings of the sixth SIAM conference on Parallel Processing for Scientific Computing, pages 711-718, 1993.
|
| |
4
|
T. Bui and C. Jones. A heuristic for reducing fill in sparse matrix factorization. In 6th SIAM Conf. Parallel Processing for Scientific Computing, pages 445-452, 1993.
|
| |
5
|
Chung-Kuan Cheng and Yen-Chuen A. Wei. An improved two-way partitioning algorithm with stable performance. IEEE Transactions on Computer Aided Design, 10(12):1502-1511, December 1991.
|
| |
6
|
Pedro Diniz, Steve Plimpton, Bruce Hendrickson, and Robert Leland. Parallel algorithms for dynamically partitioning unstructured grids. In Proceedings of the seventh SIAM conference on Parallel Processing for Scientific Computing, pages 615-620, 1995.
|
| |
7
|
J. Garbers, H. J. Promel, and A. Steger. Finding clusters in VLSI circuits. In Proceedings of IEEE International Conference on Computer Aided Design, pages 520-523, 1990.
|
| |
8
|
A. George. Nested dissection of a regular finite-element mesh. SIAM Journal on Numerical Ananlysis, 10:345-363, 1973.
|
| |
9
|
|
| |
10
|
|
| |
11
|
|
| |
12
|
Lars Hagen and Andrew Kahng. Fast spectral methods for ratio cut partitioning and clustering. In Proceedings of IEEE International Conference on Computer Aided Design, pages 10-13, 1991.
|
| |
13
|
|
| |
14
|
|
| |
15
|
Bruce Hendrickson and Robert Leland. A multilevel algorithm for partitioning graphs. Technical Report SAND93-1301, Sandia National Laboratories, 1993.
|
| |
16
|
Bruce Hendrickson and Robert Leland. The chaco user's guide, version 2.0. Technical Report SAND94-2692, Sandia National Laboratories, 1994.
|
| |
17
|
Bruce Hendrickson and Edward Rothberg. Improving the runtime and quality of nested dissection ordering. Technical Report CS-96-000, Sandia National Laboratories, 1996.
|
| |
18
|
Zdenek Johan, Kapil K. Mathur, S. Lennart Johnsson, and Thomas J. R. Hughes. Finite element methods on the connection machine cm-5 system. Technical report, Thinking Machines Corporation, 1993.
|
| |
19
|
G. Karypis and V. Kumar. Analysis of multilevel graph partitioning. Technical Report TR 95-037, Department of Computer Science, University of Minnesota, 1995. Also available on WWW at URL http://www.cs.umn.edu/~karypis/papers/mlevel_analysis.ps. A short version appears in Supercomputing 95.
|
| |
20
|
G. Karypis and V. Kumar. A fast and high quality multilevel scheme for partitioning irregular graphs. Technical Report TR 95-035, Department of Computer Science, University of Minnesota, 1995. Also available on WWW at URL http://www.cs.umn.edu/~karypis/papers/mlevel_serial.ps. A short version appears in Intl. Conf. on Parallel Processing 1995.
|
| |
21
|
G. Karypis and V. Kumar. Multilevel k-way partitioning scheme for irregular graphs. Technical Report TR 95-064, Department of Computer Science, University of Minnesota, 1995. Also available on WWW at URL http://www.cs.umn.edu/~karypis/papers/mlevel_kway.ps.
|
| |
22
|
|
| |
23
|
|
| |
24
|
George Karypis and Vipin Kumar. Fast sparse Cholesky factorization on scalable parallel computers. Technical report, Department of Computer Science, University of Minnesota, Minneapolis, MN, 1994. A short version appears in the Eighth Symposium on the Frontiers of Massively Parallel Computation, 1995. Available on WWW at URL http://www.cs.umn.edu/~karypis/papers/frontiers95.ps.
|
| |
25
|
B. W. Kernighan and S. Lin. An efficient heuristic procedure for partitioning graphs. The Bell System Technical Journal, 1970.
|
| |
26
|
|
| |
27
|
|
| |
28
|
Gary L. Miller, Shang-Hua Teng, W. Thurston, and Stephen A. Vavasis. Automatic mesh partitioning. In A. George, John R. Gilbert, and J. W.-H. Liu, editors, Sparse Matrix Computations: Graph Theory Issues and Algorithms. (An IMA Workshop Volume). Springer-Verlag, New York, NY, 1993.
|
| |
29
|
|
| |
30
|
B. Nour-Omid, A. Raefsky, and G. Lyzenga. Solving finite element equations on concurrent computers. In A. K. Noor, editor, American Soc. Mech. Eng, pages 291-307, 1986.
|
 |
31
|
N. Mansour , R. Ponnusamy , A. Choudhary , G. C. Fox, Graph contraction for physical optimization methods: a quality-cost tradeoff for mapping data on parallel computers, Proceedings of the 7th international conference on Supercomputing, p.1-10, July 19-23, 1993, Tokyo, Japan
[doi> 10.1145/165939.165942]
|
| |
32
|
|
| |
33
|
|
| |
34
|
Edward Rothberg. Performance of panel and block approaches to sparse Cholesky factorization on the iPSC/860 and Paragon multicomputers. In Proceedings of the 1994 Scalable High Performance Computing Conference, May 1994.
|
| |
35
|
|
CITED BY 10
|
|
|
|
|
|
|
|
|
|
|
Nilesh Choudhury , Yogesh Mehta , Terry L. Wilmarth , Eric J. Bohm , Laxmikant V. Kalé, Scaling an optimistic parallel simulation of large-scale interconnection networks, Proceedings of the 37th conference on Winter simulation, December 04-07, 2005, Orlando, Florida
|
|
|
|
|
|
|
|
|
|
|
|
Anh Vo , Sarvani Vakkalanka , Michael DeLisi , Ganesh Gopalakrishnan , Robert M. Kirby , Rajeev Thakur, Formal verification of practical MPI programs, Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming, February 14-18, 2009, Raleigh, NC, USA
|
|
|
|
|
|
|
|