ACM Home Page
Please provide us with feedback. Feedback
SuperLU_DIST: A scalable distributed-memory sparse direct solver for unsymmetric linear systems
Full text PdfPdf (659 KB)
Source ACM Transactions on Mathematical Software (TOMS) archive
Volume 29 ,  Issue 2  (June 2003) table of contents
Pages: 110 - 140  
Year of Publication: 2003
ISSN:0098-3500
Authors
Xiaoye S. Li  Lawrence Berkeley National Laboratory, Berkeley, CA
James W. Demmel  University of California at Berkeley, Berkeley, CA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 66,   Citation Count: 18
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/779359.779361
What is a DOI?

ABSTRACT

We present the main algorithmic features in the software package SuperLU_DIST, a distributed-memory sparse direct solver for large sets of linear equations. We give in detail our parallelization strategies, with a focus on scalability issues, and demonstrate the software's parallel performance and scalability on current machines. The solver is based on sparse Gaussian elimination, with an innovative static pivoting strategy proposed earlier by the authors. The main advantage of static pivoting over classical partial pivoting is that it permits a priori determination of data structures and communication patterns, which lets us exploit techniques used in parallel sparse Cholesky algorithms to better parallelize both LU decomposition and triangular solution on large-scale distributed machines.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Alvarado, F. L., Pothen, A., and Schreiber, R. 1993. Highly parallel sparse triangular solution. In Graph Theory and Sparse Matrix Computation, A. George, J. R. Gilbert, and J. W. Liu, Eds. Springer-Verlag, New York, NY, 159--190.
 
2
 
3
Amestoy, P. R. and Duff, I. S. 1993. Memory management issues in sparse multifrontal methods on multiprocessors. Internat. J. Supercomput. Appl. 7, 1 (Spring), 64--82.
 
4
5
 
6
Amestoy, P. R., Li, X. S., and Ng, E. G. 2003. Diagonal markowitz scheme with local symmetrization. Tech. rep., Lawrence Berkeley National Laboratory. In preparatioon.
 
7
 
8
Arioli, M., Demmel, J. W., and Duff, I. S. 1989. Solving sparse linear systems with sparse backward error. SIAM J. Matrix Anal. Appl. 10, 2 (April), 165--190.
 
9
Ashcraft, C. and Grimes, R. G. 1999. SPOOLES: An object oriented sparse matrix library. In Proceedings of the Ninth SIAM Conference on Parallel Processing for Scientific Computing. (San Antonio, TX. Available online at http://www.netlib.org/linalg/spooles.)
 
10
11
 
12
 
13
 
14
 
15
Davis, T. A. n.d. University of Florida sparse matrix collection. Available online at http://www.cise.ufl.edu/∼davis/sparse.
16
 
17
Davis, T. A., Gilbert, J. R., Larimore, S. I., and Ng, E. 2000. A column approximate minimum degree ordering algorithm. Tech. Rep. TR-00-005, Computer and Information Sciences Department, University of Florida, Gainesville, FL. Submitted to ACM Trans. Math. Software.
 
18
 
19
 
20
 
21
 
22
Duff, I., Grimes, R., and Lewis, J. 1992. Users' guide for the Harwell-Boeing sparse matrix collection (release 1). Tech. Rep. RAL-92-086, Rutherford Appleton Laboratory, Chilton, Didcot, Oxfordshire, U.K.
 
23
 
24
 
25
George, A. 1973. Nested dissection of a regular finite element mesh. SIAM J. Numer. Anal. 10, 345--363.
 
26
 
27
 
28
 
29
 
30
 
31
 
32
Gupta, A. 2000. WSMP: Watson Sparse Matrix Package. Tech. Rep., IBM Research Division, T.J. Watson Research Center, Yorktown Heights, NY. Available online at http://www.cs.umn. edu/∼agupta/wsmp.html.
 
33
Gupta, A. 2001. Improved symbolic and numerical factorization algorithms for unsymmetric sparse matrices. Tech. Rep. RC 22137 (99131), IBM Research Center, Yorktown Heights, NY.
 
34
 
35
Gupta, A. and Kumar, V. 1995. Optimally scalable parallel sparse cholesky factorization. In The 7th SIAM Conference on Parallel Processing for Sci. Comput. 442--447.
 
36
Heath, M. T. and Raghavan, P. 1997. Performance of a fully parallel sparse solver. Internat. J. Supercomput. Appl. 11, 1, 49--64.
 
37
Hendrickson, B. and Leland, R. 1993. The CHACO User's Guide. Version 1.0. Tech. Rep. SAND93-2339 • UC-405, Sandia National Laboratories, Albuquerque, NM.
 
38
 
39
HSL. 2000. A collection of Fortran codes for large scale scientific computation. Available online at http://www.cse.clrc.ac.uk/Activity/HSL.
 
40
 
41
Karypis, G. and Kumar, V. 1998. MeTiS---A Software Package for Partitioning Unstructured Graphs, Partitioning Meshes, and Computing Fill-Reducing Orderings of Sparse Matrices---Version 4.0. University of Minnesota, Minneapolis, MN.
 
42
 
43
Lehoucq, R., Maschhoff, K., Sorensen, D., and Yang, C. n.d. Parallel ARPACK. Available online at http://www.caam.rice.edu/∼kristyn/parpack_home.html.
 
44
 
45
Li, X. S. and Demmel, J. W. 2002. SuperLU_DIST: A scalable distributed-memory sparse direct solver for unsymmetric linear systems. Tech. Rep. LBNL-49388, Lawrence Berkeley National Laboratory, Berkeley, CA.
46
47
 
48
 
49
Markowitz, H. M. 1957. The elimination form of the inverse and its application to linear programming. Management Sci. 3, 255--269.
 
50
MPI. n.d. Message Passing Interface (MPI) forum. Available online at http://www.mpi-forum.org/.
 
51
Olshowka, M. and Neumaier, A. 1996. A new pivoting strategy for Gaussian elimination. Linear Algebra Appl. 240, 131--151.
 
52
Raghavan, P. 1995. Efficient parallel sparse triangular solution with selective inversion. Tech. Rep. CS-95-314, Department of Computer Science, University of Tennessee, Knoxville, TN.
 
53
Rescigno, T. N., Baertschy, M., Isaacs, W. A., and McCurdy, C. W. 1999. Collisional breakup in a quantum system of three charged particles. Science 286, 5449, 2474--2479.
 
54
Riedy, J. 2003. Parallel bipartite matching for sparse matrix computation. In preparation.
 
55
 
56
Saad, Y. n.d. SPARSKIT: A basic tool-kit for sparse matrix computations (Version 2). University of Minnesota, Minneapolis, MN. Available online at http://www.cs.umn.edu/Research/arpa/SPARSKIT/sparskit.html.
 
57
 
58
Schenk, O., Gärtner, K., and Fichtner, W. 2000. Efficient sparse LU factorization with left--right looking strategy on shared memory multiprocessors. BIT 40, 1, 158--176.

CITED BY  18

Collaborative Colleagues:
Xiaoye S. Li: colleagues
James W. Demmel: colleagues