ACM Home Page
Please provide us with feedback. Feedback
Analysis and comparison of two general sparse solvers for distributed memory computers
Full text PdfPdf (1.03 MB)
Source ACM Transactions on Mathematical Software (TOMS) archive
Volume 27 ,  Issue 4  (December 2001) table of contents
Pages: 388 - 421  
Year of Publication: 2001
ISSN:0098-3500
Authors
Patrick R. Amestoy  ENSEEIHT-IRIT
Iain S. Duff  CERFACS and Rutherford Appleton Laboratoy
Jean-Yves L'excellent  ENSEEIHT-IRIT and LIP-ENS Lyon
Xiaoye S. Li  NERSC-Lawrence Berkeley National Laboratory
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 73,   Citation Count: 9
Additional Information:

abstract   references   cited by   index terms   review   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/504210.504212
What is a DOI?

ABSTRACT

This paper provides a comprehensive study and comparison of two state-of-the-art direct solvers for large sparse sets of linear equations on large-scale distributed-memory computers. One is a multifrontal solver called MUMPS, the other is a supernodal solver called superLU. We describe the main algorithmic features of the two solvers and compare their performance characteristics with respect to uniprocessor speed, interprocessor communication, and memory requirements. For both solvers, preorderings for numerical stability and sparsity play an important role in achieving high parallel efficiency. We analyse the results with various ordering algorithms. Our performance analysis is based on data obtained from runs on a 512-processor Cray T3E using a set of matrices from real applications. We also use regular 3D grid problems to study the scalability of the two solvers.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
AMESTOY,P.R.AND DUFF, I. S. 1993. Memory management issues in sparse multifrontal methods on multiprocessors. Int. J. Supercomput. Appl. 7, 64-82.
 
3
AMESTOY, P. R., DUFF,I.S.,AND L' EXCELLENT, J.-Y. 2000. Multifrontal parallel distributed symmetric and unsymmetric solvers. Comput. Methods Appl. Mech. Eng., 501-520.
 
4
 
5
AMESTOY, P. R., DUFF,I.S.,AND PUGLISI, C. 1996. Multifrontal QR factorization in a multiprocessor environment. Int. J. Num. Lin. Alg. Appl. 3(4), 275-300.
 
6
AMESTOY,P.R.,AND PUGLISI, C. 2000. An unsymmetrized multifrontal LU factorization. Tech. Rep. RT/APO/00/3, ENSEEIHT-IRIT. Also Lawrence Berkeley National Laboratory report LBNL-46474.
 
7
ARIOLI, M., DEMMRL,J.,AND DUFF, I. S. 1989. Solving sparse linear systems with sparse backward error. SIAM J. Matrix Anal. Appl. 10, 165-190.
 
8
ASHCRAFT,C.AND GRIMES, R. G. 1999. SPOOLES: An object oriented sparse matrix library. In Proceedings of the Ninth SIAM Conference on Parallel Processing for Scientific Computing. San Antonio, Texas. http://www.netlib.org/linalg/spooles.
 
9
 
10
BORCHARDT, J., GRUND,F.,AND HORN, D. 1997. Parallel numerical methods for large systems of differential-algebraic equations in industrial applications. Tech. Rep. 382, Weierstra-Institut fur Angewandte Analysis und Stochastik, Berlin.
 
11
 
12
 
13
DUFF,I.S.,GRIMES,R.G.,AND LEWIS, J. G. 1997. The Rutherford-Boeing Sparse Matrix Collection. Tech. Rep. RAL-TR-97-031, Rutherford Appleton Laboratory.
 
14
 
15
 
16
 
17
 
18
GUPTA, A. 2000. WSMP: Watson Sparse Matrix Package. Tech. rep., IBM research division, T. J. Watson Research Center, Yorktown Heights. http://www.cs.umn.edu/ agupta/wsmp.html.
 
19
 
20
 
21
HEATH,M.T.AND RAGHAVAN, P. 1997. Performance of a fully parallel sparse solver. Int. J. Supercomput. Appl. 11, 1, 49-64.
 
22
 
23
HSL. 2000. A collection of Fortran codes for large scale scientific computation. http://www. cse.clrc.ac.uk/Activity/HSL.
 
24
KARYPIS,G.AND KUMAR, V. 1998. METIS "A Software Package for Partitioning Unstructured Graphs, Partitioning Meshes, and Computing Fill-Reducing Orderings of Sparse Matrices" Version 4.0. University of Minnesota.
 
25
 
26
LI,X.S.AND DEMMEL, J. W. 1999. A scalable sparse direct solver using static pivoting. In Proceedings of the Ninth SIAM Conference on Parallel Processing for Scientific Computing. San Antonio, Texas. http://www.siam.org/meetings/pp99/.
 
27
LI,X.S.,DEMMEL,J.W.,BAILEY, D. H., HENRY, G., HIDA, Y., ISKANDAR, J., KAHAN, W., KAPUR, A., MARTIN,M.C.,TUNG,T.,AND YOO, D. J. 2000. Design, Implementation and Testing of Extended and Mixed Precision BLAS. Tech. Rep. LBNL-45991, Lawrence Berkeley National Laboratory, Berkeley. June. (also LAPACK Working Note #149). Submitted to ACM Trans. Math. Soft.
28
 
29
NAGEL, W. E., ARNOLD, A., WEBER, M., HOPPE, H.-C., AND SOLCHENRACH, K. 1996. VAMPIR: Visualization and Analysis of MPI resources. Supercomput. 12, 1 (Jan.), 69-80.
 
30
 
31
ROTHBEEG, E. 1994 Efficient sparse Cholesky factorization on distributed-memory multiprocessors. In Proceedings Fifth SIAM Conference on Applied Linear Algebra, J. Lewis Ed. SIAM Press, Philadelphia, 141.
 
32
SCHENK, O., GARTNER, K., AND FICHTNER, W. 2000 Efficient sparse LU factorization with left-right looking strategy on shared memory multiprocessors. BIT 40, 1, 158-176.

CITED BY  9


REVIEW

"Maurice W. Benson : Reviewer"

The authors present a detailed comparison of two direct sparse linear system solvers, MUMPS and SuperLU, in distributed memory implementations using message passing. These two methods form a good basis for study, since they illustrate different st  more...

Collaborative Colleagues:
Patrick R. Amestoy: colleagues
Iain S. Duff: colleagues
Jean-Yves L'excellent: colleagues
Xiaoye S. Li: colleagues