|
ABSTRACT
We show how to compute an LU factorization of a matrix when the factors of a leading principle submatrix are already known. The approach incorporates pivoting akin to partial pivoting, a strategy we call incremental pivoting. An implementation using the Formal Linear Algebra Methods Environment (FLAME) application programming interface (API) is described. Experimental results demonstrate practical numerical stability and high performance on an Intel Itanium2 processor-based server.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
Bientinesi, P. and van de Geijn, R. 2006. Representing dense linear algebra algorithms: A farewell to indices. Tech. Rep. FLAME Working Note 17, CS-TR-2006-10, Department of Computer Sciences, The University of Texas at Austin.
|
| |
3
|
Cwik, T., van de Geijn, R., and Patterson, J. 1994. The application of parallel computation to integral equation models of electromagnetic scattering. J. Optic. Soc. Amer. A 11, 4 (Apr.), 1538--1545.
|
| |
4
|
Demmel, J. and Dongarra, J. 2005. LAPACK 2005 prospectus: Reliable and scalable software for linear algebra computations on high end computers. LAPACK Working Note 164 UT-CS-05-546, University of Tennessee. February.
|
| |
5
|
Geng, P., Oden, J. T., and van de Geijn, R. 1996. Massively parallel computation for acoustical scattering problems using boundary element methods. J. Sound Vibra. 191, 1, 145--165.
|
| |
6
|
Goto, K. 2004. TACC software and tools. http://www.tacc.utexas.edu/resources/software/.
|
 |
7
|
|
| |
8
|
|
 |
9
|
|
| |
10
|
Fred G. Gustavson , André Henriksson , Isak Jonsson , Bo Kågström , Per Ling, Superscalar GEMM-based Level 3 BLAS - The On-going Evolution of a Portable and High-Performance Library, Proceedings of the 4th International Workshop on Applied Parallel Computing, Large Scale Scientific and Industrial Problems, p.207-215, June 14-17, 1998
|
| |
11
|
|
| |
12
|
Joffrain, T., Quintana-Ortí, E. S., and van de Geijn, R. A. 2005. Rapid development of high-performance out-of-core solvers. In Proceedings of the Workshop on Applied Parallel Computing (PARA 2004), J. Dongarra et al., eds. Lecture Notes in Computer Science, vol. 3732. Springer, 413--422.
|
| |
13
|
Kågström, B., Ling, P., and Loan, C. V. 1995. Gemm-Based level 3 blas: High-Performance model, implementations and performance evaluation benchmark. LAPACK Working Note no. 107 CS-95-315, University of Tennessee. November.
|
 |
14
|
|
| |
15
|
Klimkowski, K. and van de Geijn, R. 1995. Anatomy of an out-of-core dense linear solver. In Proceedings of the International Conference on Parallel Processing. vol. III - Algorithms and Applications, 29--33.
|
| |
16
|
|
| |
17
|
Stewart, G. W. 1998. Matrix Algorithms. Volume I: Basic Decompositions. SIAM, Philadelphia, PA.
|
| |
18
|
|
| |
19
|
|
 |
20
|
Sivan Toledo , Fred G. Gustavson, The design and implementation of SOLAR, a portable library for scalable out-of-core linear algebra computations, Proceedings of the fourth workshop on I/O in parallel and distributed systems: part of the federated computing research conference, p.28-40, May 27-27, 1996, Philadelphia, Pennsylvania, United States
[doi> 10.1145/236017.236029]
|
| |
21
|
|
| |
22
|
|
| |
23
|
Yip, E. L. 1979. Fortran subroutines for out-of-core solutions of large complex linear systems. Tech. Rep. CR-159142, NASA.
|
|