|
ABSTRACT
Triangular matrix equations appear naturally in estimating the condition numbers of matrix equations and different eigenspace computations, including block-diagonalization of matrices and matrix pairs and computation of functions of matrices. To solve a triangular matrix equation is also a major step in the classical Bartels--Stewart method for solving the standard continuous-time Sylvester equation (AX − XB = C). We present novel recursive blocked algorithms for solving one-sided triangular matrix equations, including the continuous-time Sylvester and Lyapunov equations, and a generalized coupled Sylvester equation. The main parts of the computations are performed as level-3 general matrix multiply and add (GEMM) operations. In contrast to explicit standard blocking techniques, our recursive approach leads to an automatic variable blocking that has the potential of matching the memory hierarchies of today's HPC systems. Different implementation issues are discussed, including when to terminate the recursion, the design of new optimized superscalar kernels for solving leaf-node triangular matrix equations efficiently, and how parallelism is utilized in our implementations. Uniprocessor and SMP parallel performance results of our recursive blocked algorithms and corresponding routines in the state-of-the-art libraries LAPACK and SLICOT are presented. The performance improvements of our recursive algorithms are remarkable, including 10-fold speedups compared to standard algorithms.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
E. Anderson , Z. Bai , C. Bischof , L. S. Blackford , J. Demmel , Jack J. Dongarra , J. Du Croz , S. Hammarling , A. Greenbaum , A. McKenney , D. Sorensen, LAPACK Users' guide (third ed.), Society for Industrial and Applied Mathematics, Philadelphia, PA, 1999
|
 |
2
|
|
 |
3
|
|
| |
4
|
Chu, K.-W. E. 1987. The solution of the matrix equation AXB − CXD = Y and (YA − DZ, YC − BZ) = (E,F). Linear Algebra Appl. 93, 93--105.
|
 |
5
|
|
 |
6
|
|
 |
7
|
|
| |
8
|
|
 |
9
|
|
 |
10
|
|
| |
11
|
Golub, G., Nash, S., and Van Loan, C. 1979. A Hessenberg--Schur method for the matrix problem AX + XB = C. IEEE Trans. Autom. Contr. AC-24, 6, 909--913.
|
| |
12
|
|
| |
13
|
|
| |
14
|
Fred G. Gustavson , André Henriksson , Isak Jonsson , Bo Kågström , Per Ling, Recursive Blocked Data Formats and BLAS's for Dense Linear Algebra Algorithms, Proceedings of the 4th International Workshop on Applied Parallel Computing, Large Scale Scientific and Industrial Problems, p.195-206, June 14-17, 1998
|
| |
15
|
Fred G. Gustavson , André Henriksson , Isak Jonsson , Bo Kågström , Per Ling, Superscalar GEMM-based Level 3 BLAS - The On-going Evolution of a Portable and High-Performance Library, Proceedings of the 4th International Workshop on Applied Parallel Computing, Large Scale Scientific and Industrial Problems, p.207-215, June 14-17, 1998
|
| |
16
|
Gustavson, F. and Jonsson, I. 2000. Minimal-storage high-performance Cholesky factorization via blocking and recursion, IBM J. Res. Dev. 44, 6 (Nov.), 823--849.
|
| |
17
|
Hager, W. W. 1984. Condition estimates. SIAM J. Sci. Stat. Comp. 5, 311--316.
|
| |
18
|
Hammarling, S. J. 1982. Numerical solution of the stable, non-negative definite Lyapunov equation. IMA J. Numer. Anal. 2, 303--323.
|
 |
19
|
|
| |
20
|
Higham, N. J. 1993. Perturbation theory and backward error for AX -- XB = C. BIT 33, 124--136.
|
| |
21
|
|
| |
22
|
|
| |
23
|
Jonsson, I. and Kågström, B. 2001b. Recursive blocked algorithms for solving triangular matrix equations---Part I: One-sided and coupled Sylvester-type equations. SLICOT Working Note 2001-4. Dept. of Computing Science, Umeå University, SE-901 87 Umeå, Sweden.
|
 |
24
|
|
| |
25
|
Kågström, B. 1977a. Bounds and perturbation bounds for the matrix exponential. BIT 17, 39--57.
|
| |
26
|
Kågström, B. 1977b. Numerical computation of matrix functions. Tech. Rep. UMINF-58.77, Institute of Information Processing, Umeå University, S-901 87 Umeå , Sweden.
|
| |
27
|
Kågström, B. 1993. A direct method for reordering eigenvalues in the generalized real Schur form of a regular matrix pair (A, B). In Linear Algebra for Large Scale and Real--Time Applications, Kluwer Academic, Amsterdam, 195--218.
|
| |
28
|
|
 |
29
|
|
 |
30
|
|
| |
31
|
|
 |
32
|
|
| |
33
|
Kågström, B. and Poromaa, P. 1996b. Computing eigenspaces with specified eigenvalues of a regular matrix pair (A, B) and condition estimation. Numer. Alg. 12, 369--407.
|
| |
34
|
Kågström, B. and Van Dooren, P. 1992. A generalized state-space approach for the additive decomposition of a transfer matrix. Int. J. Numer. Linear Algebra Appl. 1, 165--181.
|
| |
35
|
Kågström, B., and Westin, L. 1989. Generalized Schur methods with condition estimators for solving the generalized Sylvester equation. IEEE Trans. Autom. Contr. 34, 4, 745--751.
|
| |
36
|
Kressner, D., Mehrmann, V., and Penzl, T. 1999a. CTLEX---A collection of benchmark examples for continuous-time Lyapunov equations. SLICOT Working Note 1999-6.
|
| |
37
|
Kressner, D., Mehrmann, V., and Penzl, T. 1999b. DTLEX---A collection of benchmark examples for discrete-time Lyapunov equations. SLICOT Working Note 1999-7.
|
| |
38
|
Lindkvist, A. 2000. High-performance recursive BLAS kernels using new data formats for the QR factorization. Master's Thesis, Rep. UMNAD-325.00, Department of Computing Science, Umeå University, SE-901 87 Umeå, Sweden.
|
| |
39
|
Moler, C. B. and Stewart, G. W. 1973. An algorithm for generalized matrix eigenvalue problems. SIAM J. Numer. Anal. 10, 241--256.
|
| |
40
|
Moler, C. B. and Van Loan, C. F. 1978. Nineteen dubious ways to compute the exponential of a matrix. SIAM Rev. 20, 801--836.
|
| |
41
|
|
| |
42
|
OpenMP 2000. Fortran Application Program Interface, Version 2.0, November, www.openmp. org/specs/.
|
| |
43
|
Penzl, T. 1998. Numerical solution of generalized Lyapunov equations. Adv. Comp. Math. 8, 33--48.
|
| |
44
|
Poromaa, P. 1997. High performance computing: Algorithms and library software for Sylvester equations and certain eigenvalue problems with applications in condition estimation. PhD Thesis UMINF-97.16, Department of Computing Science, Umeå University, SE-901 87 Umeå, Sweden, June.
|
| |
45
|
|
| |
46
|
SLICOT 2001. Library and the numerics in control network (NICONET) Web site: www.win. tue.nl/niconet/index.html, Academic Press, San Diego, Calif.
|
| |
47
|
Stewart, G. W. and Sun, J-G. 1990. Matrix Perturbation Theory. Academic, San Diego.
|
| |
48
|
|
| |
49
|
Van Loan, C. F. 1977. The sensitivity of the matrix exponential. SIAM J. Numer. Anal. 14, 971--981.
|
INDEX TERMS
Primary Classification:
G.
Mathematics of Computing
G.4
MATHEMATICAL SOFTWARE
Subjects:
Algorithm design and analysis
Additional Classification:
F.
Theory of Computation
F.2
ANALYSIS OF ALGORITHMS AND PROBLEM COMPLEXITY
F.2.1
Numerical Algorithms and Problems
Subjects:
Computations on matrices
G.
Mathematics of Computing
G.1
NUMERICAL ANALYSIS
G.1.3
Numerical Linear Algebra
Subjects:
Conditioning;
Linear systems (direct and iterative methods)
G.4
MATHEMATICAL SOFTWARE
Subjects:
Efficiency;
Parallel and vector implementations;
Reliability and robustness
General Terms:
Algorithms,
Performance
Keywords:
GEMM-based,
LAPACK,
Matrix equations,
SLICOT,
SMP parallelization,
automatic blocking,
generalized coupled Sylvester,
level-3 BLAS,
recursion,
standard Sylvester and Lyapunov,
superscalar
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE Design Automation Conference on
Gwo-Dong Chen
, Daniel D. Gajski
|