|
ABSTRACT
We continue our study of high-performance algorithms for solving triangular matrix equations. They appear naturally in different condition estimation problems for matrix equations and various eigenspace computations, and as reduced systems in standard algorithms. Building on our successful recursive approach applied to one-sided matrix equations (Part I), we now present novel recursive blocked algorithms for two-sided matrix equations, which include matrix product terms such as AXBT. Examples are the discrete-time standard and generalized Sylvester and Lyapunov equations. The means for achieving high performance is the recursive variable blocking, which has the potential of matching the memory hierarchies of today's high-performance computing systems, and level-3 computations which mainly are performed as GEMM operations. Different implementation issues are discussed, including the design of efficient new algorithms for two-sided matrix products. We present uniprocessor and SMP parallel performance results of recursive blocked algorithms and routines in the state-of-the-art SLICOT library. Although our recursive algorithms with optimized kernels for the two-sided matrix equations perform more operations, the performance improvements are remarkable, including 10-fold speedups or more, compared to standard algorithms.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
E. Anderson , Z. Bai , C. Bischof , L. S. Blackford , J. Demmel , Jack J. Dongarra , J. Du Croz , S. Hammarling , A. Greenbaum , A. McKenney , D. Sorensen, LAPACK Users' guide (third ed.), Society for Industrial and Applied Mathematics, Philadelphia, PA, 1999
|
 |
2
|
|
| |
3
|
Chu, K.-W. E. 1987. The solution of the matrix equation AXB - CXD = Y and (YA − DZ, YC − BZ) = (E,F). Linear Algebra Appl. 93, 93--105.
|
 |
4
|
|
 |
5
|
|
 |
6
|
|
| |
7
|
Golub, G., Nash, S., and Van Loan, C. 1979. A Hessenberg--Schur method for the matrix problem AX + XB = C. IEEE Trans. Autom. Contr. AC-24, 6, 909--913.
|
| |
8
|
Hager, W. W. 1984. Condition estimates. SIAM J. Sci. Stat. Comp. 5, 311--316.
|
| |
9
|
Hammarling, S. J. 1982. Numerical solution of the stable, non-negative definite Lyapunov equation. IMA J. Numer. Anal. 2, 303--323.
|
 |
10
|
|
| |
11
|
Higham, N. J. 1993. Perturbation theory and backward error for AX − XB = C. BIT 33, 124--136.
|
| |
12
|
Jonsson, I. and Kågström, B. 2001. Recursive blocked algorithms for solving triangular matrix equations---Part II: Two-sided and generalized Sylvester and Lyapunov equations. SLICOT Working Note 2001-5. Department of Computing Science, Umeå University, SE-901 87 Umeå, Sweden.
|
 |
13
|
|
| |
14
|
|
 |
15
|
|
| |
16
|
Kågström, B. and Westin, L. 1989. Generalized Schur methods with condition estimators for solving the generalized Sylvester equation. IEEE Trans. Autom. Contr. 34, 4, 745--751.
|
| |
17
|
Kressner, D., Mehrmann, V., and Penzl, T. 1999a. CTLEX---A collection of benchmark examples for continuous-time Lyapunov equations. SLICOT Working Note 1999--6.
|
| |
18
|
Kressner, D., Mehrmann, V., and Penzl, T. 1999b. DTLEX---A collection of benchmark examples for discrete-time Lyapunov equations. SLICOT Working Note 1999--7.
|
| |
19
|
Penzl, T. 1998. Numerical solution of generalized Lyapunov equations, Adv. Comp. Math. 8, 33--48.
|
| |
20
|
SLICOT 2001. Library and the numerics in control network (NICONET) Web site: www.win. tue.nl/niconet/index.html.
|
INDEX TERMS
Primary Classification:
G.
Mathematics of Computing
G.4
MATHEMATICAL SOFTWARE
Subjects:
Algorithm design and analysis
Additional Classification:
F.
Theory of Computation
F.2
ANALYSIS OF ALGORITHMS AND PROBLEM COMPLEXITY
F.2.1
Numerical Algorithms and Problems
Subjects:
Computations on matrices
G.
Mathematics of Computing
G.1
NUMERICAL ANALYSIS
G.1.3
Numerical Linear Algebra
Subjects:
Conditioning;
Linear systems (direct and iterative methods)
G.4
MATHEMATICAL SOFTWARE
Subjects:
Parallel and vector implementations;
Efficiency;
Reliability and robustness
General Terms:
Algorithms,
Design,
Performance
Keywords:
GEMM-based,
LAPACK,
Matrix equations,
SLICOT,
SMP parallelization,
automatic blocking,
generalized Sylvester and Lyapunov,
level-3 BLAS,
recursion,
standard discrete-time Sylvester and Lyapunov,
superscalar
|