|
ABSTRACT
In this paper we apply a formal approach for the derivation of dense linear algebra algorithms to the triangular Sylvester equation. The result is a large family of provably correct algorithms. By using a coding style that reflects the algorithms as they are naturally presented, the correctness of the algorithms carries through to the correctness of the implementations. Analytically motivated heuristics are used to subsequently choose members from the family that can be expected to yield high performance. Finally, we report performance on the Intel (R) Pentium (R) III processor that is competitive with that of recursive algorithms reported previously in the literature for this operation.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Aliev, F. and Larin, V. 1998. Optimization of Linear Control Systems: Analytical Methods and Computational Algorithms. Stability and Control: Theory, Methods and Applications, vol. 8. Gordon and Breach, New York, NY.
|
| |
2
|
E. Anderson , Z. Bai , C. Bischof , J. Demmel , J. Dongarra , J. Du Croz , A. Greenbaum , S. Hammarling , A. McKenney , S. Ostrouchov , D. Sorensen, LAPACK's user's guide, Society for Industrial and Applied Mathematics, Philadelphia, PA, 1992
|
 |
3
|
|
| |
4
|
Bientinesi, P., Gunnels, J. A., Myers, M. E., Quintana-Orí, E. S., and van de Geijn, R. A. 2002. The science of deriving dense linear algebra algorithms. Tech. Rep. CS-TR-02-53, Department of Computer Sciences, The University of Texas at Austin, Austin, TX. Available online at http://www.cs.utexas.edu/users/flame/.
|
 |
5
|
Jeff Bilmes , Krste Asanovic , Chee-Whye Chin , Jim Demmel, Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology, Proceedings of the 11th international conference on Supercomputing, p.340-347, July 07-11, 1997, Vienna, Austria
[doi> 10.1145/263580.263662]
|
| |
6
|
|
 |
7
|
|
 |
8
|
|
 |
9
|
|
| |
10
|
|
| |
11
|
Golub, G. H., Nash, S., and Van Loan, C. F. 1979. A Hessenberg--Schur method for the problem AX + XB = C. IEEE Trans. Automat. Control AC-24, 6, 909--913.
|
 |
12
|
|
| |
13
|
|
| |
14
|
Hammarling, S. J. 1982. Numerical solution of the stable, non-negative definite Lyapunov equation. IMA J. Numer. Anal. 2, 303--323.
|
| |
15
|
|
| |
16
|
Jonsson, I. and Kågström, B. 2001. Recursive blocked algorithms for solving triangular matrix equations---part I: one-sided and coupled Sylvester equations. Department of Computing Science and HPC2N Report UMINF-01.05. Umeå University, Umeå, Sweden.
|
| |
17
|
|
 |
18
|
|
| |
19
|
Sima, V. 1996. Algorithms for Linear-Quadratic Optimization. Pure and Applied Mathematics, vol. 200. Marcel Dekker. New York, NY.
|
| |
20
|
The MathWorks, Inc. 1993. The MATLAB Control Toolbox, Version 3.0b. The MathWorks, Inc., Natick, MA.
|
| |
21
|
|
CITED BY 8
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jia Guo , Ganesh Bikshandi , Basilio B. Fraguela , Maria J. Garzaran , David Padua, Programming with tiles, Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, February 20-23, 2008, Salt Lake City, UT, USA
|
|
|
Gregorio Quintana-Ortí , Francisco D. Igual , Enrique S. Quintana-Ortí , Robert A. van de Geijn, Solving dense linear systems on platforms with multiple hardware accelerators, Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming, February 14-18, 2009, Raleigh, NC, USA
|
|