| An extended set of FORTRAN basic linear algebra subprograms |
| Full text |
Pdf
(1.14 MB)
|
| Source
|
ACM Transactions on Mathematical Software (TOMS)
archive
Volume 14 , Issue 1 (March 1988)
table of contents
Pages: 1 - 17
Year of Publication: 1988
ISSN:0098-3500
|
|
Authors
|
|
Jack J. Dongarra
|
Argonne National Laboratory, Argonne, IL
|
|
Jeremy Du Croz
|
Numerical Algorithms Group, Ltd., Oxford, UK
|
|
Sven Hammarling
|
Numerical Algorithms Group, Ltd., Oxford, UK
|
|
Richard J. Hanson
|
Applied Dynamics International Corporation, Ann Arbor, MI
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 21, Downloads (12 Months): 127, Citation Count: 106
|
|
|
ABSTRACT
This paper describes an extension to the set of Basic Linear Algebra Subprograms. The extensions are targeted at matrix-vector operations that should provide for efficient and portable implementations of algorithms for high-performance computers.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
 |
3
|
|
| |
4
|
DONGARRA, J. J., KAUFMAN, L., AND HAMMARLING, S. Squeezing the most out of eigenvalue solvers on high-performance computers. Linear Algebra Appl. 77 (1986), 113-136.
|
| |
5
|
DONGARRA, J. J., BUNCH, J. R., MOLER, C. B., AND STEWART, G.W. LINPACK Users'Guide. SIAM, Philadelphia, Pa., 1979.
|
 |
6
|
|
 |
7
|
|
| |
8
|
GARBOW, B. S., BOYLE, J. M., DONGARRA, J. J., AND MOLER, C. B. Matrix Eigensystem Routines--EISPACK Guide Extension. Lecture Notes in Computer Science, vol. 51. Springer- Verlag, New York, 1977.
|
 |
9
|
|
| |
10
|
IEEE. IEEE standard for binary floating-point arithmetic. ANSI/IEEE Stand. 754-1985, IEEE, New York, 1985.
|
 |
11
|
|
 |
12
|
|
| |
13
|
SMITH, B. T., BOYLE, J. M., DONGARRA, J. J., GARBOW, B. S., IKEBE, Y., KLEMA, V. C., AND MOLER, C. B. Matrix Eigensystem RoutinesmEISPACK Guide. Lecture Notes in Computer Science, vol. 6. 2nd ed., Springer-Verlag, New York, 1976.
|
CITED BY 106
|
|
|
|
|
|
|
|
|
|
|
|
|
|
E. Anderson , Z. Bai , J. Dongarra , A. Greenbaum , A. McKenney , J. Du Croz , S. Hammerling , J. Demmel , C. Bischof , D. Sorensen, LAPACK: a portable linear algebra library for high-performance computers, Proceedings of the 1990 conference on Supercomputing, p.2-11, October 1990, New York, New York, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Laura Susan Blackford , J. Choi , A. Cleary , A. Petitet , R. C. Whaley , J. Demmel , I. Dhillon , K. Stanley , J. Dongarra , S. Hammarling , G. Henry , D. Walker, ScaLAPACK: a portable linear algebra library for distributed memory computers - design issues and performance, Proceedings of the 1996 ACM/IEEE conference on Supercomputing (CDROM), p.5-es, January 01-01, 1996, Pittsburgh, Pennsylvania, United States
|
|
|
|
|
|
|
|
|
G.-S. Karamanos , C. Evangelinos , R. C. Boes , R. M. Kirby , G. E. Karniadakis, Direct numerical simulation of turbulence with a PC/linux cluster: fact or fiction?, Proceedings of the 1999 ACM/IEEE conference on Supercomputing (CDROM), p.53-es, November 14-19, 1999, Portland, Oregon, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
L. S. Blackford , A. Cleary , A. Petitet , R. C. Whaley , J. Demmel , I. Dhillon , H. Ren , K. Stanley , J. Dongarra , S. Hammarling, Practical experience in the numerical dangers of heterogeneous computing, ACM Transactions on Mathematical Software (TOMS), v.23 n.2, p.133-147, June 1997
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Gregor Kiczales , John Lamping , Christina Videira Lopes , Chris Maeda , Anurag Mendhekar , Gail Murphy, Open implementation design guidelines, Proceedings of the 19th international conference on Software engineering, p.481-490, May 17-23, 1997, Boston, Massachusetts, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Sivan Toledo , Fred G. Gustavson, The design and implementation of SOLAR, a portable library for scalable out-of-core linear algebra computations, Proceedings of the fourth workshop on I/O in parallel and distributed systems: part of the federated computing research conference, p.28-40, May 27-27, 1996, Philadelphia, Pennsylvania, United States
|
|
|
|
|
|
Xiaoye S. Li , James W. Demmel , David H. Bailey , Greg Henry , Yozo Hida , Jimmy Iskandar , William Kahan , Suh Y. Kang , Anil Kapur , Michael C. Martin , Brandon J. Thompson , Teresa Tung , Daniel J. Yoo, Design, implementation and testing of extended and mixed precision BLAS, ACM Transactions on Mathematical Software (TOMS), v.28 n.2, p.152-205, June 2002
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jack Dongarra , Ian Foster , Geoffrey Fox , William Gropp , Ken Kennedy , Linda Torczon , Andy White, References, Sourcebook of parallel computing, Morgan Kaufmann Publishers Inc., San Francisco, CA, 2003
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jeff Bilmes , Krste Asanovic , Chee-Whye Chin , Jim Demmel, Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology, Proceedings of the 11th international conference on Supercomputing, p.340-347, July 07-11, 1997, Vienna, Austria
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Lamia Youseff , Keith Seymour , Haihang You , Jack Dongarra , Rich Wolski, The impact of paravirtualized memory hierarchy on linear algebra computational kernels and software, Proceedings of the 17th international symposium on High performance distributed computing, June 23-27, 2008, Boston, MA, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Thierry Joffrain , Tze Meng Low , Enrique S. Quintana-Ortí , Robert van de Geijn , Field G. Van Zee, Accumulating Householder transformations, revisited, ACM Transactions on Mathematical Software (TOMS), v.32 n.2, p.169-179, June 2006
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jaeyoung Choi , Jack J. Dongarra , L. Susan Ostrouchov , Antoine P. Petitet , David W. Walker , R. Clint Whaley, Design and Implementation of the ScaLAPACK LU, QR, and Cholesky Factorization Routines, Scientific Programming, v.5 n.3, p.173-184, August 1996
|
|
|
|
|
|
|
|
|
Angelines Alberto , Jesús Benet , Enrique Arias , David Cebrian , Tomás Rojo , Fernando Cuartero, A high performance tool for the simulation of the dynamic pantograph-catenary interaction, Mathematics and Computers in Simulation, v.79 n.3, p.652-667, December, 2008
|
|
|
Gregorio Quintana-Ortí , Francisco D. Igual , Enrique S. Quintana-Ortí , Robert A. van de Geijn, Solving dense linear systems on platforms with multiple hardware accelerators, Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming, February 14-18, 2009, Raleigh, NC, USA
|
|
|
|
|
|
|
|
|
Lamia Youseff , Keith Seymour , Haihang You , Dmitrii Zagorodnov , Jack Dongarra , Rich Wolski, Paravirtualization effect on single- and multi-threaded memory-intensive linear algebra software, Cluster Computing, v.12 n.2, p.101-122, June 2009
|
REVIEW
"Charles Raymond Crawford : Reviewer"
This paper describes additions to the set of FORTRAN Basic Linear
Algebra Subprograms, or BLAS. The original set of BLAS [1] has been
widely accepted and many published programs refer to it. Special machine-code
implementations of the BLAS that
more...
|