ACM Home Page
Please provide us with feedback. Feedback
Algorithm 784: GEMM-based level 3 BLAS: portability and optimization issues
Full text PdfPdf (155 KB)
Source ACM Transactions on Mathematical Software (TOMS) archive
Volume 24 ,  Issue 3  (September 1998) table of contents
Pages: 303 - 316  
Year of Publication: 1998
ISSN:0098-3500
Authors
Bo Kågström  Umeå Univ., Umeå, Sweden
Charles van Loan  Cornell Univ., Ithaca, NY
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 46,   Citation Count: 6
Additional Information:

appendices and supplements   abstract   references   cited by   index terms   review   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/292395.292426
What is a DOI?

APPENDICES and SUPPLEMENTS
gZip784.gz (338 KB)
Software for "GEMM-Based Level 3 BLAS: Portability and Optimization Issues"


ABSTRACT

This companion article discusses portability and optimization issues of the GEMM-based level 3 BLAS model implementations and the performance evaluation benchmark. All software comes in all four data types (single- and double-precision, real and complex) and are designed to be easy to implement and use on different platforms. Each of the GEMM-based routines has a few machine-dependent parameters that specify internal block sizes, cache characteristics, and branch points for alternative code sections. These parameters provide means for adjustment to the characteristics of a memory hierarchy.





REVIEW

"Timothy R. Hopkins : Reviewer"

The basic linear algebra subroutines (BLAS) consist of three libraries (known as Levels 1, 2, and 3) and form an integral part of much of the important numerical software developed over the last two decades. Efficient implementatio  more...

Collaborative Colleagues:
Bo Kågström: colleagues
Charles van Loan: colleagues