| Transforming FORTRAN DO loops to improve performance on vector architectures |
| Full text |
Pdf
(1.98 MB)
|
| Source
|
ACM Transactions on Mathematical Software (TOMS)
archive
Volume 12 , Issue 4 (December 1986)
table of contents
Pages: 324 - 353
Year of Publication: 1986
ISSN:0098-3500
|
|
Authors
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 4, Downloads (12 Months): 36, Citation Count: 3
|
|
|
ABSTRACT
The performance of programs executing on vector computers is significantly improved when the number of accesses to memory can be reduced. Unrolling Fortran DO loops, followed by substitutions and eliminations in the unrolled code, can reduce the number of loads and stores. In this paper we characterize the unrolling transformation and associated transformations of Fortran DO loops and describe a set of software tools to carry out these transformations. The tools use the machinery available in Toolpack and have been integrated into that environment. We describe the results of applying these tools to a collection of linear algebra subroutines.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
ANSI X3.9-1978. Programming Language FORTRAN. American National Standards Institute, Inc., New York, 1978.
|
| |
3
|
COWELL, W. R., HAGUE, S. J., A~qD ILES, R. M.J. Toolpack/1 Introductory Guide. Publication NP1007, Numerical Algorithms Group, Downers Grove, Ill., and Oxford, U.K., 1985.
|
| |
4
|
COWELL, W. R. The Toolpack Tools ISTUD, ISTCD, and ISTSB: Guide for Users and installers. MCS-TM-74, Argonne National Lab., Argonne, Ill., 1986 (also included on the Toolpack distribution tape).
|
| |
5
|
DALY, C., AND DU CROZ, J. J. Performance of a subroutine library on vector processing machines. In Vector and Parallel Processors in Computational Science, I. S. Duff and J. K. Reid, Eds. North-Holland, Amsterdam, The Netherlands, 1985.
|
 |
6
|
|
| |
7
|
DONGARRA, J. J., AND HINDS, A. R. Unrolling loops in FORTRAN. Softw. Pract. Exper. 9 (1979), 219-226.
|
 |
8
|
|
REVIEW
"Michael Wolfe : Reviewer"
This is a paper from some people “in the trenches” who are trying to
improve performance in machines like the Cray supercomputers. As has been
noted in many places, the Cray compilers do not produce the best performance.
One approach
more...
|