ACM Home Page
Please provide us with feedback. Feedback
Transforming FORTRAN DO loops to improve performance on vector architectures
Full text PdfPdf (1.98 MB)
Source ACM Transactions on Mathematical Software (TOMS) archive
Volume 12 ,  Issue 4  (December 1986) table of contents
Pages: 324 - 353  
Year of Publication: 1986
ISSN:0098-3500
Authors
Wayne R. Cowell  Argonne National Laboratory, Argonne, IL
Christopher P. Thompson  Atomic Energy Research Establishment, Harwell, UK
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 36,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   review   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/22721.24035
What is a DOI?

ABSTRACT

The performance of programs executing on vector computers is significantly improved when the number of accesses to memory can be reduced. Unrolling Fortran DO loops, followed by substitutions and eliminations in the unrolled code, can reduce the number of loads and stores. In this paper we characterize the unrolling transformation and associated transformations of Fortran DO loops and describe a set of software tools to carry out these transformations. The tools use the machinery available in Toolpack and have been integrated into that environment. We describe the results of applying these tools to a collection of linear algebra subroutines.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
ANSI X3.9-1978. Programming Language FORTRAN. American National Standards Institute, Inc., New York, 1978.
 
3
COWELL, W. R., HAGUE, S. J., A~qD ILES, R. M.J. Toolpack/1 Introductory Guide. Publication NP1007, Numerical Algorithms Group, Downers Grove, Ill., and Oxford, U.K., 1985.
 
4
COWELL, W. R. The Toolpack Tools ISTUD, ISTCD, and ISTSB: Guide for Users and installers. MCS-TM-74, Argonne National Lab., Argonne, Ill., 1986 (also included on the Toolpack distribution tape).
 
5
DALY, C., AND DU CROZ, J. J. Performance of a subroutine library on vector processing machines. In Vector and Parallel Processors in Computational Science, I. S. Duff and J. K. Reid, Eds. North-Holland, Amsterdam, The Netherlands, 1985.
6
 
7
DONGARRA, J. J., AND HINDS, A. R. Unrolling loops in FORTRAN. Softw. Pract. Exper. 9 (1979), 219-226.
8



REVIEW

"Michael Wolfe : Reviewer"

This is a paper from some people “in the trenches” who are trying to improve performance in machines like the Cray supercomputers. As has been noted in many places, the Cray compilers do not produce the best performance. One approach  more...

Collaborative Colleagues:
Wayne R. Cowell: colleagues
Christopher P. Thompson: colleagues