ABSTRACT
The constraint tree provides a uniform framework for representing many loop transformations. It allows us to estimate the performance of several alternative execution methods before committing to any of the transformations.
We introduce the constraint tree, show how it is built, and demonstrate its use for vectorization and parallel decomposition. We show how unconstrained loops can be moved to reduce the costs of memory accesses.
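As a rough illustration of why moving a loop can reduce memory access costs, the sketch below compares the two possible orderings of a two-deep loop nest over a row-major array. It is not the constraint tree itself, and it is not taken from the paper: the function names `access_order` and `strides` are hypothetical, and the sketch assumes a row-major layout and a loop body in which neither loop carries a dependence, so either loop may legally be placed innermost.

```python
# Illustrative sketch only (hypothetical helper names, not the paper's algorithm).
# Assumes a row-major n x m array and a loop body with no loop-carried dependence.

def access_order(n, m, inner):
    """Return the flat row-major indices touched by a two-deep loop nest
    over an n x m array, with the named loop ('i' or 'j') innermost."""
    order = []
    if inner == "j":
        # i outer, j inner: walks along a row, so neighbours are adjacent.
        for i in range(n):
            for j in range(m):
                order.append(i * m + j)
    else:
        # j outer, i inner: walks down a column, jumping m elements each step.
        for j in range(m):
            for i in range(n):
                order.append(i * m + j)
    return order


def strides(order):
    """Distance between consecutive accesses in the given access order."""
    return [b - a for a, b in zip(order, order[1:])]


if __name__ == "__main__":
    print(strides(access_order(3, 4, "j")))  # unit stride throughout
    print(strides(access_order(3, 4, "i")))  # mostly stride m = 4
```

Both orderings touch the same set of elements; only the distance between consecutive accesses changes. With `j` innermost every step is to the adjacent element, which is the access pattern that caches and vector memory units generally favour.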