|
ABSTRACT
The constraint tree provides a uniform framework for representing many loop transformations. It allows us to estimate the performance of several alternative execution methods before committing to any of the transformations.
We introduce the constraint tree, show how it is built, and demonstrate its use for vectorization and parallel decomposition. We show how unconstrained loops can be moved to reduce the costs of memory accesses.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
Alle83
|
|
| |
ACK86
|
J.R. Allen, D. C~llahan, and K. Kennedy, "Automatic Decomposition of Scientific Programs for Parallel Execution", Report TR86-42, Department of Oomputer Science, Rice University, Houston, TX, November, 1986.
|
 |
AK87
|
|
| |
Bane76
|
U. Banerjee, "D~ta dependence in ordinary programs", Report 76-837, Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, II, 1979.
|
| |
Call86
|
D. Callahan,"Dependence testing in PFC: Weak Separability", Supercomputer Software Newsletter ~2, Rice University, August 1986.
|
| |
Coha73
|
W. L. Cohagen, "Vector Optimization for the ASC", Proc. Sever~th Annual Princeton Conferer~ce on Informatio~ Sciences and Systems, Department of Electrical Engineering, Princeton, N J, 1973.
|
 |
FOW87
|
|
| |
IT86
|
F. Irigoin, R. Triolet, "Computing Dependence Direction Vectors and Dependence Cones with Linear Systems," EBSNP-CAt-87-E94, Ecole des Mines de Paris, Fontainebleau, France. August 1986.
|
| |
Kuck78
|
|
| |
KKLW80a
|
D. J. Kuck, R. H. Kuhn, B. Leasure, M. Wolfe, "The structure of an advanced vectorizer for pipelined processors'', Proc. IEEE Computer Society Fourth In.t'l Computer Software and Applications Conference,, IEEE,, Chicago, II, October 1980.
|
| |
KKLW80b
|
D. J. Kuck, R. H. Kuhn, B. Leasure, M. Wolfe, "Analysis and transform;rtion of programs for parallel computation", Proc. IEEE Computer Society Fourth Int'l Computer Software and Applications Conference, , IEEE, Chicago, II, October 1980.
|
 |
Lamp74
|
|
 |
PW86
|
|
| |
SK86
|
|
| |
Tarj72
|
R. Tarjan "Depth first search and linear graph algorithms", SIAM Journal of Computinq, 1,2 1972. pp. 146-160.
|
| |
Wolf78
|
M.J. Wolfe, "Techniques for improving the inherent, parallelism in programs", Repor~ 78-929, Dep~rtlnent: of Computer Science, University of Illinois at Urbana- Champaign, Urbana, I1, July 1978.
|
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE Design Automation Conference on
Gwo-Dong Chen
, Daniel D. Gajski
|