|
ABSTRACT
In this paper we generalize the framework of linear loop transformations: we consider loop alignment as a new component in the transformation process. The aim is to exploit the additional inherent statement-level parallelism and reduce the amount of interprocessor synchronization and communication when a coarse-grain MIMD execution model is considered. The transformation process is modelled with non-singular matrices and we use the ideas recently proposed in this field to generate an efficient transformed code. However, additional aspects have to be studied when statements are considered in the process. We try to reduce the overhead due to conditionals that appear in the loop body of the transformed loops.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
Banerjee U., Unimodular Transforlnations of Double Loops, chapter 10 of Advances in Languages and Compilers for Parallel Processing, The MIT Press, 1991.
|
| |
4
|
|
| |
5
|
|
| |
6
|
Fernandez A., Systematic Transformation of Systolic Algorithms for Programming Distribated Memory Multiprocessors, Ph.D. Thesis, Department of Computer Architecture, Polytechnic University of Catalunya (Spain), November 1992.
|
| |
7
|
|
| |
8
|
|
| |
9
|
Lu L. and Chen M., New Loop Transformation Techniques for Massive Parallelism, Research Report TR-833, Department of Computer Science, Yale University, October 1990.
|
| |
10
|
|
| |
11
|
|
| |
12
|
Liu L.S., Ho C.W. and Sheu J.P., On the Parallelism of Nested For-loops Using Index Shift Method, in Proceedings the 1090 Int. Conference on Parallel Processing, vo. II, St. Charles (IL), August 1990.
|
| |
13
|
Darte A. and Robert Y., Scheduling Uniform Loop Nests, Research Report no. 92-10, Laboratoire de l'Informatique du Parallelisme, Ecole Normale Superieure de Lyon (France), February 1992.
|
| |
14
|
|
| |
15
|
Ayguade E., Labarta J., Torres J., Llaberia J.M. and Valero M., Parallelism Evaluation and Partitioning of Nested Loops for Shared Memory Multiprocessors, chapter 11 of Advances in Langmtges cmd Compilers for Parallel Processing, The MIT Press, 1991.
|
 |
16
|
|
CITED BY 7
|
|
|
|
|
Nawaaz Ahmed , Nikolay Mateev , Keshav Pingali, Synthesizing transformations for locality enhancement of imperfectly-nested loop nests, Proceedings of the 14th international conference on Supercomputing, p.141-152, May 08-11, 2000, Santa Fe, New Mexico, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|