| Constructing and exploiting linear schedules with prescribed parallelism |
| Full text |
Pdf
(159 KB)
|
| Source
|
ACM Transactions on Design Automation of Electronic Systems (TODAES)
archive
Volume 7 , Issue 1 (January 2002)
table of contents
Pages: 159 - 172
Year of Publication: 2002
ISSN:1084-4309
|
|
Authors
|
|
Alain Darte
|
CNRS, LIP, École Normale Supérieure de Lyon, France
|
|
Robert Schreiber
|
Hewlett-Packard Company, Palo Alto, CA
|
|
B. Ramakrishna Rau
|
Hewlett-Packard Company, Palo Alto, CA
|
|
Frédéric Vivien
|
ICPS/LSIIT, Université Louis Pasteur, Strasbourg, France
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 2, Downloads (12 Months): 22, Citation Count: 7
|
|
|
ABSTRACT
We present two new results of importance in code generation for and synthesis of synchronously scheduled parallel processor arrays and multicluster VLIWs. The first is a new practical method for constructing a linear schedule for the iterations of a loop nest that schedules precisely one iteration per cycle on each of a prescribed set of processors. While this problem goes back to the era in which systolic computation was in vogue, it has defied practical solution until now. We provide a closed form solution that enables the enumeration of all such schedules. The second result is a new technique that reduces the cost of code or hardware whose function is to control the flow of data and predicate operations, and to generate memory addresses. The key idea is that by using the mathematical structure of any of the conflict-free schedules we construct, a very shallow recurrence can be developed to inexpensively update these quantities.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
| |
3
|
DARTE, A. AND DELOSME, J.-M. 1990. Partitioning for array processors. Tech. Rep. 90-23, LIP, ENS-Lyon.
|
| |
4
|
DARTE, A., SCHREIBER, R., RAU, B. R., AND VIVIEN, F. 1999. A constructive solution to the juggling problem in systolic array synthesis. Tech. Rep. RR1999-15, LIP, ENS-Lyon.
|
| |
5
|
HAJOS, G. 1942. Uber einfache und mehrfache Bedeckung des n-dimensionalen Raumes mit einem Wurfelgitter. Mathematische Zeitschrifte 47, 427-467.
|
| |
6
|
|
| |
7
|
NEWMAN, M. 1972. Integral Matrices. Academic, New York.
|
| |
8
|
RAU, B. R. 1996. Iterative modulo scheduling. Int. J. Parallel Process. 24, 3-64. Also available as HP Labs Tech. Rep. HPL-94-115.
|
| |
9
|
Robert Schreiber , Shail Aditya , B. Ramakrishna Rau , Vinod Kathail , Scott Mahlke , Santosh Abraham , Greg Snider, High-Level Synthesis of Nonprogrammable Hardware Accelerators, Proceedings of the IEEE International Conference on Application-Specific Systems, Architectures, and Processors, p.113, July 10-12, 2000
|
| |
10
|
|
CITED BY 7
|
|
Alain Darte , Rob Schreiber , Gilles Villard, Lattice-based memory allocation, Proceedings of the 2003 international conference on Compilers, architecture and synthesis for embedded systems, October 30-November 01, 2003, San Jose, California, USA
|
|
|
|
|
|
|
|
|
Hongbo Rong , Zhizhong Tang , R. Govindarajan , Alban Douillet , Guang R. Gao, Single-Dimension Software Pipelining for Multi-Dimensional Loops, Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization, p.163, March 20-24, 2004, Palo Alto, California
|
|
|
|
|
|
|
|
|
Robert Schreiber , Shail Aditya , Scott Mahlke , Vinod Kathail , B. Ramakrishna Rau , Darren Cronquist , Mukund Sivaraman, PICO-NPA: High-Level Synthesis of Nonprogrammable Hardware Accelerators, Journal of VLSI Signal Processing Systems, v.31 n.2, p.127-142, June 2002
|
|