ACM Home Page
Please provide us with feedback. Feedback
Improving register allocation for subscripted variables
Full text PdfPdf (1.93 MB)
Source ACM SIGPLAN Notices archive
Volume 39 ,  Issue 4  (April 2004) table of contents
Best of PLDI 1979-1999
SPECIAL ISSUE: 1990 table of contents
Pages: 328 - 342  
Year of Publication: 2004
ISSN:0362-1340
Authors
David Callahan  Cray, Inc., Seattle, WA
Steve Carr  Michigan Technological University, Houghton, MI
Ken Kennedy  Rice University, Houston, TX
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 0,   Downloads (12 Months): 39,   Citation Count: 1
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/989393.989428
What is a DOI?

ABSTRACT

Most conventional compilers fail to allocate array elements to registers because standard data-flow analysis treats arrays like scalars, making it impossible to analyze the definitions and uses of individual array elements. This deficiency is particularly troublesome for floating-point registers, which are most often used as temporary repositories for subscripted variables.In this paper, we present a source-to-source transformation, called scalar replacement, that finds opportunities for reuse of subscripted variables and replaces the references involved by references to temporary scalar variables. The objective is to increase the likelihood that these elements will be assigned to registers by the coloring-based register allocators found in most compilers. In addition, we present transformations to improve the overall effectiveness of scalar replacement and show how these transformations can be applied in a variety of loop nest types. Finally, we present experimental results showing that these techniques are extremely effective---capable of achieving integer factor speedups over code generated by good optimizing compilers of conventional design.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
F. E. Allen and J. Cocke. A catalogue of optimizing transformations. In Design and Optimization of Compilers, pages 1--30. Prentice-Hall, 1972.
3
4
 
5
6
 
7
 
8
 
9
10
 
11
 
12
G. Chaitin, M. Auslander, A. Chandra, J. Cocke, M. Hopkins, and P. Markstein. Register allocation via coloring. Computer Languages, 6:45--57, Jan. 1981.
13
14
15
16
 
17
D. Kuck, R. Kuhn, B. Leasure, and M. Wolfe. The structure of an advanced retargetable vectorizer. In Supercomputers: Design and Applications, pages 163--178. IEEE Computer Society Press, Silver Spring, MD., 1984.
18
 
19
D. Kuck, Y. Muraoka, and S. Chen. On the number of operations simultaneously executable in fortran-like programs and their resulting speedup. IEEE Transactions on Computers, C-21(12):1293--1310, Dec. 1972.
20
21
22
 
23
24
25
 
26
 
27
M. Wolfe. Advanced loop interchange. In Proceedings of the 1986 International Conference on Parallel Processing, Aug. 1986.
 
28
 
29
{AC72} F. E. Allen and J. Cocke. A catalogue of optimizing transformations. In Design and Optimization of Compilers, pages 1--30. Prentice-Hall, 1972.
30
 
31
{AK84b} J. R. Allen and K. Kennedy. PFC: A program to convert fortran to parallel form. In Supercomputers: Design and Applications, pages 186--205. IEEE Computer Society Press, Silver Spring, MD., 1984.
 
32
{AK88} J. R. Allen and K. Kennedy. Vector register allocation. Technical report, Department of Computer Science, Rice University, 1988.
 
33
 
34
{CAC+81} G. J. Chaitin, M. A. Auslander, A. K. Chandra, J. Cocke, M. E. Hopkins, and P. W. Markstein. Register allocation via coloring. Computer Languages, 6:45--57, January 1981.
 
35
{CCK87} D. Callahan, J. Cocke, and K. Kennedy. Estimating interlock and improving balance for pipelined machines. In Proceedings of the 1987 International Conference on Parallel Processing, August 1987.
 
36
 
37
{DBMS79} J. J. Dongarra, J. R. Bunch, C. B. Moler, and G. W. Stewart. LINPACK User's Guide. SIAM Publications, Philadelphia, 1979.
 
38
39
 
40
 
41
 
42
 
43
{Wol86} M. Wolfe. Advanced loop interchange. In Proceedings of the 1986 International Conference on Parallel Processing, August 1986.
 
44
45

Collaborative Colleagues:
David Callahan: colleagues
Steve Carr: colleagues
Ken Kennedy: colleagues