ACM Home Page
Please provide us with feedback. Feedback
Height reduction of control recurrences for ILP processors
Full text PdfPdf (1.36 MB)
Source International Symposium on Microarchitecture archive
Proceedings of the 27th annual international symposium on Microarchitecture table of contents
San Jose, California, United States
Pages: 40 - 51  
Year of Publication: 1994
ISBN:0-89791-707-3
Authors
Michael Schlansker  Hewlett-Packard Laboratories, 1501 Page Mill Road, Palo Alto, CA
Vinod Kathail  Hewlett-Packard Laboratories, 1501 Page Mill Road, Palo Alto, CA
Sadun Anik  Hewlett-Packard Laboratories, 1501 Page Mill Road, Palo Alto, CA
Sponsors
IEEE-CS\TCMM : TC on Microprocessors & Microcomputers
SIGMICRO: ACM Special Interest Group on Microarchitectural Research and Processing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 11,   Citation Count: 12
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/192724.192729
What is a DOI?

ABSTRACT

The performance of applications executing on processors with instruction level parallelism is often limited by control and data dependences. Performance bottlenecks caused by dependences can frequently be eliminated through transformations which reduce the height of critical paths through the program. While height reduction techniques are not always helpful, their utility can be demonstrated in a broad range of important situations.This paper focuses on the height reduction of control recurrences within loops with data dependent exits. Loops with exits are transformed so as to alleviate performance bottlenecks resulting from control dependences. A compilation approach to effect these transformations is described. The techniques presented in this paper used in combination with prior work on reducing the height of data dependences provide a comprehensive approach to accelerating loops with conditional exits.In many cases, loops with conditional exits provide a degree of parallelism traditionally associated with vectorization. Multiple iterations of a loop can be retired in a single cycle on a processor with adequate instruction level parallelism with no cost in code redundancy. In more difficult cases, height reduction requires redundant computation or may not be feasible.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
A. Nicolau and J. A. Fisher. Measuring the parallelism available for very long instruction word architectures. IEEE Transactions on Computers C-33, 11 (1984), 968-976.
3
4
5
 
6
 
7
 
8
J.A. Fisher. Trace scheduling: A technique for global microcode compaction. IEEE Transactions on Computers C- 30, 7 (1981), 478-490.
 
9
 
10
11
 
12
J.A. Fisher. Global code generation for instruction-level parallelism: trace scheduling-2. Technical Report HPL-93- 43, Hewlett-Packard Laboratories. Palo Alto CA, 1993.
 
13
 
14
 
15
16
17
 
18
B. R. Rau. Cydra 5 directed dataflow architecture. Proceedings of the COMPCON '88 (San Francisco, 1988). 106-113.
 
19
 
20
J.C. H, Park and M. S. Schlansker. On predicated execution. Technical Report HPL-91-58, Hewlett-Packard Laboratories, Palo Alto CA, 1991.
 
21
V. Kathail, M. S. Schlansker, and B. R. Rau. HPL PlayDoh architecture specification: Version 1.0. Technical Report HPL-93-80, Hewlett-Packard Laboratories, Palo Alto CA, 1993.
 
22
 
23
24
25
 
26
27
28
 
29
M.S. Schlansker, V. Kathail, and S. Anik. Parallelization of control recurrences for ILP processors. Technical Report HPL-94-75, Hewlett-Packard Laboratories, Palo Alto Ca., 1994.

CITED BY  12

Collaborative Colleagues:
Michael Schlansker: colleagues
Vinod Kathail: colleagues
Sadun Anik: colleagues