ACM Home Page
Please provide us with feedback. Feedback
Macro-op Scheduling: Relaxing Scheduling Loop Constraints
Full text PdfPdf (357 KB)
Source International Symposium on Microarchitecture archive
Proceedings of the 36th annual IEEE/ACM International Symposium on Microarchitecture table of contents
Page: 277  
Year of Publication: 2003
ISBN:0-7695-2043-X
Authors
Ilhyun Kim  Department of Electrical and Computer Engineering, University of Wisconsin -Madison
Mikko H. Lipasti  Department of Electrical and Computer Engineering, University of Wisconsin -Madison
Sponsor
SIGMICRO: ACM Special Interest Group on Microarchitectural Research and Processing
Publisher
IEEE Computer Society  Washington, DC, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 30,   Citation Count: 15
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

Ensuring back-to-back execution of dependent instructionsin a conventional out-of-order processor requiresscheduling logic that wakes up and selects instructions atthe same rate as they are executed. To sustain high performance,integer ALU instructions typically have single-cyclelatency, consequently requiring scheduling logic withthe same single-cycle latency. Prior proposals have advocatedthe use of speculation in either the wakeup or selectphases to enable pipelining of scheduling logic to achievehigher clock frequency. In contrast, this paper proposesmacro-op scheduling, which systematically removesinstructions with single-cycle latency from the machine bycombining them into macro-ops, and performs nonspeculativepipelined scheduling of multi-cycle operations. Macro-opscheduling also increases the effective size of the schedulingwindow by enabling multiple instructions to occupy asingle issue queue entry. We demonstrate that pipelined 2-cyclemacro-op scheduling performs comparably or evenbetter than atomic scheduling or prior proposals for select-freescheduling.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
[2] G. Hinton et al., The microarchitecture of the Pentium 4 processor, Intel Technology Journal Q1, 2001.
3
4
5
 
6
[6] K. Diefendorff, K7 challenges Intel, Microprocessor Report, Vol. 12, No. 14, 1998.
7
 
8
9
10
 
11
 
12
 
13
[13] D. C. Burger and T. M. Austin, The Simplescalar tool set, version 2.0, Technical Report CS-TR-97-1342, University of Wisconsin, Madison, 1997.
 
14
[14] A. Kleinosowski, J. Flynn, N. Meares and D. J. Lilja, Adapting the SPEC2000 benchmarks suite for simulation-based computer architecture research, Workshop on Workload Characterization in International Conference on Computer Design, 2000.
 
15
[15] Compaq Computer Corporation, Alpha 21264 microprocessor hardware reference manual, 1999.
16
17
 
18
[18] S. Gochman et al., The Intel Pentium M processor: Microarchitecture and performance, Intel Technology Journal vol. 7, issue 2, 2003.
19
20
 
21
22
 
23
24
 
25

CITED BY  15

Collaborative Colleagues:
Ilhyun Kim: colleagues
Mikko H. Lipasti: colleagues