ACM Home Page
Please provide us with feedback. Feedback
Stall cycle redistribution in a transparent fetch pipeline
Full text PdfPdf (416 KB)
Source International Symposium on Low Power Electronics and Design archive
Proceedings of the 2006 international symposium on Low power electronics and design table of contents
Tegernsee, Bavaria, Germany
SESSION: Microarchitectural techniques for low power table of contents
Pages: 31 - 36  
Year of Publication: 2006
ISBN:1-59593-462-6
Authors
Eric L. Hill  University of Wisconsin - Madison
Mikko H. Lipasti  University of Wisconsin - Madison
Sponsors
ACM: Association for Computing Machinery
SIGDA: ACM Special Interest Group on Design Automation
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 53,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1165573.1165583
What is a DOI?

ABSTRACT

Power and power density are now primary design constraints for modern high performance microprocessors. Up to 70% of the dynamic power consumed can be attributed to the clocking system. A consequence of this trend is that clock gating has emerged as both a necessary and efficient method to significantly reduce dynamic power.Transparent pipelining, a recently proposed fine-grain clock gating technique, has the potential to significantly reduce clock power above and beyond conventional pipestage-level clock gating. Previous studies of transparent pipelining have focused on the circuit and implementation-related issues of this approach, while neglecting the broader microarchitectural implications. This paper aims to quantify the microarchitectural opportunities that are afforded by the use of transparent pipelining in a processor's fetch pipeline. We develop a technique, based on stall cycle redistribution, designed to improve the performance of transparent pipelining on fetch and other high utilization pipelines. We show that stall cycle redistribution can dramatically reduce the clocking overhead of an aggressively pipelined Cell-like microprocessor.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
C. J. Anderson, J. Petrovick, J. M. Keaty, J. Warnock, G. Nussbaum, J. M. Tendler, C. Carter, S. Chu, J. Clabes, J. DiLullo, P. Dudley, P. Harvey, B. Krauter, J. LeBlanc, Lu Pong-Fei, B. McCredie, G. Plum, P. J. Restle, S. Runyon, M. Scheuermann, S. Schmidt, J. Wagoner, R. Weiss, S. Weitzel, B. Zoric. Physical Design of a Fourth-generation POWER GHz Microprocessor. In Proc. of the 2001 International Solid-State Circuits Conference, Februrary 2001.
2
 
3
4
 
5
H. Cain, K. Lepak, B. Schwarz, and M. H. Lipasti. Precise and Accurate Processor Simulation. In Workshop on Computer Architecture Evaluation using Commercial Workloads.
 
6
7
8
 
9
10
 
11
 
12
13
 
14
L. Kleinrock. Queueing Systems, Volume I: Theory. Wiley Interscience, New York, 1972.
15
 
16
K. Krewell. Cell Moves into the Limelight. In Microprocessor Report, February 2005.
17
 
18
19
 
20
J. E. Smith. An analysis of pipeline clocking. Technical report, University of Wisconsin, March 1990.
 
21
W. Ye, N. Vijaykrishnan, M. Kandemir, M. J. Irwin. The Design and Use of SimplePower: A Cycle-Accurate Energy Estimation Tool. In Proc. of the 36th Annual Design Automation Conference, June 1999.


Collaborative Colleagues:
Eric L. Hill: colleagues
Mikko H. Lipasti: colleagues