ACM Home Page
Please provide us with feedback. Feedback
An efficient wakeup design for energy reduction in high-performance superscalar processors
Full text PdfPdf (365 KB)
Source Conference On Computing Frontiers archive
Proceedings of the 2nd conference on Computing frontiers table of contents
Ischia, Italy
SESSION: Track 12: temperature, energy, and complexity-aware designs table of contents
Pages: 353 - 360  
Year of Publication: 2005
ISBN:1-59593-019-1
Authors
Kuo-Su Hsiao  National Cheng Kung University, Taiwan
Chung-Ho Chen  National Cheng Kung University, Taiwan
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 31,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1062261.1062319
What is a DOI?

ABSTRACT

In modern superscalar processors, the complex instruction scheduler could form the critical path of the pipeline stages and limit the clock cycle time. In addition, complex scheduling logic results in the formation of a hot spot on the processor chip. Consequently, the latency and power consumption of the dynamic scheduler are two of the most crucial design issues when developing a high-performance microprocessor. We propose an instruction wakeup scheme that remedies the speed and power issues faced with conventional designs. This is achieved by a new design that separates RAM cells from the match circuits. This separated design is such that the advantages of the CAM and bit-map RAM schemes are retained, while their respective disadvantages are eliminated. Specifically, the proposed design retains the moderate area advantage of the CAM scheme and the low power and low latency advantages of the bit-map RAM schemeThe experimental results show that the proposed design saves power consumption by 80% compared to the traditional CAM-based design and 18% to the bit-map RAM design, respectively. In speed, the proposed design reduces an average of 77% in the wakeup latency compared to the conventional CAM-based design and an average of 33% reduction of the latency of the bit-map RAM design. For an 8-issue superscalar processor, the proposed design reduces the power consumption of the conventional wakeup logic by 80%, while simultaneously increasing the Instruction Count per nano-second (IPns) by a factor of approximately 2.5 times with a moderate area cost


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
S. Palacharla, N. P. Jouppi, and J. E. Smith, "Quantifying the Complexity of Superscalar Processors," Tech.Rep. CS-1328, University of Wisconsin-Madison, May 1997.
2
3
 
4
S.H. Gunther, "personal communication," Intel Corporation, May 2003.
5
6
 
7
D. Burger and T. M. Austin, "The SimpleScalar tool set, version 2.0," Tech. Rep. CS-1342, University of Wisconsin-Madison, June 1997.
 
8
SPEC System Performance Evaluation Committee, www.spec.org.
 
9
G. Reinman and N. P. Jouppi, "CACTI 2.0: An Integrated Cache Timing and Power Model," Tech. Rep. Compaq Western Research Lab, February 2000.
 
10
 
11
12
13
14
15
 
16
R. Ho, K. W. Mai, and M. A. Horowitz, "The Future of Wires," Proceedings of the IEEE, 89(4):490--504, April 2001.
17
18
 
19
20


Collaborative Colleagues:
Kuo-Su Hsiao: colleagues
Chung-Ho Chen: colleagues