ACM Home Page
Please provide us with feedback. Feedback
Low power microarchitecture with instruction reuse
Full text PdfPdf (876 KB)
Source
Conference On Computing Frontiers archive
Proceedings of the 5th conference on Computing frontiers table of contents
Ischia, Italy
SESSION: Innovative microarchitecture I table of contents
Pages 149-158  
Year of Publication: 2008
ISBN:978-1-60558-077-7
Authors
Frederico Pratas  INESC-ID/IST, Lisbon, Portugal
Georgi Gaydadjiev  TU Delft, Delft, Netherlands
Mladen Berekovic  TU Braunschweig, Braunschweig, Germany
Leonel Sousa  INES-ID/IST, Lisbon, Portugal
Stefanos Kaxiras  University of Patras, Patras, Greece
Sponsors
ACM: Association for Computing Machinery
SIGMICRO: ACM Special Interest Group on Microarchitectural Research and Processing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 60,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1366230.1366259
What is a DOI?

ABSTRACT

Power consumption has become a very important metric and challenging research topic in the design of microprocessors in the recent years. The goal of this work is to improve power efficiency of superscalar processors through instruction reuse at the execution stage. This paper proposes a new method for reusing instructions when they compose small loops: the loop's instructions are first buffered in the Reorder Buffer and reused afterwards without the need for dynamically unrolling the loop, as commonly implemented by the traditional instruction reusing techniques. The proposed method is implemented with the introduction of two new auxiliary hardware structures in a typical superscalar microarchitecture: a Finite State Machine (FSM), used to detect the reusable loops; and a Log used to store the renaming data for each instruction when the loop is "unrolled". In order to evaluate the proposed method we modified the sim-outorder tool from Simplescalar v3.0 for the PISA, and Wattch v1.02 Power Performance simulators. Several different configurations and benchmarks have been used during the simulations. The obtained results show that by implementing this new method in a superscalar microarchitecture, the power efficiency can be improved without significantly affecting neither the performance nor the cost.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
G. Hamerly, E. Perelman, J. Lau, and B. Calder. Simpoint 3.0: Faster and more flexible program analysis. The Journal of Instruction-Level Parallelism, 7, Sep 2005.
 
4
 
5
 
6
7
 
8
G. E. Moore. Cramming more components onto integrated circuits. Proceedings of the IEEE, 86:82--85, Jan 1998.
9
 
10
S. J. Patel, D. H. Friendly, and Y. N. Patt. Critical issues regarding the trace cache fetch mechanism. Technical Report CSE-TR-335-97, University of Michigan, July 1997.
 
11
F. Pratas. Low power microarchitecture with instruction reuse. Technical Report 21, INESC-ID, September 2007.
 
12
C. Price. MIPS IV Instruction Set, revision 3.2. MIPS Technologies, Inc., Mountain View, CA, September 1995.
 
13
J. P. Shen and M. H. Lipasti. Modern Processor Design: Fundamentals of Superscalar Processors. McGraw-Hill Companies, Inc., New York, NY, USA, 1st edition, 2005.
14
15
 
16
 
17
S. Wilton and N. Jouppi. Cacti: An enhanced cache access and cycle time model. IEEE Journal of Solid-State Circuits, 31:677--688, may 1996.
18

Collaborative Colleagues:
Frederico Pratas: colleagues
Georgi Gaydadjiev: colleagues
Mladen Berekovic: colleagues
Leonel Sousa: colleagues
Stefanos Kaxiras: colleagues