| Low power microarchitecture with instruction reuse |
| Full text |
Pdf
(876 KB)
|
Source
|
Conference On Computing Frontiers
archive
Proceedings of the 5th conference on Computing frontiers
table of contents
Ischia, Italy
SESSION: Innovative microarchitecture I
table of contents
Pages 149-158
Year of Publication: 2008
ISBN:978-1-60558-077-7
|
|
Authors
|
|
Frederico Pratas
|
INESC-ID/IST, Lisbon, Portugal
|
|
Georgi Gaydadjiev
|
TU Delft, Delft, Netherlands
|
|
Mladen Berekovic
|
TU Braunschweig, Braunschweig, Germany
|
|
Leonel Sousa
|
INES-ID/IST, Lisbon, Portugal
|
|
Stefanos Kaxiras
|
University of Patras, Patras, Greece
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 11, Downloads (12 Months): 60, Citation Count: 0
|
|
|
ABSTRACT
Power consumption has become a very important metric and challenging research topic in the design of microprocessors in the recent years. The goal of this work is to improve power efficiency of superscalar processors through instruction reuse at the execution stage. This paper proposes a new method for reusing instructions when they compose small loops: the loop's instructions are first buffered in the Reorder Buffer and reused afterwards without the need for dynamically unrolling the loop, as commonly implemented by the traditional instruction reusing techniques. The proposed method is implemented with the introduction of two new auxiliary hardware structures in a typical superscalar microarchitecture: a Finite State Machine (FSM), used to detect the reusable loops; and a Log used to store the renaming data for each instruction when the loop is "unrolled". In order to evaluate the proposed method we modified the sim-outorder tool from Simplescalar v3.0 for the PISA, and Wattch v1.02 Power Performance simulators. Several different configurations and benchmarks have been used during the simulations. The obtained results show that by implementing this new method in a superscalar microarchitecture, the power efficiency can be improved without significantly affecting neither the performance nor the cost.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
 |
2
|
|
| |
3
|
G. Hamerly, E. Perelman, J. Lau, and B. Calder. Simpoint 3.0: Faster and more flexible program analysis. The Journal of Instruction-Level Parallelism, 7, Sep 2005.
|
| |
4
|
|
| |
5
|
|
| |
6
|
J. S. Hu , N. Vijaykrishnan , S. Kim , M. Kandemir , M. J. Irwin, Scheduling Reusable Instructions for Power Reduction, Proceedings of the conference on Design, automation and test in Europe, p.10148, February 16-20, 2004
|
 |
7
|
|
| |
8
|
G. E. Moore. Cramming more components onto integrated circuits. Proceedings of the IEEE, 86:82--85, Jan 1998.
|
 |
9
|
Subbarao Palacharla , Norman P. Jouppi , J. E. Smith, Complexity-effective superscalar processors, Proceedings of the 24th annual international symposium on Computer architecture, p.206-218, June 01-04, 1997, Denver, Colorado, United States
|
| |
10
|
S. J. Patel, D. H. Friendly, and Y. N. Patt. Critical issues regarding the trace cache fetch mechanism. Technical Report CSE-TR-335-97, University of Michigan, July 1997.
|
| |
11
|
F. Pratas. Low power microarchitecture with instruction reuse. Technical Report 21, INESC-ID, September 2007.
|
| |
12
|
C. Price. MIPS IV Instruction Set, revision 3.2. MIPS Technologies, Inc., Mountain View, CA, September 1995.
|
| |
13
|
J. P. Shen and M. H. Lipasti. Modern Processor Design: Fundamentals of Superscalar Processors. McGraw-Hill Companies, Inc., New York, NY, USA, 1st edition, 2005.
|
 |
14
|
|
 |
15
|
|
| |
16
|
|
| |
17
|
S. Wilton and N. Jouppi. Cacti: An enhanced cache access and cycle time model. IEEE Journal of Solid-State Circuits, 31:677--688, may 1996.
|
 |
18
|
|
|