|
ABSTRACT
Today's microprocessor cores reach high performance levels not only by their high clock rate but also by the concurrent execution of a large number of instructions. Because of the relationship between power and frequency, it becomes attractive to run an OoO (Out-of-Order) core at a frequency lower than its nominal frequency in the context of embedded or real-time systems. Unfortunately, whereas OoO pipelines have high average throughput, their highly variable and hard-to-predict execution rate makes them unsuitable for real-time systems with hard or even soft deadlines. In this paper, we demonstrate that the execution time of an OoO processor can be stable and predictable by controlling its MIPS (Mega Instructions Per Second) rate via a PID (Proportional, Integral, and Differential gain) feedback controller and DVFS (Dynamic Voltage and Frequency Scaling). The stabilized processor uses much less power per committed instruction, because of the reduced average frequency. The EPI (Energy Per Instruction) is also cut by an average of 28% across our benchmark programs. Since a stable MIPS rate is maintained consistently with lower power/energy per instruction, OoO processors stabilized by a feedback controller can realistically be deployed in real-time systems. To demonstrate this capability we select a subset of the MiBench benchmarks that displays the widest execution rate variations and stabilize their MIPS rate in the context of a 1GHz Pentium III-like microarchitecture.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
 |
3
|
|
| |
4
|
Doug Burger and Todd M. Austin. The SimpleScalar Tool Set Version 2.0. Technical Report 1342, Computer Sciences Department, University of Wisconsin--Madison, May 1997.
|
| |
5
|
Buce R. Childers, H. Tang and Rami Melhem, Adapting Processor Supply Voltage to Instruction-Level Parallelism, Koolchips 2000, during the 33rd Int'l. Symp. on Microarchitecture (MICRO--33), Monterey, CA, December 10, 2000.
|
 |
6
|
Francisco J. Cazorla , Peter M. W. Knijnenburg , Rizos Sakellariou , Enrique Fernández , Alex Ramirez , Mateo Valero, Architectural support for real-time task scheduling in SMT processors, Proceedings of the 2005 international conference on Compilers, architectures and synthesis for embedded systems, September 24-27, 2005, San Francisco, California, USA
[doi> 10.1145/1086297.1086320]
|
 |
7
|
Francisco J. Cazorla , Peter M.W. Knijnenburg , Rizos Sakellariou , Enrique Fernández , Alex Ramirez , Mateo Valero, Predictable performance in SMT processors, Proceedings of the 1st conference on Computing frontiers, April 14-16, 2004, Ischia, Italy
[doi> 10.1145/977091.977152]
|
| |
8
|
L. T. Clark, Circuit Design of XScale Microprocessors, In Proceedings of the 2001 Symposium on VLSI Circuits, June, 2001.
|
 |
9
|
|
| |
10
|
Steven Dropsho , Greg Semeraro , David H. Albonesi , Grigorios Magklis , Michael L. Scott, Dynamically Trading Frequency for Complexity in a GALS Microprocessor, Proceedings of the 37th annual IEEE/ACM International Symposium on Microarchitecture, p.157-168, December 04-08, 2004, Portland, Oregon
[doi> 10.1109/MICRO.2004.18]
|
 |
11
|
|
| |
12
|
|
 |
13
|
Stefan Valentin Gheorghita , Sander Stuijk , Twan Basten , Henk Corporaal, Automatic scenario detection for improved WCET estimation, Proceedings of the 42nd annual Design Automation Conference, June 13-17, 2005, Anaheim, California, USA
[doi> 10.1145/1065579.1065610]
|
| |
14
|
M. R. Guthaus , J. S. Ringenberg , D. Ernst , T. M. Austin , T. Mudge , R. B. Brown, MiBench: A free, commercially representative embedded benchmark suite, Proceedings of the Workload Characterization, 2001. WWC-4. 2001 IEEE International Workshop, p.3-14, December 02-02, 2001
[doi> 10.1109/WWC.2001.15]
|
| |
15
|
Greg Hamerly, Erez Perelman, Jeremy Lau, and Brad Calder, SimPoint 3.0: Faster and More Flexible Program Analysis, Journal of Instruction Level Parallelism, September 2005.
|
| |
16
|
|
 |
17
|
|
 |
18
|
Christopher J. Hughes , Praful Kaul , Sarita V. Adve , Rohit Jain , Chanik Park , Jayanth Srinivasan, Variability in the execution of multimedia applications and implications for architecture, Proceedings of the 28th annual international symposium on Computer architecture, p.254-265, June 30-July 04, 2001, Göteborg, Sweden
|
| |
19
|
|
| |
20
|
Yoshifumi Ikenaga, Employ Supply Voltage Control to Save Energy, Energy-saving Designs, NEC corporation.
|
 |
21
|
|
 |
22
|
|
 |
23
|
|
 |
24
|
|
| |
25
|
|
 |
26
|
Zhijian Lu , Jason Hein , Marty Humphrey , Mircea Stan , John Lach , Kevin Skadron, Control-theoretic dynamic frequency and voltage scaling for multimedia workloads, Proceedings of the 2002 international conference on Compilers, architecture, and synthesis for embedded systems, October 08-11, 2002, Grenoble, France
[doi> 10.1145/581630.581654]
|
| |
27
|
Mistry, K. Armstrong, M. Auth, C. Cea, S. Coan, T. Ghani, T. Hoffmann, T. Murthy, A. Sandford, J. Shaheed, R. Zawadzki, K. Zhang, K. Thompson, S. Bohr, M. Delaying forever: Uniaxial strained silicon transistors in a 90nm CMOS technology, Symposium on VLSI Technology, p. 50, (2004).
|
| |
28
|
Christian Poellabauer, Tao Zhang, Santosh Pande, and Karsten Schwan, An Efficient Frequency Scaling Approach for Energy-Aware Embedded Real-Time Systems, Proceedings of the International Conference on Architecture of Computing Systems (ARCS'05), Innsbruck, Austria, March 2005.
|
| |
29
|
|
| |
30
|
Greg Semeraro , David H. Albonesi , Steven G. Dropsho , Grigorios Magklis , Sandhya Dwarkadas , Michael L. Scott, Dynamic frequency and voltage control for a multiple clock domain microarchitecture, Proceedings of the 35th annual ACM/IEEE international symposium on Microarchitecture, November 18-22, 2002, Istanbul, Turkey
|
| |
31
|
John Paul Shen, Lost in the Bermuda Triangle: Complexity vs Energy vs Performance, WCED, June 18, 2006. http://www.csl.cornell.edu/~albonesi/wced06/shen.pdf
|
| |
32
|
|
| |
33
|
David C. Snowdon, Sergio Ruocco and Gernot Heiser, Power Management and Dynamic Voltage Scaling: Myths and Facts, Proceedings of the 2005 Workshop on Power Aware Real-time Computing, New Jersey, USA, September, 2005
|
| |
34
|
P. Stanley-Marbell and M. Hsiao and U. Kremer, A Hardware Architecture for Dynamic Performance and Energy Adaptation, In PACS-02, in conjunction with 8th IEEE International Symposium on High-Performance Computer Architecture.
|
| |
35
|
Jinho Suh and Michel Dubois, Dynamic MIPS Stabilization in Out-of-Order Processors, USC Computer Engineering Technical Report CENG-2009-1, 2009.
|
| |
36
|
|
 |
37
|
Ankush Varma , Brinda Ganesh , Mainak Sen , Suchismita Roy Choudhury , Lakshmi Srinivasan , Jacob Bruce, A control-theoretic approach to dynamic voltage scheduling, Proceedings of the 2003 international conference on Compilers, architecture and synthesis for embedded systems, October 30-November 01, 2003, San Jose, California, USA
[doi> 10.1145/951710.951744]
|
| |
38
|
C Xu, TM Le, TT Lay, H.264/AVC CODEC: Instruction Level Complexity Analysis. Ninth IASTED International Conference on Internet and Multimedia Systems and Applications; Honolulu, HI; USA; 15--17 Aug. 2005.
|
| |
39
|
|
 |
40
|
|
|