|
ABSTRACT
Due to increasing clock speeds, increasing design sizes and shrinking technologies, it is becoming more and more challenging to distribute a single global clock throughout a chip. In this paper we study the effect of using a Globally Asynchronous Locally Synchronous (GALS) organization for a superscalar, out-of-order processor, both in terms of power and performance. To this end, we propose a novel modeling and simulation environment for multiple clock cores with static or dynamically variable voltages for each synchronous block. Using this design exploration environment we were able to assess the power/performance tradeoffs available for Multiple Clock, Single Voltage (MCSV), as well as Multiple Clock, Dynamic Voltage (MCDV) cores. Our results show that MCSV processors are 10% more power efficient when compared to single-clock single voltage designs with a performance penalty of about 10%. By exploiting the flexibility of independent dynamic voltage scaling the various clock domains, the power efficiency of GALS designs can be improved by 12% on average, and up to 20% more in select cases. The power efficiency of MCDV cores becomes comparable with the one of Single Clock, Dynamic Voltage (SCDV) cores, while being up to 8% better in some cases. Our results show that MCDV cores consume 22% less power at an average 12% performance loss.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
R. Ronen, A. Mendelson, K. Lai, L. Shih-Lien, F. Pollack, and J. P. Shen, "Coming Challenges in Architecture and Microarchitecture," Proceedings of the IEEE, March 2001.
|
| |
3
|
|
| |
4
|
|
 |
5
|
A. Hemani , T. Meincke , S. Kumar , A. Postula , T. Olsson , P. Nilsson , J. Oberg , P. Ellervee , D. Lundqvist, Lowering power consumption in clock by using globally asynchronous locally synchronous design style, Proceedings of the 36th ACM/IEEE conference on Design automation, p.873-878, June 21-25, 1999, New Orleans, Louisiana, United States
[doi> 10.1145/309847.310091]
|
| |
6
|
|
| |
7
|
|
| |
8
|
|
| |
9
|
|
 |
10
|
|
| |
11
|
G. Qu et al., "Energy Minimization of System Pipelines Using Multiple Voltages," in IEEE Intl. Symp. on Circuits and Systems, 1999.
|
| |
12
|
|
| |
13
|
Greg Semeraro , Grigorios Magklis , Rajeev Balasubramonian , David H. Albonesi , Sandhya Dwarkadas , Michael L. Scott, Energy-Efficient Processor Design Using Multiple Clock Domains with Dynamic Voltage and Frequency Scaling, Proceedings of the 8th International Symposium on High-Performance Computer Architecture, p.29, February 02-06, 2002
|
 |
14
|
|
| |
15
|
|
| |
16
|
P. E. Gronowski, W. J. Bowhill, and R. P. Preston, "High-Performance Microprocessor Design," IEEE Journal of Solid State Circuits, May 1998.
|
| |
17
|
P. J. Restle et al., "A Clock Distribution Network for Microprocessors," IEEE Journal of Solid State Circuits (JSSC), May 2001.
|
| |
18
|
S. Tam, S. Rusu, U. N. Desai, R. Kim, J. Zhang, and I. Young, "Clock Generation and Distribution for the First IA-64 Microprocessor," IEEE Journal of Solid State Circuits (JSSC), November 2000.
|
| |
19
|
K. Chen and C. Hu, "Performance and Vdd Scaling in Deep Submicrometer CMOS," IEEE Journal of Solid State Circuits (JSSC), October 1998.
|
 |
20
|
|
| |
21
|
D. W. Bailey and B. J. Benschneider, "Clocking Design and Analysis for a 600-MHz Alpha Microprocessor," IEEE Journal of Solid State Circuits (JSSC), Nov 1998.
|
 |
22
|
|
 |
23
|
|
CITED BY 18
|
|
|
|
|
|
|
|
|
|
|
Venkata Syam P. Rapaka , Emil Talpes , Diana Marculescu, Mixed-clock issue queue design for energy aware, high-performance cores, Proceedings of the 2004 conference on Asia South Pacific design automation: electronic design and solution fair, p.380-383, January 27-30, 2004, Yokohama, Japan
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|