| Instruction-driven clock scheduling with glitch mitigation |
| Full text |
Pdf
(1.02 MB)
|
Source
|
International Symposium on Low Power Electronics and Design
archive
Proceeding of the thirteenth international symposium on Low power electronics and design
table of contents
Bangalore, India
SESSION: Microarchitectural techniques
table of contents
Pages 357-362
Year of Publication: 2008
ISBN:978-1-60558-109-5
|
|
Authors
|
|
Gu-Yeon Wei
|
Harvard University, Cambridge, MA, USA
|
|
David Brooks
|
Harvard University, Cambridge, MA, USA
|
|
Ali Durlov Khan
|
Harvard University, Cambridge, MA, USA
|
|
Xiaoyao Liang
|
Harvard University, Cambridge, MA, USA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 7, Downloads (12 Months): 43, Citation Count: 0
|
|
|
ABSTRACT
Instruction-driven clock scheduling is a mechanism that minimizes clock power in deeply-pipelined datapaths. Analysis of realistic processor workloads shows a preponderance of bubbles persist through pipelines like the floating point unit. Clock scheduling ostensibly adapts pipeline depth with respect to bubbles in the instruction stream without performance loss. Unfortunately, shallower pipelines (i.e. longer pipe stages) are prone to larger amounts of glitches propagating through logic, increasing dynamic power. Experimentally measured results from a 130nm FPU test chip with flexible clocking capabilities show a super-linear increase in glitch-induced dynamic power for shallower pipelines. While higher glitch power can severely diminish the power savings offered by clock scheduling, judicious clocking of intermediate stages offers glitch mitigation to recover power savings for worst-case scenarios. Detailed analysis of clock scheduling applied to a FPU in a POWER4-like processor running realistic workloads shows an average net power savings of 15% compared to an aggressively clock-gated design.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
 |
2
|
|
| |
3
|
D. Brooks , P. Bose , V. Srinivasan , M. K. Gschwind , P. G. Emma , M. G. Rosenfield, New methodology for early-stage, microarchitecture-level power-performance analysis of microprocessors, IBM Journal of Research and Development, v.47 n.5-6, p.653-670, September 2003
|
| |
4
|
|
| |
5
|
David M. Brooks , Pradip Bose , Stanley E. Schuster , Hans Jacobson , Prabhakar N. Kudva , Alper Buyuktosunoglu , John-David Wellman , Victor Zyuban , Manish Gupta , Peter W. Cook, Power-Aware Microarchitecture: Design and Modeling Challenges for Next-Generation Microprocessors, IEEE Micro, v.20 n.6, p.26-44, November 2000
[doi> 10.1109/40.888701]
|
| |
6
|
X. Liang, D. Brooks, and G.-Y. Wei, "A process variation tolerant floating-point unit with voltage interpolation and variable latency," in Proc. IEEE International Solid-State Circuits Conference, Feb. 2008.
|
| |
7
|
Hans Jacobson , Pradip Bose , Zhigang Hu , Alper Buyuktosunoglu , Victor Zyuban , Rick Eickemeyer , Lee Eisen , John Griswell , Doug Logan , Balaram Sinharoy , Joel Tendler, Stretching the Limits of Clock-Gating Efficiency in Server-Class Processors, Proceedings of the 11th International Symposium on High-Performance Computer Architecture, p.238-242, February 12-16, 2005
[doi> 10.1109/HPCA.2005.33]
|
 |
8
|
|
|