| Key features of the design methodology enabling a multi-core SoC implementation of a first-generation CELL processor |
| Full text |
Pdf
(312 KB)
|
| Source
|
Asia and South Pacific Design Automation Conference
archive
Proceedings of the 2006 Asia and South Pacific Design Automation Conference
table of contents
Yokohama, Japan
SESSION: Designers' forum: "cell" processor
table of contents
Pages: 871 - 878
Year of Publication: 2006
ISBN:0-7803-9451-8
|
|
Authors
|
|
Dac Pham
|
IBM Systems and Technology Group, Austin, TX
|
|
Hans-Werner Anderson
|
IBM Systems and Technology Group, Austin, TX
|
|
Erwin Behnen
|
IBM Systems and Technology Group, Austin, TX
|
|
Mark Bolliger
|
IBM Systems and Technology Group, Austin, TX
|
|
Sanjay Gupta
|
IBM Systems and Technology Group, Austin, TX
|
|
Peter Hofstee
|
IBM Systems and Technology Group, Austin, TX
|
|
Paul Harvey
|
IBM Systems and Technology Group, Austin, TX
|
|
Charles Johns
|
IBM Systems and Technology Group, Austin, TX
|
|
Jim Kahle
|
IBM Systems and Technology Group, Austin, TX
|
|
Atsushi Kameyama
|
Toshiba America Electronic Components, Austin, TX
|
|
John Keaty
|
IBM Systems and Technology Group, Austin, TX
|
|
Bob Le
|
IBM Systems and Technology Group, Austin, TX
|
|
Sang Lee
|
IBM Systems and Technology Group, Austin, TX
|
|
Tuyen Nguyen
|
IBM Systems and Technology Group, Austin, TX
|
|
John Petrovick
|
IBM Systems and Technology Group, Austin, TX
|
|
Mydung Pham
|
IBM Systems and Technology Group, Austin, TX
|
|
Juergen Pille
|
IBM Systems and Technology Group, Austin, TX
|
|
Stephen Posluszny
|
IBM Systems and Technology Group, Austin, TX
|
|
Mack Riley
|
IBM Systems and Technology Group, Austin, TX
|
|
Joseph Verock
|
IBM Systems and Technology Group, Austin, TX
|
|
James Warnock
|
IBM Systems and Technology Group, Austin, TX
|
|
Steve Weitzel
|
IBM Systems and Technology Group, Austin, TX
|
|
Dieter Wendel
|
IBM Systems and Technology Group, Austin, TX
|
|
| Sponsors |
|
| Publisher |
IEEE Press
Piscataway, NJ, USA
|
| Bibliometrics |
Downloads (6 Weeks): 12, Downloads (12 Months): 67, Citation Count: 3
|
|
|
ABSTRACT
This paper reviews the design challenges that current and future processors must face, with stringent power limits and high frequency targets, and the design methods required to overcome the above challenges and address the continuing Giga-scale system integration trend. This paper then describes the details behind the design methodology that was used to successfully implement a first-generation CELL processor - a multi-core SoC. Key features of this methodology are broad optimization with fast rule-based analysis engines using macro-level abstraction for constraints propagation up/down the design hierarchy, coupled with accurate transistor level simulation for detailed analysis. The methodology fostered the modular design concept that is inherent to the CELL architecture, enabling a high frequency design by maximizing custom circuit content through re-use, and balanced power, frequency, and die size targets through global convergence capabilities. The design has roughly 241 million transistors implemented in 90nm SOI technology with 8 levels of copper interconnects and one local interconnect layer. The chip has been tested at various temperatures, voltages, and frequencies. Correct operation has been observed in the lab on first pass silicon at frequencies well over 4GHz.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
D. Pham et al, "The Design and Implementation of a First-Generation CELL Processor", ISSCC 2005 Digest of Technical Papers, Feb. 2005, pp. 184--185.
|
| |
2
|
B. Flachs et al, "The Microarchitecture of the Streaming Processor for a CELL Processor", ISSCC 2005 Digest of Technical Papers, Feb. 2005, pp. 134--135.
|
| |
3
|
T. Asano et al, "A 4.8GHz Fully Pipelined Embedded SRAM in the Streaming Processor of a CELL Processor", ISSCC 2005 Digest of Technical Papers, Feb. 2005, pp. 486--487.
|
| |
4
|
S. Clark et al, "IBM CELL Interconnect Unit, Bus and Memory Controller", Hot Chip'05, Aug. 2005, Paper #1.2
|
| |
5
|
F. Klass, C. Amir, A. Das, K. Aingaran, C. Truong, R. Wang, A. Mehta, R. Heald, G. Yee, "A New Family of Semi-dynamic and Dynamic Flip-Flops with Embedded Logic for High-Performance Processors", IEEE J. Solid State Circuits, vol. 34, pp. 712--716 (1999).
|
| |
6
|
L. Sigal , J. D. Warnock , B. W. Curran , Y. H. Chan , P. J. Camporese , M. D. Mayo , W. V. Huott , D. R. Knebel , C. T. Chuang , J. P. Eckhardt , P. T. Wu, Circuit design techniques for the high-performance CMOS IBM S/390 parallel enterprise server G4 microprocessor, IBM Journal of Research and Development, v.41 n.4-5, p.489-503, July/Sept. 1997
|
| |
7
|
P. J. Restle, et al, "A Clock Distribution Method for Microprocessors", IEEE J. Solid-State Circuits, vol. 36, pp 792--799, May 2001
|
| |
8
|
P. J. Restle, et al, "The Clock Distribution of the Power4 Microprocessor", IEEE International Solid-State Circuits Conference 2002 Digest of Technical Papers, vol. 45, pp 144--145
|
| |
9
|
K. Yazawa and M. Ishizuka, "Thermal Modeling with Transfer Function for the Transient Chip-On-Substrate Problem", Thermal Science and Engineering, vol. 13, No. 1, Heat Transfer Society of Japan, 2005, pp. 37--40
|
 |
10
|
S. Posluszny , N. Aoki , D. Boerstler , P. Coulman , S. Dhong , B. Flachs , P. Hofstee , N. Kojima , O. Kwon , K. Lee , D. Meltzer , K. Nowka , J. Park , J. Peter , J. Silberman , O. Takahashi , P. Villarrubia, “Timing closure by design,” a high frequency microprocessor design methodology, Proceedings of the 37th conference on Design automation, p.712-717, June 05-09, 2000, Los Angeles, California, United States
[doi> 10.1145/337292.337749]
|
| |
11
|
Rao, V., J. Soreff, T. Brodnax, and R. Mains, "EinsTLT: Transistor Level Timing with EinsTimer," Proc. Of Int. Workshop on Timing Issues (TAU), 1999.
|
| |
12
|
Lee, Sang Y, J. Warnock, E. Behnen, J. Soreff, V. Rao, and S. Posluszny, "Improved Transistor-Level Timing Methodology for a CELL Microprocessor," ASPDAC 2006 (submitted for publication)
|
| |
13
|
Warnock, J. D., Erwin Behnen, Sang Y. Lee, and Jeffrey Soreff, "Improved Method for Timing Margin Calculation," IBM Invention Publish, Feb. 2004.
|
| |
14
|
Behnen, E., Jeffrey Soreff, James D. Warnock, and Dieter Wendel, "Method to Apply Latch Transparency Locally While Avoiding It Globally During Timing," Filed with U. S. Patent Office, May 2004.
|
| |
15
|
Soreff, J., Vasant Rao, James D. Warnock, Sang Y. Lee, and David Winston, "Pulse waveform timing in EinsTLT templates," Filed with U. S. Patent Office, May 2004.
|
| |
16
|
Warnock, J. D. and Jeffrey Soreff, "Improved Method of Timing Model Abstraction for Circuits Potentially Simultaneously Switching Internal Signals," Filed with U. S. Patent Office, May 2004.
|
| |
17
|
Devgan, A. and R. A. Rohrer, "Adaptively controlled explicit simulation," IEEE Trans. Computer-Aided Design, vol. 13, pp.746--762, June 1994.
|
| |
18
|
Pillage, L. T. and R. A. Rohrer, "Asymptotic waveform evaluation for timing analysis," IEEE Trans. Computer-Aided Design, vol. 9, No. 4, pp. 352--366, April 1990.
|
| |
19
|
Ratzlaff, C. L, N. Gopal, and L. T. Pillage, "RICE: Rapid interconnect circuit evaluator," IEEE Trans. Computer-Aided Design, vol. 13, No. 6, pp. 763--776, June 1994.
|
| |
20
|
K. Chang et al, "Clocking and Circuit Design for a Parallel I/O on a First-Generation CELL Processor", ISSCC'05 Paper #28.9
|
| |
21
|
Pham, D. et al. "Overview of the Architecture, Circuit Design, and Physical Implementation of a First-Generation CELL Processor," JSSCC, October. 2005 Special issue (submitted for publication).
|
INDEX TERMS
Primary Classification:
C.
Computer Systems Organization
C.0
GENERAL
Additional Classification:
B.
Hardware
B.7
INTEGRATED CIRCUITS
B.7.0
General
B.8
Performance and Reliability
B.8.2
Performance Analysis and Design Aids
C.
Computer Systems Organization
C.4
PERFORMANCE OF SYSTEMS
Subjects:
Design studies
General Terms:
Design,
Performance,
Theory
Keywords:
64-bit Power Architecture,
CELL Processor,
Linux,
SOC,
SOI,
clock distribution,
correct-by-construction,
design dependency solution,
design environment,
design hierarchy,
digital thermal sensor,
flexible IO,
hardware content protection,
high-performance latch,
linear sensor,
local clock buffer,
modularity,
multi-core,
multi-operating system,
multi-threading,
power management,
re-use,
real-time system,
synergistic processor,
thermal management,
virtualization technology
|