| Performance evaluation of the PowerPC 620 microarchitecture |
| Full text |
Pdf
(1.35 MB)
|
| Source
|
International Symposium on Computer Architecture
archive
Proceedings of the 22nd annual international symposium on Computer architecture
table of contents
S. Margherita Ligure, Italy
Pages: 163 - 174
Year of Publication: 1995
ISBN:0-89791-698-0
Also published in ...
|
|
Authors
|
|
Trung A. Diep
|
Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania
|
|
Christopher Nelson
|
Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania
|
|
John Paul Shen
|
Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 11, Downloads (12 Months): 46, Citation Count: 15
|
|
|
ABSTRACT
The PowerPC 620™ microprocessor is the most recent and performance leading member of the PowerPC™ family. The 64-bit PowerPC 620 microprocessor employs a two-phase branch prediction scheme, dynamic renaming for all the register files, distributed multi-entry reservation stations, true out-of-order execution by six execution units, and a completion buffer for ensuring precise exceptions. This paper presents an instruction-level performance evaluation of the 620 microarchitecture. A performance simulator is developed using the VMW (Visualization-based Microarchitecture Workbench) retargetable framework. The VMW-based simulator accurately models the microarchitecture down to the machine cycle level. Extensive trace-driven simulation is performed using the SPEC92 benchmarks. Detailed quantitative analyses of the effectiveness of all key microarchitecture features are presented.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
R. Colwell and R. Steck. "A 0.61am BiCMOS Processor with Dynamic Execution." ISSCC Proc., 1995.
|
| |
2
|
|
| |
3
|
L. Gwennap. "Comparing RISC Microprocessors." Proc. of the Microprocessor Forum, Oct. 1994.
|
| |
4
|
L. Gwennap. "Intel's P6 Uses Decoupled Superscalar Design." Microprocessor Report, February, 1995.
|
| |
5
|
|
| |
6
|
|
| |
7
|
R. Jain. The Art of Computer Systems Performance Analysis, John Wiley & Sons, 1991.
|
| |
8
|
M. Johnson. Superscalar Microprocessor Design, Prentice-Hall, 1990.
|
| |
9
|
|
| |
10
|
|
| |
11
|
J. Lee and A. Smith. "Branch Prediction Strategies and Branch Target Buffer Design." Computer, Jan. 1984.
|
| |
12
|
|
| |
13
|
E Rubinfeld;,"An Overview of the Alpha AXP 21164 Microarchitecture. Proc. of Hot Chips VI, Oct. 1994.
|
| |
14
|
R. Tomasulo. "An Efficient Algorithm for Exploiting Multiple Arithmetic Units." IBM JRD, Jan. 1967.
|
 |
15
|
|
| |
16
|
IBM Assembler Language Reference Manual, 1990.
|
| |
17
|
Motorola Optimizing C and Fortran Compilation System User's Manual, 1992.
|
| |
18
|
PowerPC 601 RISC Microprocessor User's Manual, 1993.
|
| |
19
|
PowerPC 603 Microprocessor Implementation Definition, Book IV, 1992.
|
| |
20
|
PowerPC 604 Microprocessor Implementation Features Book IV, 1993.
|
| |
21
|
PowerPC 620 Microprocessor Implementation Definition, 1992.
|
| |
22
|
PowerPC Implementation Definition for the 601 Processor, Book IV, May, 1992
|
| |
23
|
RS/6000 Special Issue of the IBM JRD, Jan. 1990.
|
| |
24
|
PowerPC User Instruction Set Architecture, Book I, November 1993
|
| |
25
|
SPEC Newsletter. Systems Performance Evaluatmn Cooperative, 1992.
|
CITED BY 15
|
Parthasarathy Ranganathan , Vijay S. Pai , Sarita V. Adve, Using speculative retirement and larger instruction windows to narrow the performance gap between memory consistency models, Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures, p.199-210, June 23-25, 1997, Newport, Rhode Island, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE Design Automation Conference on
Gwo-Dong Chen
, Daniel D. Gajski
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
|