Theoretical modeling of superscalar processor performance
Source
International Symposium on Microarchitecture
Proceedings of the 27th annual international symposium on Microarchitecture
San Jose, California, United States
Pages: 52-62
Year of Publication: 1994
ISBN: 0-89791-707-3
Authors
Derek B. Noonburg, Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA
John P. Shen, Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA
ABSTRACT
The current trace-driven simulation approach to determine superscalar processor performance is widely used but has some shortcomings. Modern benchmarks generate extremely long traces, resulting in problems with data storage, as well as very long simulation runtimes. More fundamentally, simulation generally does not provide significant insight into the factors that determine performance or a characterization of their interactions. This paper proposes a theoretical model of superscalar processor performance that addresses these shortcomings. Performance is viewed as an interaction of program parallelism and machine parallelism. Both program and machine parallelisms are decomposed into multiple component functions. Methods for measuring or computing these functions are described. The functions are combined to provide a model of the interaction between program and machine parallelisms and an accurate estimate of the performance. The computed performance, based on this model, is compared to simulated performance for six benchmarks from the SPEC92 suite on several configurations of the IBM RS/6000 instruction set architecture.
CITED BY 17
Engin Ipek, Sally A. McKee, Karan Singh, Rich Caruana, Bronis R. de Supinski, Martin Schulz, Efficient architectural design space exploration via predictive modeling, ACM Transactions on Architecture and Code Optimization (TACO), v.4 n.4, p.1-34, January 2008