ACM Home Page
Please provide us with feedback. Feedback
Theoretical modeling of superscalar processor performance
Full text PdfPdf (1.06 MB)
Source International Symposium on Microarchitecture archive
Proceedings of the 27th annual international symposium on Microarchitecture table of contents
San Jose, California, United States
Pages: 52 - 62  
Year of Publication: 1994
ISBN:0-89791-707-3
Authors
Derek B. Noonburg  Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA
John P. Shen  Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA
Sponsors
IEEE-CS\TCMM : TC on Microprocessors & Microcomputers
SIGMICRO: ACM Special Interest Group on Microarchitectural Research and Processing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 36,   Citation Count: 17
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/192724.192730
What is a DOI?

ABSTRACT

The current trace-driven simulation approach to determine superscalar processor performance is widely used but has some shortcomings. Modern benchmarks generate extremely long traces, resulting in problems with data storage, as well as very long simulation runtimes. More fundamentally, simulation generally does not provide significant insight into the factors that determine performance or a characterization of their interactions. This paper proposes a theoretical model of superscalar processor performance that addresses these shortcomings. Performance is viewed as an interaction of program parallelism and machine parallelism. Both program and machine parallelisms are decomposed into multiple component functions. Methods for measuring or computing these functions are described. The functions are combined to provide a model of the interaction between program and machine parallelisms and an accurate estimate of the performance. The computed performance, based on this model, is compared to simulated performance for six benchmarks from the SPEC92 suite on several configurations of the IBM RS/6000 instruction set architecture.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

AS92
 
DAF94
 
DSP93
 
HP90
 
Jou89
LW92
 
RF93
TGH92
Wal91

CITED BY  17
 
 
 
 
 
 
 

Collaborative Colleagues:
Derek B. Noonburg: colleagues
John P. Shen: colleagues

Peer to Peer - Readers of this Article have also read: