ACM Home Page
Please provide us with feedback. Feedback
Exploring configurations of functional units in an out-of-order superscalar processor
Full text PdfPdf (894 KB)
Source International Symposium on Computer Architecture archive
Proceedings of the 22nd annual international symposium on Computer architecture table of contents
S. Margherita Ligure, Italy
Pages: 117 - 125  
Year of Publication: 1995
ISBN:0-89791-698-0
Also published in ...
Authors
Stéphan Jourdan  Institut de Recherche en Informatique de Toulouse, Université Paul Sabatier, 118 route de Narbonne, 31062 Toulouse cedex, France
Pascal Sainrat  Institut de Recherche en Informatique de Toulouse, Université Paul Sabatier, 118 route de Narbonne, 31062 Toulouse cedex, France
Daniel Litaize  Institut de Recherche en Informatique de Toulouse, Université Paul Sabatier, 118 route de Narbonne, 31062 Toulouse cedex, France
Sponsors
IEEE-CS\TCCA : TC on Computer Arhitecture
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 28,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/223982.224366
What is a DOI?

ABSTRACT

This study has been carried out in order to determine cost-effective configurations of functional units for multiple-issue out-of-order superscalar processors. The trace-driven simulations were performed on the six integer and the fourteen floating-point programs from the SPEC 92 suite. We first evaluate the number of instructions allowed to be concurrently processed by the execution stages of the pipeline. We then apply some restrictions on the execution issue of different instruction classes in order to define these configurations. We conclude that five to nine functional units are necessary to exploit Instruction-Level Parallelism. An important point is that several data cache ports are required in a processor of degree 4 or more. Finally, we report on complementary results on the utilization rate of the functional units.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

BuPa92
 
Dec95
DEC, "Scheduling and Issuing Rules for the Alpha 21164", Product Documentation, November 1994
 
Hsu94
 
IbMo94
IBM, "PowerPC 604 RISC Microprocessor Technical Summary", IBM Advance Information, MPR604TSU-1, 1994
 
Inte93
Intel, "Pentium Processor User's Manual", 1993
 
John91
M. Johnson, "Superscalar Microprocessor Design, Prentice-Hall, 1991
 
Mips94
bliPS, "R10000 Microprocessor Product Overview", October, 1994
 
Moto91
 
Smit91
M.D. Smith, "Tracing with Pixie", Stanford University, April 1991
SmPl85
 
SmWe94
 
SPEC92
SPEC 92 -- Technical Manual --- Rev. 1.1, 1992
 
Sun95
SUN, "UltraSparc: Next Generation Superscalar 64- Bit Sparc", Compcon 95, 1995
 
Toma67
R.M. Tomasulo, "An efficient Algorithm for Exploiting Multiple Arithmetic Units", IBM Journal, vol. 11, January 1967.
YePa92


Collaborative Colleagues:
Stéphan Jourdan: colleagues
Pascal Sainrat: colleagues
Daniel Litaize: colleagues