ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Memory-system design considerations for dynamically-scheduled processors
Full text PdfPdf (1.80 MB)
Source ACM SIGARCH Computer Architecture News archive
Volume 25 ,  Issue 2  (May 1997) table of contents
Special Issue: Proceedings of the 24th annual international symposium on Computer architecture (ISCA '97)
Pages: 133 - 143  
Year of Publication: 1997
ISSN:0163-5964
Also published in ...
Authors
Keith I. Farkas  Electrical and Computer Engineering, University of Toronto, 10 Kings College Road, Toronto, Ontario M5S 3G4, Canada
Paul Chow  Electrical and Computer Engineering, University of Toronto, 10 Kings College Road, Toronto, Ontario M5S 3G4, Canada
Norman P. Jouppi  Digital Equipment Corporation, Western Research Lab, 250 University Avenue, Palo Alto, California
Zvonko Vranesic  Electrical and Computer Engineering, University of Toronto, 10 Kings College Road, Toronto, Ontario M5S 3G4 Canada
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 42,   Citation Count: 24
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/384286.264156
What is a DOI?

Warning: The download time has expired please click on the item to try again.


ABSTRACT

In this paper, we identify performance trends and design relationships between the following components of the data memory hierarchy in a dynamically-scheduled processor: the register file, the lockup-free data cache, the stream buffers, and the interface between these components and the lower levels of the memory hierarchy. Similar performance was obtained from all systems having support for fewer than four in-flight misses, irrespective of the register-file size, the issue width of the processor, and the memory bandwidth. While providing support for more than four in-flight misses did increase system performance, the improvement was less than that obtained by increasing the number of registers. The addition of stream buffers to the investigated systems led to a significant performance increase, with the larger increases for systems having less in-flight-miss support, greater memory bandwidth, or more instruction issue capability. The performance of these systems was not significantly affected by the inclusion of traffic filters, dynamic-stride calculators, or the inclusion of the per-load non-unity stride-predictor and the incremental-prefetching techniques, which we introduce. However, the incremental prefetching technique reduces the bandwidth consumed by stream buffers by 50% without a significant impact on performance.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Norm Jouppi. Improving Direct Mapped Cache Performance by the Addition of a Small Fully Associative Cache and Prefetch Buffers. Technical Report TN-15, Digital E, quipment Corporation Western Research Lab, March 1990.
 
2
Keith I. Farkas. Memory-system Design Considerations for Dynamically-scheduled Microprocessors. PhD thesis, Department of Electrical and Computer Engineering, University of Toronto, Ontario, Canada, January 1997. (URL: http:llwww.eeeg.toronto.edul,,,farkaslthesis.phd.html).
 
3
4
 
5
 
6
Linley Gwermap. PA-8000 Combines Complexity and Speed. Microprocessor Reports, 8(15):1,6-11, 1994.
 
7
 
8
IBM Microelectronics. PowerPC 620 RISC Microprocessor Technical Summary, 10 1994. document number: MPR620TSU-01.
 
9
10
11
12
 
13
Scott McFarling. Combining Branch Predictors. Digital Equipment Corporation Western Research Lab Technical Note TN-36, 1993.
14
 
15
Keith I. Farkas, Paul Chow, Norman P. Jouppi, and Zvonko Vranesie. Memory-system Design Considerations for Dynamically-scheduled Processors. Technical Report 1, Digital Equipment Corporation Western Research Lab, 1997. (URL: http://www.research.digital.eom/wrl/teehreports).

CITED BY  24

Collaborative Colleagues:
Keith I. Farkas: colleagues
Paul Chow: colleagues
Norman P. Jouppi: colleagues
Zvonko Vranesic: colleagues