ACM Home Page
Please provide us with feedback. Feedback
The block-based trace cache
Full text PdfPdf (181 KB)
Source International Symposium on Computer Architecture archive
Proceedings of the 26th annual international symposium on Computer architecture table of contents
Atlanta, Georgia, United States
Pages: 196 - 207  
Year of Publication: 1999
ISBN:0-7695-0170-2
Also published in ...
Authors
Bryan Black  Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA
Bohuslav Rychlik  Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA
John Paul Shen  Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA
Sponsors
IEEE-CS\TCCA : TC on Computer Arhitecture
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
IEEE Computer Society  Washington, DC, USA
Bibliometrics
Downloads (6 Weeks): 15,   Downloads (12 Months): 33,   Citation Count: 15
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/300979.300996
What is a DOI?

ABSTRACT

The trace cache is a recently proposed solution to achieving high instruction fetch bandwidth by buffering and reusing dynamic instruction traces. This work presents a new block-based trace cache implementation that can achieve higher IPC performance with more efficient storage of traces. Instead of explicitly storing instructions of a trace, pointers to blocks constituting a trace are stored in a much smaller trace table. The block-based trace cache renames fetch addresses at the basic block level and stores aligned blocks in a block cache. Traces are constructed by accessing the replicated block cache using block pointers from the trace table. Performance potential of the block-based trace cache is quantified and compared with perfect branch prediction and perfect fetch schemes. Comparing to the conventional trace cache, the block-based design can achieve higher IPC, with less impact on cycle time.Results: Using the SPECint95 benchmarks, a 16-wide realistic design of a block-based trace cache can improve performance 75% over a baseline design and to within 7% of a baseline design with perfect branch prediction. With idealized trace prediction, it is shown the block-based trace cache with an 1K-entry block cache achieves the same performance of the conventional trace cache with 32K entries.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
3
 
4
 
5
 
6
 
7
IBM Microelectronics Division, PowerPC 604 RISC Microprocessor User's Manual, 1994
 
8
 
9
 
10
S. McFarling, "Combining Branch Predictors." Technical Report TN-36, Digital Equipment Corp., June 1993
 
11
 
12
13
 
14
 
15
16
 
17
 
18
19

CITED BY  15

Collaborative Colleagues:
Bryan Black: colleagues
Bohuslav Rychlik: colleagues
John Paul Shen: colleagues