ACM Home Page
Please provide us with feedback. Feedback
Trace cache: a low latency approach to high bandwidth instruction fetching
Full text PdfPdf (1.38 MB)
Source International Symposium on Microarchitecture archive
Proceedings of the 29th annual ACM/IEEE international symposium on Microarchitecture table of contents
Paris, France
Pages: 24 - 35  
Year of Publication: 1996
ISBN:0-8186-7641-8
Authors
Eric Rotenberg  Computer Science Dept., Univ. of Wisconsin - Madison
Steve Bennett  Intel Corporation
James E. Smith  Dept. of Elec. and Comp. Engr., Univ. of Wisconsin - Madison
Sponsors
IEEE-CS\TCMM : TC on Microprocessors & Microcomputers
SIGMICRO: ACM Special Interest Group on Microarchitectural Research and Processing
Publisher
IEEE Computer Society  Washington, DC, USA
Bibliometrics
Downloads (6 Weeks): 14,   Downloads (12 Months): 94,   Citation Count: 110
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

As the issue width of superscalar processors is increased, instruction fetch bandwidth requirements will also increase. It will become necessary to fetch multiple basic blocks per cycle. Conventional instruction caches hinder this effort because long instruction sequences are not always in contiguous cache locations. We propose supplementing the conventional instruction cache with a trace cache. This structure caches traces of the dynamic instruction stream, so instructions that are otherwise noncontiguous appear contiguous. For the Instruction Benchmark Suite (IBS) and SPEC92 integer benchmarks, a 4 kilobyte trace cache improves performance on average by 28% over conventional sequential fetching. Further, it is shown that the trace cache's efficient, low latency approach enables it to outperform more complex mechanisms that work solely out of the instruction cache.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
3
 
4
5
6
 
7
 
8
J. Lee and A. J. Smith. Branch prediction strategies and branch target buffer design. IEEE Computer, 21(7):6-22, Jan 1984.
 
9
J. Losq. Generalized history table for branch prediction. IBM Technical Disclosure Bulletin, 25(1 ):99-101, June 1982.
 
10
11
 
12
E. Rotenberg, S. Bennett, and J. Smith. Trace cache: a low latency approach to high bandwidth instruction fetching. Tech Report 1310, CS Dept., Univ. ofWisc. - Madison, 1996.
 
13
14
 
15
16
17

CITED BY  110

Collaborative Colleagues:
Eric Rotenberg: colleagues
Steve Bennett: colleagues
James E. Smith: colleagues