ACM Home Page
Please provide us with feedback. Feedback
Data prefetch mechanisms
Full text PdfPdf (172 KB)
Source ACM Computing Surveys (CSUR) archive
Volume 32 ,  Issue 2  (June 2000) table of contents
Pages: 174 - 199  
Year of Publication: 2000
ISSN:0360-0300
Authors
Steven P. Vanderwiel  IBM Server Group, North Rochester, MN
David J. Lilja  Univ. of Minnesota, Minneapolis, MN
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 47,   Downloads (12 Months): 361,   Citation Count: 39
Additional Information:

abstract   references   cited by   index terms   review   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/358923.358939
What is a DOI?

ABSTRACT

The expanding gap between microprocessor and DRAM performance has necessitated the use of increasingly aggressive techniques designed to reduce or hide the latency of main memory access. Although large cache hierarchies have proven to be effective in reducing this latency for the most frequently used data, it is still not uncommon for many programs to spend more than half their run times stalled on memory requests. Data prefetching has been proposed as a technique for hiding the access latency of data referencing patterns that defeat caching strategies. Rather than waiting for a cache miss to initiate a memory fetch, data prefetching anticipates such misses and issues a fetch to the memory system in advance of the actual memory reference. To be effective, prefetching must be implemented in such a way that prefetches are timely, useful, and introduce little overhead. Secondary effects such as cache pollution and increased memory bandwidth requirements must also be taken into consideration. Despite these obstacles, prefetching has the potential to significantly improve overall program execution time by overlapping computation with memory accesses. Prefetching strategies are diverse, and no single strategy has yet been proposed that provides optimal performance. The following survey examines several alternative approaches, and discusses the design tradeoffs involved when implementing a data prefetch strategy.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
ANACKER,W.AND WANG, C. P. 1967. Performance evaluation of computing systems with memory hierarchies. IEEE Trans. Comput. 16,6, 764-773.
3
 
4
 
5
6
 
7
CASMIRA,J.P.AND KAELI, D. R. 1995. Modeling cache pollution. In Proceedings of the Second IASTED Conference on Modeling and Simulation. 123-126.
 
8
CHAN, K. K. 1996. Design of the HP PA 7200 CPU. Hewlett-Packard J. 47, 1, 25-33.
 
9
CHANG,P.Y.,KAELI, D., AND LIU, Y. 1994. Branchdirected data cache prefetching. In Proceedings of Second Workshop on Shared-Memory Multiprocessing Systems.
 
10
11
 
12
13
 
14
 
15
DAHLGREN, F., DUBOIS, M., AND STENSTROM,P. 1993. Fixed and adaptive sequential prefetching in shared-memory multiprocessors. In Proceedings of the International Conference on Parallel Processing (St. Charles, IL). 56-63.
 
16
FU, B., SAINI, A., AND GELSINGER, P. P. 1989. Performance and microarchitecture of the i486 processor. In Proceedings of the IEEE International Conference on Computer Design (Cambridge, MA). 182-187.
17
18
 
19
GORNISH,E.H.AND VEIDENBAUM, A. V. 1994. An integrated hardware/software scheme for shared-memory multiprocessors. In Proceedings of International Conference on Parallel Processing (St. Charles, IL). 281-284.
20
 
21
HARRISON,L.AND MEHROTRA, S. 1994. A data prefetch mechanism for accelerating general computation. 1351. University of Illinois at Urbana-Champaign, Champaign, IL.
22
23
24
 
25
26
 
27
 
28
29
 
30
31
 
32
OBERLIN, S., KESSLER, R., SCOTT, S., AND THORSON, G. 1996. Cray T3E architecture overview. Cray Supercomputers, Chippewa Falls, MN.
33
 
34
 
35
36
37
38
39
 
40
SMITH, A. J. 1978. Sequential program prefetching in memory hierarchies. IEEE Computer 11, 12, 7-21.
41
 
42
 
43
 
44
 
45
 
46
YOUNG,H.C.AND SHEKITA, E. J. 1993. An intelligent I-cache prefetch mechanism. In Proceedings of the International Conference on Computer Design (ICCD'93, Cambridge, MA). IEEE Computer Society Press, Los Alamitos, CA, 44-49.
47

CITED BY  39


REVIEW

"Andrea Prati : Reviewer"

The growing gap between microprocessor and memory performance has elected cache prefetching as a key research issue in recent years. Vanderwiel and Lilja present in this paper a survey on data prefetch mechanisms both for single processor an  more...

Collaborative Colleagues:
Steven P. Vanderwiel: colleagues
David J. Lilja: colleagues