ABSTRACT
Large caches are necessary in current high-performance computer systems to provide the required high memory bandwidth. Because a small decrease in cache performance can result in significant system performance degradation, accurately characterizing the performance of large caches is important. Although measurements on actual systems have shown that operating systems and multiprogramming can affect cache performance, previous studies have not focused on these effects. We have developed a program tracing technique called ATUM (Address Tracing Using Microcode) that captures realistic traces of multitasking workloads including the operating system. Examining cache behavior using these traces from a VAX processor shows that both the operating system and multiprogramming activity significantly degrade cache performance, with an even greater proportional impact on large caches. From a careful analysis of the causes of this degradation, we explore various techniques to reduce this loss. While seemingly little can be done to mitigate the effect of system references, multitasking cache miss activity can be substantially reduced with small hardware additions.
REFERENCES
2. AGARWAL, A., CHOW, P., HOROWITZ, M., ACKEN, J., SALZ, A., AND HENNESSY, J. On-chip instruction caches for high performance processors. In Proceedings of the Conference on Advanced Research in VLSI (Stanford, Cal., March 1987). MIT Press, Boston, Mass., 1987, pp. 1-24.
3. AGARWAL, A., HOROWITZ, M., AND HENNESSY, J. An analytical cache model. Computer Systems Lab. Rep. 86-304, Stanford Univ., Sept. 1986.
10. EASTON, M. C. Computation of cold-start miss ratios. IEEE Trans. Comput. C-27, 5 (May 1978).
12. FU, J., KELLER, J. B., AND HADUCH, K. J. Aspects of the VAX 8800 C box design. Digital Tech. J. 4 (Feb. 1987), 41-51.
13. GOODMAN, J. R. Cache memory optimization to reduce processor/memory traffic. Dept. of Computer Sciences, Univ. of Wisconsin-Madison, 1985.
15. HENNESSY, J. L. VLSI processor architecture. IEEE Trans. Comput. C-33, 12 (Dec. 1984).
16. Mark Hill, Susan Eggers, Jim Larus, George Taylor, Glenn Adams, B. K. Bose, Garth Gibson, Paul Hansen, Jon Keller, Shing Kong, Corinna Lee, Daebum Lee, Joan Pendleton, Scott Ritchie, David Wood, Ben Zorn, Paul Hilfinger, Dave Hodges, Randy Katz, John Ousterhout, Dave Patterson, Design decisions in SPUR, Computer, v.19 n.11, p.8-22, Nov. 1986. doi: 10.1109/MC.1986.1663096
17. HOROWITZ, M., AND CHOW, P. The MIPS-X microprocessor. In Proceedings of IEEE WESCON 85 (San Francisco, 1985). IEEE, New York, 1985.
18. KAPLAN, K. R., AND WINDER, R. O. Cache-based computer systems. Comput. 6, 3 (March 1973), 30-36.
20. LAHA, S., PATEL, J. H., AND IYER, R. K. Accurate low-cost methods for performance evaluation of cache memory systems. Coordinated Science Laboratory, Univ. of Illinois, 1986.
21. MOUSSOURIS, J. ET AL. A CMOS RISC processor with integrated system functions. In COMPCON (San Francisco, Mar. 1986). IEEE, New York, March 1986, pp. 126-131.
22. PATTERSON, D. A., AND SEQUIN, C. H. Design considerations for single-chip computers of the future. IEEE Trans. Comput. C-29, 2 (Feb. 1980), 108-116.
33. VAX-11 Architecture Reference Manual. Form EK-VARAR-RM-001, Digital Equipment Corp., Bedford, Mass., 1982.
CITED BY 67
Michael Huang, Jose Renau, Seung-Moon Yoo, Josep Torrellas, L1 data cache decomposition for energy efficiency, Proceedings of the 2001 international symposium on Low power electronics and design, p.10-15, August 2001, Huntington Beach, California, United States
Srilatha Manne, Dirk Grunwald, Fabio Somenzi, Remembrance of things past: locality and memory in BDDs, Proceedings of the 34th annual conference on Design automation, p.196-201, June 09-13, 1997, Anaheim, California, United States
Antonio González, Mateo Valero, Nigel Topham, Joan M. Parcerisa, Eliminating cache conflict misses through XOR-based placement functions, Proceedings of the 11th international conference on Supercomputing, p.76-83, July 07-11, 1997, Vienna, Austria
Nigel Topham, Antonio González, José González, The design and performance of a conflict-avoiding cache, Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture, p.71-80, December 01-03, 1997, Research Triangle Park, North Carolina, United States
David Nagle, Richard Uhlig, Tim Stanley, Stuart Sechrest, Trevor Mudge, Richard Brown, Design tradeoffs for software-managed TLBs, ACM SIGARCH Computer Architecture News, v.21 n.2, p.27-38, May 1993
Richard Uhlig, David Nagle, Tim Stanley, Trevor Mudge, Stuart Sechrest, Richard Brown, Design tradeoffs for software-managed TLBs, ACM Transactions on Computer Systems (TOCS), v.12 n.3, p.175-205, Aug. 1994
Ravi Bhargava, Juan Rubio, Srikanth Kannan, Lizy K. John, David Christie, Leo Klaes, Understanding the impact of X86/NT computing on microarchitecture, Workload characterization of emerging computer applications, Kluwer Academic Publishers, Norwell, MA, 2001
Fang Liu, Fei Guo, Yan Solihin, Seongbeom Kim, Abdulaziz Eker, Characterizing and modeling the behavior of context switch misses, Proceedings of the 17th international conference on Parallel architectures and compilation techniques, October 25-29, 2008, Toronto, Ontario, Canada
REVIEW
Reviewer: Wilfred J. Hansen
Cache memory between main memory and the processor increases both performance and hardware cost. Analysis of the trade-offs for proposed hardware is difficult and is usually done by simulation driven from some imagined or measured workload. In t