|
ABSTRACT
Despite the numerous optimization and evaluation studies that have been conducted with TLBs over the years, there is still a deficiency in an indepth understanding of TLB characteristics from an application angle. This paper presents a detailed characterization study of the TLB behavior of the SPEC CPU2000 benchmark suite. The contributions of this work are in identifying important application characteristics for TLB studies, quantifying the SPEC2000 application behavior for these characteristics, as well as making pronouncements and suggestions for future research based on these results.Around one-fourth of the SPEC2000 applications (ammp, apsi, galgel, lucas, mcf, twolf and vpr) have significant TLB missrates. Both capacity and associativity are influencing factors on miss-rates, though they do not necessarily go hand-in-hand. Multi-level TLBs are definitely useful for these applications in cutting down access times without significant miss rate degradation. Superpaging to combine TLB entries may not be rewarding for many of these applications. Software management of TLBs in terms of determining what entries to prefetch, what entries to replace, and what entries to pin has a lot of potential to cut down miss rates considerably. Specifically, the potential benefits of prefetching TLB entries is examined, and Distance Prefetching is shown to give good prediction accuracy for these applications.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Thomas E. Anderson , Henry M. Levy , Brian N. Bershad , Edward D. Lazowska, The interaction of architecture and operating system design, Proceedings of the fourth international conference on Architectural support for programming languages and operating systems, p.108-120, April 08-11, 1991, Santa Clara, California, United States
|
 |
2
|
|
| |
3
|
K. Bala, M. F. Kaashoek, and W. E. Weihl. Software Prefetching and Caching for Translation Lookaside Buffers. In Proceedings of the Usenix Symposium on Operating Systems Design and Implementation, pages 243-253, 1994.
|
| |
4
|
D. Burger and T. Austin. The SimpleScalar Toolset, Version 3.0. http://www.simplescalar.org.
|
| |
5
|
J. F. Cantin and M. D. Hill. Cache Performance for Selected SPEC CPU2000 Benchmarks. October 2001. http://www.cs.wise.edu/multifacet/misc/spec2000cache-data/.
|
 |
6
|
|
 |
7
|
|
| |
8
|
S. P. E. Corporation. http://www.spec.org.
|
| |
9
|
|
| |
10
|
|
 |
11
|
Zhen Fang , Lixin Zhang , John Carter , Sally McKee , Wilson Hsieh, Online superpage promotion revisited (poster session), Proceedings of the 2000 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, p.114-115, June 18-21, 2000, Santa Clara, California, United States
|
| |
12
|
|
| |
13
|
|
 |
14
|
|
| |
15
|
|
 |
16
|
Bruce L. Jacob , Trevor N. Mudge, A look at several memory management units, TLB-refill mechanisms, and page table organizations, Proceedings of the eighth international conference on Architectural support for programming languages and operating systems, p.295-306, October 02-07, 1998, San Jose, California, United States
|
| |
17
|
|
| |
18
|
|
| |
19
|
G. B. Kandiraju and A. Sivasubramaniam. Characterizing the d-TLB behavior of the SPEC CPU2000 Benchmarks. Technical Report CSE-01-023, Dept. of Comp. Sci. & Eng., Penn State Univ., August, 2001.
|
 |
20
|
|
| |
21
|
|
| |
22
|
G. McFarland. "CMOS Technology Scaling and Its Impact on Cache Delay". PhD thesis, Computer Science Department, Stanford University, 1997.
|
 |
23
|
David Nagle , Richard Uhlig , Tim Stanley , Stuart Sechrest , Trevor Mudge , Richard Brown, Design tradeoffs for software-managed TLBs, Proceedings of the 20th annual international symposium on Computer architecture, p.27-38, May 16-19, 1993, San Diego, California, United States
|
| |
24
|
A. K. Osowski, J. Flynn, N. Meares, and D. J. Lilja. Adapting the SPEC2000 Benchmark Suite for Simulation-based Computer Architecture Research. Kluwer-Academic Publishers, 2000. (papers from Workshop on Workload Characterization).
|
| |
25
|
|
 |
26
|
|
 |
27
|
|
 |
28
|
|
 |
29
|
|
 |
30
|
|
| |
31
|
|
 |
32
|
|
CITED BY 5
|
|
|
|
|
Juan Navarro , Sitaram Iyer , Peter Druschel , Alan Cox, Practical, transparent operating system support for superpages, Proceedings of the 5th symposium on Operating systems design and implementation Due to copyright restrictions we are not able to make the PDFs for this conference available for downloading, December 09-11, 2002, Boston, Massachusetts
|
|
|
|
|
|
Jinzhan Peng , Guei-Yuan Lueh , Gansha Wu , Xiaogang Gou , Ryan Rakvic, A comprehensive study of hardware/software approaches to improve TLB performance for java applications on embedded systems, Proceedings of the 2006 workshop on Memory system performance and correctness, October 22-22, 2006, San Jose, California
|
|
|
|
|