| Analytical cache models with applications to cache partitioning |
| Full text |
Pdf
(854 KB)
|
| Source
|
International Conference on Supercomputing
archive
Proceedings of the 15th international conference on Supercomputing
table of contents
Sorrento, Italy
Pages: 1 - 12
Year of Publication: 2001
ISBN:1-58113-410-X
|
|
Authors
|
|
G. Edward Suh
|
Laboratory for Computer Science, Massachusetts Institute of Technology Cambridge, MA
|
|
Srinivas Devadas
|
Laboratory for Computer Science, Massachusetts Institute of Technology Cambridge, MA
|
|
Larry Rudolph
|
Laboratory for Computer Science, Massachusetts Institute of Technology Cambridge, MA
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 4, Downloads (12 Months): 80, Citation Count: 16
|
|
|
ABSTRACT
An accurate, tractable, analytic cache model for time-shared systems is presented, which estimates the overall cache miss-rate of a multiprocessing system with any cache size and time quanta. The input to the model consists of the isolated miss-rate curves for each process, the time quanta for each of the executing processes, and the total cache size. The output is the overall miss-rate. Trace-driven simulations demonstrate that the estimated miss-rate is very accurate. Since the model provides a fast and accurate way to estimate the effect of context switching, it is useful for both understanding the effect of context switching on caches and optimizing cache performance for time-shared systems. A cache partitioning mechanism is also presented and is shown to improve the cache miss-rate up to 25% over the normal LRU replacement policy.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
Cumpaq. Cumpaq A1phaStatiun family.
|
| |
3
|
|
| |
4
|
Susan J. Eggers , Joel S. Emer , Henry M. Levy , Jack L. Lo , Rebecca L. Stamm , Dean M. Tullsen, Simultaneous Multithreading: A Platform for Next-Generation Processors, IEEE Micro, v.17 n.5, p.12-19, September 1997
[doi> 10.1109/40.621209]
|
| |
5
|
C. Freeburit. The hewlett packard PA-RISC 8500 processor. TechiticM report, Itewlett Packard Laboratories, Oct. 1998.
|
 |
6
|
Antonio González , Mateo Valero , Nigel Topham , Joan M. Parcerisa, Eliminating cache conflict misses through XOR-based placement functions, Proceedings of the 11th international conference on Supercomputing, p.76-83, July 07-11, 1997, Vienna, Austria
[doi> 10.1145/263580.263599]
|
| |
7
|
J. L. Heititiitg. SPEC CPU2000: Measuriitg CPU performaitce iit the Itew Itfilleititium. IEEE Computer, July 2000.
|
 |
8
|
|
| |
9
|
D. B. Kirk. Process depeitdeitt static cache partitioitiitg for reM-time systems. IIt Real-Time Systems Symposium, 1988.
|
| |
10
|
|
 |
11
|
Jack L. Lo , Joel S. Emer , Henry M. Levy , Rebecca L. Stamm , Dean M. Tullsen , S. J. Eggers, Converting thread-level parallelism to instruction-level parallelism via simultaneous multithreading, ACM Transactions on Computer Systems (TOCS), v.15 n.3, p.322-354, Aug. 1997
[doi> 10.1145/263326.263382]
|
| |
12
|
P. Magitussoit aitd B. Weriter. Efficieitt memory simulatioit iit SimICS. lit 28th Annual Simulation Symposium, 1995.
|
| |
13
|
MIPS Techitologies, IItc. MIPS RIO000 Microprocessor" User's Manual, 1996.
|
 |
14
|
|
| |
15
|
J. Muoz. Data-Intensive Systems Benchmark Suite Analysis and Specification. http://www.aec.com/projectweb/dis, Juite 1999.
|
| |
16
|
|
| |
17
|
|
| |
18
|
|
| |
19
|
G. E. Suh aitd L. Rudolph. Set-associative cache models for time-shared systems. Techitical Report CSG Memo 433, Massachusetts IItstitute of Techitology, 2001.
|
 |
20
|
|
| |
21
|
|
| |
22
|
|
 |
23
|
Josep Torrellas , Andrew Tucker , Anoop Gupta, Benefits of cache-affinity scheduling in shared-memory multiprocessors: a summary, Proceedings of the 1993 ACM SIGMETRICS conference on Measurement and modeling of computer systems, p.272-274, May 10-14, 1993, Santa Clara, California, United States
|
 |
24
|
|
 |
25
|
|
| |
26
|
Marco Zagha , Brond Larson , Steve Turner , Marty Itzkowitz, Performance analysis using the MIPS R10000 performance counters, Proceedings of the 1996 ACM/IEEE conference on Supercomputing (CDROM), p.16-es, January 01-01, 1996, Pittsburgh, Pennsylvania, United States
[doi> 10.1145/369028.369059]
|
CITED BY 16
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Lisa R. Hsu , Steven K. Reinhardt , Ravishankar Iyer , Srihari Makineni, Communist, utilitarian, and capitalist cache policies on CMPs: caches as a shared resource, Proceedings of the 15th international conference on Parallel architectures and compilation techniques, September 16-20, 2006, Seattle, Washington, USA
|
|
|
|
|
|
Shimin Chen , Phillip B. Gibbons , Michael Kozuch , Vasileios Liaskovitis , Anastassia Ailamaki , Guy E. Blelloch , Babak Falsafi , Limor Fix , Nikos Hardavellas , Todd C. Mowry , Chris Wilkerson, Scheduling threads for constructive cache sharing on CMPs, Proceedings of the nineteenth annual ACM symposium on Parallel algorithms and architectures, June 09-11, 2007, San Diego, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Fang Liu , Fei Guo , Yan Solihin , Seongbeom Kim , Abdulaziz Eker, Characterizing and modeling the behavior of context switch misses, Proceedings of the 17th international conference on Parallel architectures and compilation techniques, October 25-29, 2008, Toronto, Ontario, Canada
|
|
|
|
|
|
|
|