| Automated cache optimizations using CME driven diagnosis |
| Full text |
Pdf
(1.11 MB)
|
| Source
|
International Conference on Supercomputing
archive
Proceedings of the 14th international conference on Supercomputing
table of contents
Santa Fe, New Mexico, United States
Pages: 316 - 326
Year of Publication: 2000
ISBN:1-58113-270-0
|
|
Authors
|
|
Somnath Ghosh
|
Department of Electrical Engineering, Princeton University
|
|
Margaret Martonosi
|
Department of Electrical Engineering, Princeton University
|
|
Sharad Malik
|
Department of Electrical Engineering, Princeton University
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 1, Downloads (12 Months): 15, Citation Count: 4
|
|
|
ABSTRACT
Demonstrating our framework on a collection of scientific loop nests, we were able to reduce an average of 84% of cache misses in the optimizable loop nests. This work lays the groundwork for handling a wide range of optimizations through further study of solution patterns in the CME solution table.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
David F. Bacon , Jyh-Herng Chow , Dz-ching R. Ju , Kalyan Muthukumar , Vivek Sarkar, A compiler framework for restructuring data declarations to enhance cache and TLB effectiveness, Proceedings of the 1994 conference of the Centre for Advanced Studies on Collaborative research, p.3, October 31-November 03, 1994, Toronto, Ontario, Canada
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
 |
5
|
|
 |
6
|
|
| |
7
|
|
| |
8
|
|
 |
9
|
|
 |
10
|
Somnath Ghosh , Margaret Martonosi , Sharad Malik, Precise miss analysis for program transformations with caches of arbitrary associativity, Proceedings of the eighth international conference on Architectural support for programming languages and operating systems, p.228-239, October 02-07, 1998, San Jose, California, United States
|
| |
11
|
|
 |
12
|
|
| |
13
|
M. Kandemir , A. Choudhary , J. Ramanujam , P. Banerjee, Improving locality using loop and data transformations in an integrated framework, Proceedings of the 31st annual ACM/IEEE international symposium on Microarchitecture, p.285-297, November 1998, Dallas, Texas, United States
|
| |
14
|
M. Lain, E. E. Rothberg, and M. E. Wolf. The cache performance of blocked algorithms. In Proceedings of the 4th International Conference on Architectural Support for Programming Languages and Operating Systems, Apr. 1991.
|
| |
15
|
|
 |
16
|
Margaret Martonosi , Anoop Gupta , Thomas Anderson, MemSpy: analyzing memory system bottlenecks in programs, Proceedings of the 1992 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems, p.1-12, June 01-05, 1992, Newport, Rhode Island, United States
|
 |
17
|
|
 |
18
|
|
| |
19
|
M. F. P. O'Boyle and P. M. W. Knijnenburg. Efficient parallelisation using combined loop and data transformations. In Proceedings of the 8th International Workshop on Compilers for Parallel Computers, Jan. 2000.
|
 |
20
|
|
 |
21
|
O. Temam , C. Fricker , W. Jalby, Cache interference phenomena, Proceedings of the 1994 ACM SIGMETRICS conference on Measurement and modeling of computer systems, p.261-271, May 16-20, 1994, Nashville, Tennessee, United States
|
| |
22
|
X. Veto, J. Llosa, A. Gonzalez, and C. Cinraneta. A fast implementation of Cache Miss Equations. In Proceedings of the 8th International Workshop on Compilers for Parallel Computers, Jan. 2000.
|
 |
23
|
|
| |
24
|
|
 |
25
|
|
|