Reducing cache misses by application-specific re-configurable indexing
Source
International Conference on Computer Aided Design
Proceedings of the 2004 IEEE/ACM International Conference on Computer-Aided Design
Pages: 125 - 130
Year of Publication: 2004
ISBN: 0-7803-8702-3
Authors
K. Patel
Politecnico di Torino, Italy

E. Macii
Politecnico di Torino, Italy

L. Benini
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA

M. Poncino
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., PA, USA
Publisher
IEEE Computer Society
Washington, DC, USA
ABSTRACT
The predictability of memory access patterns in embedded systems can be successfully exploited to devise effective application-specific cache optimizations. In this work, we propose an improved indexing scheme for direct-mapped caches, which drastically reduces the number of conflict misses by using application-specific information; the scheme is based on the selection of a subset of the address bits. With respect to similar approaches, our solution has two main strengths. First, it models the misses analytically by building a miss equation, and exploits a symbolic algorithm to compute the exact optimum solution (i.e., the subset of address bits to be used as cache index that minimizes conflict misses). Second, we designed a re-configurable bit selector, which can be programmed at run-time to fit the optimal cache indexing to a given application. Results show an average reduction of conflict misses of 24%, measured over a set of standard benchmarks, and for different cache configurations.
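The bit-selection idea in the abstract can be illustrated with a small Python sketch. It simulates a direct-mapped cache whose index is built from an arbitrary subset of address bits and then picks the subset with the fewest misses. This is only an illustration under stated assumptions: it uses a brute-force search over all subsets, not the symbolic miss-equation algorithm the paper describes, and the function names, the toy trace, and the use of block addresses (block offset already removed) are all hypothetical. The counts include compulsory misses as well as conflict misses.

```python
from itertools import combinations


def index_of(addr, bits):
    """Build a cache index by concatenating the selected address bits."""
    idx = 0
    for pos, b in enumerate(bits):
        idx |= ((addr >> b) & 1) << pos
    return idx


def count_misses(trace, bits):
    """Simulate a direct-mapped cache indexed by the given address bits.

    Assumption: `trace` holds block addresses. The full block address is
    kept as the tag, so any subset of bits is a valid index.
    """
    lines = {}  # index -> block address currently stored in that line
    misses = 0
    for addr in trace:
        idx = index_of(addr, bits)
        if lines.get(idx) != addr:
            misses += 1
            lines[idx] = addr
    return misses


def best_index_bits(trace, addr_width, index_width):
    """Exhaustively try every subset of `index_width` bits (brute force)."""
    best = None
    for bits in combinations(range(addr_width), index_width):
        m = count_misses(trace, bits)
        if best is None or m < best[1]:
            best = (bits, m)
    return best


# Hypothetical trace: two blocks that differ only in bit 4, accessed alternately.
trace = [0b00000, 0b10000] * 8

# Conventional indexing with the two low-order bits: both blocks map to line 0.
conventional = count_misses(trace, (0, 1))

# Application-specific indexing: search for a better pair of bits.
bits, misses = best_index_bits(trace, addr_width=5, index_width=2)
print(conventional, bits, misses)
```

On this toy trace the low-order index makes every access a miss, while any index that includes bit 4 leaves only the two compulsory misses. The exhaustive search grows combinatorially with the address width, which is one reason an analytical formulation such as the one described in the abstract is attractive.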