| Finding optimal L1 cache configuration for embedded systems |
| Full text |
Pdf
(271 KB)
|
| Source
|
Asia and South Pacific Design Automation Conference
archive
Proceedings of the 2006 Asia and South Pacific Design Automation Conference
table of contents
Yokohama, Japan
SESSION: Memory optimization for embedded systems
table of contents
Pages: 796 - 801
Year of Publication: 2006
ISBN:0-7803-9451-8
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
IEEE Press
Piscataway, NJ, USA
|
| Bibliometrics |
Downloads (6 Weeks): 8, Downloads (12 Months): 38, Citation Count: 4
|
|
|
ABSTRACT
Modern embedded system execute a single application or a class of applications repeatedly. A new emerging methodology of designing embedded system utilizes configurable processors where the cache size, associativity, and line size can be chosen by the designer. In this paper, a method is given to rapidly find the L1 cache miss rate of an application. An energy model and an execution time model are developed to find the best cache configuration for the given embedded application. Using benchmarks from Mediabench, we find that our method is on average 45 times faster to explore the design space, compared to Dinero IV while still having 100% accuracy.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Xtensa Processor, http://www.tensilica.com.
|
| |
2
|
J. Montanaro et al., "A 160MHz, 32b, 0.5W CMOS RISC microprocessor," JSSC, vol.31 (11), pp. 1703--1712, 1996.
|
 |
3
|
Joshua J. Pieper , Alain Mellan , JoAnn M. Paul , Donald E. Thomas , Faraydon Karim, High level cache simulation for heterogeneous multiprocessors, Proceedings of the 41st annual conference on Design automation, June 07-11, 2004, San Diego, CA, USA
[doi> 10.1145/996566.996652]
|
 |
4
|
|
 |
5
|
|
 |
6
|
|
| |
7
|
J. Edler and M. D. Hill, "Dinero IV Trace-Driven Uniprocessor Cache Simulator," http://www.cs.wisc.edu/~markhill/DineroIV/.
|
| |
8
|
R. L. Mattson et.al., "Evaluation Techniques for Storage Hierarchies," IBM System Journal, vol.9, no.2, pp. 78--117, 1970.
|
| |
9
|
|
 |
10
|
|
 |
11
|
|
 |
12
|
|
| |
13
|
|
| |
14
|
|
| |
15
|
Chunho Lee , Miodrag Potkonjak , William H. Mangione-Smith, MediaBench: a tool for evaluating and synthesizing multimedia and communicatons systems, Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture, p.330-335, December 01-03, 1997, Research Triangle Park, North Carolina, United States
|
| |
16
|
D. C. Burger and T. M. Austin, "The SimpleScalar tool-set, Version 2.0," Technical Report 1342, Department of Computer Science, UW, June, 1997.
|
| |
17
|
P. Shivakumar and N. P. Jouppi, "Cacti 3.0: An Integrated Cache Timing, Power, and Area Model," Technical Report 2001/2, Compaq Computer Corporation, August, 2001. 2001.
|
| |
18
|
K. Hardee et.al., "A 0.6V 205MHz 19.5ns tRC 16Mb Embedded DRAM," ISSCC, 2004.
|
 |
19
|
|
| |
20
|
|
CITED BY 4
|
|
Andhi Janapsatya , Aleksandar Ignjatovic , Sri Parameswaran , Joerg Henkel, Instruction trace compression for rapid instruction cache simulation, Proceedings of the conference on Design, automation and test in Europe, April 16-20, 2007, Nice, France
|
|
|
Pablo Viana , Ann Gordon-Ross , Edna Barros , Frank Vahid, A table-based method for single-pass cache optimization, Proceedings of the 18th ACM Great Lakes symposium on VLSI, May 04-06, 2008, Orlando, Florida, USA
|
|
|
|
|
|
|
|