| Instruction trace compression for rapid instruction cache simulation |
| Full text |
Pdf
(187 KB)
|
| Source
|
Design, Automation, and Test in Europe
archive
Proceedings of the conference on Design, automation and test in Europe
table of contents
Nice, France
SESSION: Advanced techniques for embedded processors design
table of contents
Pages: 803 - 808
Year of Publication: 2007
ISBN:978-3-9810801-2-4
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
EDA Consortium
San Jose, CA, USA
|
| Bibliometrics |
Downloads (6 Weeks): 5, Downloads (12 Months): 47, Citation Count: 1
|
|
|
ABSTRACT
Modern Application Specific Instruction Set Processors (ASIPs) have customizable caches, where the size, associativity and line size can all be customized to suit a particular application. To find the best cache size suited for a particular embedded system, the application(s) is/are executed, traces obtained, and caches simulated. Typically, program trace files can range from a few megabytes to several gigabytes. Simulation of cache performance using large program trace files is a time consuming process. In this paper, a novel instruction cache simulation methodology that can operate directly on a compressed program trace file without the need for decompression is presented. This feature allowed our simulation methodology to have an average speed up of 9.67 times compared to the existing state of the art tool (Dinero IV cache simulator), for a range of applications from the Mediabench suite.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Xtensa Processor, http://www.tensilica.com.
|
| |
2
|
Seagate Technology, http://www.seagate.com/docs/pdf/marketing/po-db35.pdf
|
 |
3
|
Haris Lekatsas , Jörg Henkel , Wayne Wolf, Code compression for low power embedded system design, Proceedings of the 37th conference on Design automation, p.294-299, June 05-09, 2000, Los Angeles, California, United States
[doi> 10.1145/337292.337423]
|
| |
4
|
J. Ziv and A. Lempel, "A Universal Algorithm for Sequential Data Compression," IEEE Trans. on Information Theory, v. 23, pp. 337 -- 343, May 1977.
|
| |
5
|
J. Ziv and A. Lempel, "Compression of Individual Sequences via Variable-Rate Coding," IEEE Trans. on Information Theory, v. 24, n. 5, Sept. 1978.
|
| |
6
|
T. A. Welch, "A Technique for High-Performance Data Compression," Computer, v. 17, n. 6, June 1984, pp. 8 -- 19.
|
 |
7
|
|
| |
8
|
E. E. Johnson and J. Ha, "PDATS: Lossless address space compression for reducing file size and access time," IEEE International Phoenix Conference on Computers and Communication, 1994.
|
| |
9
|
|
| |
10
|
C. G. Nevill-Manning and I. H. Witten, "Identifying Hierarchical Structure in Sequences: A linear-time algorithm," JAIR," pp. 67--82, 1997.
|
 |
11
|
|
| |
12
|
A. Milenkovic and M. Milenkovic, "Exploiting Streams in Instruction and Data Address Trace Compression," IEEE WWC, pp. 99--107, 2003.
|
 |
13
|
|
| |
14
|
|
| |
15
|
|
 |
16
|
|
| |
17
|
Chunho Lee , Miodrag Potkonjak , William H. Mangione-Smith, MediaBench: a tool for evaluating and synthesizing multimedia and communicatons systems, Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture, p.330-335, December 01-03, 1997, Research Triangle Park, North Carolina, United States
|
| |
18
|
J. Edler and M. D. Hill, "Dinero IV Trace-Driven Uniprocessor Cache Simulator," http://www.cs.wisc.edu/~markhill/DineroIV/.
|
 |
19
|
|
| |
20
|
|
| |
21
|
|
| |
22
|
|
|