| Enhanced code density of embedded CISC processors with echo technology |
| Full text |
Pdf
(88 KB)
|
| Source
|
International Conference on Hardware Software Codesign
archive
Proceedings of the 3rd IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis
table of contents
Jersey City, NJ, USA
SESSION: Accelerating applications through customized instruction sets
table of contents
Pages: 160 - 165
Year of Publication: 2005
ISBN:1-59593-161-9
|
|
Authors
|
|
Youfeng Wu
|
Intel Labs, Clara, CA
|
|
Mauricio Breternitz, Jr.
|
Intel Labs, Clara, CA
|
|
Herbert Hum
|
Intel Labs, Clara, CA
|
|
Ramesh Peri
|
Intel Labs, Clara, CA
|
|
Jay Pickett
|
Intel Labs, Clara, CA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 2, Downloads (12 Months): 25, Citation Count: 0
|
|
|
ABSTRACT
Code density is an important issue in memory constrained systems. Some RISC processor, e.g. the THUMB extension in the ARM processor, supports aggressive code size reduction even at the cost of significant performance loss. In this paper, we develop an algorithm that utilizes a set of novel variable length Echo instructions and evaluate its effectiveness for IA32 binaries. Our experiments show that IA32 processor equipped with Echo instructions is capable of achieving a similar code density as the THUMB extension in the ARM instruction set with significantly lower performance penalty.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
ARM website, http://www.arm.com/products/CPUs/ archi-thumb2. html, 2004
|
| |
2
|
P. Brisk and M. Sarrafzadeh, "Framework and Design Methodology of a Compiler that Compresses Code using Echo Instructions," ODES-2, in conjunction with CGO04, March 21, 2004
|
 |
3
|
|
| |
4
|
GCC Code-Size Benchmark Environment (CSiBE), http://sed.inf.u-szeged.hu/csibe/, 2004
|
 |
5
|
|
 |
6
|
|
| |
7
|
C. Fraser. "An instruction for direct interpretation of LZ77-compressed programs," Microsoft Technical Report MSR-TR-2002-90. ftp://ftp.research.microsoft.com/pub/tr/tr-2002-90.pdf.
|
| |
8
|
Embedded Intel® Architecture at http://www.intel.com/products/embedded/index.htm
|
 |
9
|
|
| |
10
|
K. Kunchithapadam and J. Larus. "Using lightweight procedures to improve instruction cache performance," CS-TR-99-1390, University of Wisconsin, 1999.
|
| |
11
|
|
 |
12
|
Jeremy Lau , Stefan Schoenmackers , Timothy Sherwood , Brad Calder, Reducing code size with echo instructions, Proceedings of the 2003 international conference on Compilers, architecture and synthesis for embedded systems, October 30-November 01, 2003, San Jose, California, USA
[doi> 10.1145/951710.951724]
|
| |
13
|
|
| |
14
|
C.G. Nevill-Manning, and I.H. Witten, "Identifying Hierarchical Structure in Sequences: A linear-time algorithm," Journal of Artificial Intelligence Research, 7, 67--82. (1997)
|
 |
15
|
|
| |
16
|
M. Nanja and J. D. Munter, "The Effects of Compiler Optimizations and Mixed Mode Code on Application Performance, Memory Footprint, Power, and Energy Consumption for Embedded Systems," Submitted to CTCES04.
|
| |
17
|
National Geode x86 "appliance-on-chip" SOCs, http: //www.linuxdevices.com/products/PD6094486551.html
|
| |
18
|
E. Ukkonen, "On-line construction of suffix trees," Algorithmica, Sept 1995.
|
| |
19
|
Via Embedded Platforms at http://www.viaembedded.com/index.jsp.
|
| |
20
|
J. Ziv and A Lempel, "A Universal Algorithm for Sequential Data Compression," IEEE Transaction on Information Theory, 23 (3), p337--343, May 1977.
|
|