ACM Home Page
Please provide us with feedback. Feedback
Memory data organization for improved cache performance in embedded processor applications
Full text PdfPdf (685 KB)
Source ACM Transactions on Design Automation of Electronic Systems (TODAES) archive
Volume 2 ,  Issue 4  (October 1997) table of contents
Pages: 384 - 409  
Year of Publication: 1997
ISSN:1084-4309
Authors
Preeti Ranjan Panda  Univ. of California at Irvine, Irvine
Nikil D. Dutt  Univ. of California at Irvine, Irvine
Alexandru Nicolau  Univ. of California at Irvine, Irvine
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 81,   Citation Count: 25
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/268424.268464
What is a DOI?

ABSTRACT

Code generation for embedded processors opens up the possibility for several performance optimization techniques that have been ignored by traditional compilers due to compilation time constraints. We present techniques that take into account the parameters of the data caches for organizing scalar and array variables declared in embedded code into memory, with the objective of improving data cache performance. We present techniques for clustering variables to minimize compulsory cache misses, and for solving the memory assignment problem to minimize conflict cache misses. Our experiments with benchmark code kernels from DSP and other domains on the CW4001 embedded processor from LSI Logic indicate significant improvements in data cache performance by the application of our memory organization technique.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
 
4
ARAUJO, G., DEVADAS, S., KEUTZER, K., LIAO, S., MALIK, S., SUDARSANAM, A., TJIANG, S., AND WANG, A. 1995. Challenges in code generation for embedded systems. In Code Generation for Embedded Processors, P. Marwedel and G. Goosens, Eds., Kluwer Academic, 48-64.
 
5
6
 
7
 
8
9
 
10
 
11
 
12
 
13
GOOSENS, G., RABAEY, J., VANDEWALLE, J., AND MAN, H. D. 1990. An efficient microcode compiler for application specific DSP processors. IEEE Trans. CAD/ICAS 9, 9 (Sept.), 925-937.
 
14
15
16
 
17
 
18
LANNEER, D., PRAET,J.V.,KIFLI, A., SCHOOFS, K., GEURTS, W., THOEN, F., AND GOOSENS,G. 1995. Chess: Retargetable code generation for embedded DSP processors. In Code Generation for Embedded Processors, P. Marwedel and G. Goosens, Eds., Kluwer Academic, 65-84.
 
19
20
 
21
LIEM, C., MAY, T., AND PAULIN, P. 1994. Instruction-set matching and selection for DSP and ASIP code generation. In Proceedings of the European Design and Test Conference (March), 31-37.
 
22
23
24
 
25
 
26
 
27
PAULIN, P., LIEM, C., MAY, T., AND SUTARWALA, S. 1995. Flexware: A flexible firmware development environment for embedded systems. In Code Generation for Embedded Processors, P. Marwedel and G. Goosens, Eds., Kluwer Academic, 65-84.
 
28
RAWAT, J. 1993. Static analysis of cache performance for real-time programming. Tech. Rep., Iowa State University.
 
29
SCHENK, W. 1995. Retargetable code generation for parallel, pipelined processor structures. In Code Generation for Embedded Processors, P. Marwedel and G. Goosens, Eds., Kluwer Academic, 119-135.
 
30
 
31
32
 
33
YAMADA, Y., JOHNSON,T.L.,HAAB, G., GYLLENHAAL,J.C.,AND HWU, W. W. 1995. Reducing cache misses in numerical applications using data relocation and prefetching. Tech. Rep. CRHC-95-04, University of Illinois, Urbana.

CITED BY  25

Collaborative Colleagues:
Preeti Ranjan Panda: colleagues
Nikil D. Dutt: colleagues
Alexandru Nicolau: colleagues