ACM Home Page
Please provide us with feedback. Feedback
Data and memory optimization techniques for embedded systems
Full text PdfPdf (340 KB)
Source ACM Transactions on Design Automation of Electronic Systems (TODAES) archive
Volume 6 ,  Issue 2  (April 2001) table of contents
Pages: 149 - 206  
Year of Publication: 2001
ISSN:1084-4309
Authors
P. R. Panda  Synopsys, Inc., 700 E. Middlefield Rd., Mountain View, CA
F. Catthoor  Inter-University Microelectronics Centre and Katholieke Universiteit Leuven, Kapeldreef 75, Leuven, Belgium
N. D. Dutt  Center for Embedded Computer Systems, University of California at Irvine, Irvine, CA
K. Danckaert  Inter-University Microelectronics Centre, Kapeldreef 75, Leuven, Belgium
E. Brockmeyer  Inter-University Microelectronics Centre, Kapeldreef 75, Leuven, Belgium
C. Kulkarni  Inter-University Microelectronics Centre, Kapeldreef 75, Leuven, Belgium
A. Vandercappelle  Inter-University Microelectronics Centre, Kapeldreef 75, Leuven, Belgium
P. G. Kjeldsberg  Norwegian University of Science and Technology, Trondheim, Norway
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 46,   Downloads (12 Months): 554,   Citation Count: 82
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/375977.375978
What is a DOI?

ABSTRACT

We present a survey of the state-of-the-art techniques used in performing data and memory-related optimizations in embedded systems. The optimizations are targeted directly or indirectly at the memory subsystem, and impact one or more out of three important cost metrics: area, performance, and power dissipation of the resulting implementation. We first examine architecture-independent optimizations in the form of code transoformations. We next cover a broad spectrum of optimization techniques that address memory architectures at varying levels of granularity, ranging from register files to on-chip memory, data caches, and dynamic memory (DRAM). We end with memory addressing related issues.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
AHMAD,I.AND CHEN, C. Y. R. 1991. Post-processor for data path synthesis using multiport memories. In Proceedings of the IEEE/ACM International Conference on Computer-Aided Design (ICCAD '91, Santa Clara, CA, Nov. 11-14). IEEE Computer Society Press, Los Alamitos, CA, 276-279.
 
3
 
4
AMARASINGHE, S., ANDERSON, J., LAM, M., AND TSENG, C.-W. 1995. An overview of the suif compiler for scalable parallel machines. In Proceedings of the SIAM Conference on Parallel Processing for Scientific Computing (San Francisco, CA, Feb.). SIAM, Philadelphia, PA.
 
5
 
6
 
7
BALAKRISHNAN, M., BANERJI,D.K.,MAJUMDAR,A.K.,LINDERS,J.G.,AND MAJITHIA,J. C. 1990. Allocation of multiport memories in data path synthesis. IEEE Trans. Comput.-Aided Des. 7, 4 (Apr.), 536-540.
 
8
 
9
 
10
 
11
 
12
BANERJEE, U., EIGENMANN, R., NICOLAU, A., AND PADUA, D. A. 1993. Automatic program parallelization. Proc. IEEE 81, 2 (Feb.), 211-243.
 
13
14
 
15
 
16
17
18
 
19
 
20
CATTHOOR, F., DANCKAERT, K., KULKARNI, C., AND OMNES, T. 2000. Data transfer and storage architecture issues and exploration in multimedia processors. In Programmable Digital Signal Processors: Architecture, Programming, and Applications, Y. H. Yu, Ed. Marcel Dekker, Inc., New York, NY.
 
21
CATTHOOR, F., JANSSEN, M., NACHTERGAELE, L., AND MAN, H. D. 1996. System-level dataflow transformations for power reduction in image and video processing. In Proceedings of the International Conference on Electronic Circuits and Systems on Electronic Circuits and Systems (Oct.). 1025-1028.
 
22
 
23
CATTHOOR, F., FRANSSEN, F., WUYTACK, S., NACHTERGAELE, L., AND DE MAN, H. 1994. Global communication and memory optimizing transformations for low power systems. In Proceed-ings of the International Workshop on Low Power Design. 203-208.
 
24
CHAITIN, G., AUSLANDER, M., CHANDRA, A., COCKE, J., HOPKINS, M., AND MARKSTEIN, P. 1981. Register allocation via coloring. Comput. Lang. 6, 1, 47-57.
25
 
26
27
28
29
30
 
31
 
32
DANCKAERT, K., CATTHOOR, F., AND MAN, H. D. 1999. Platform independent data transfer and storage exploration illustrated on a parallel cavity detection algorithm. In Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA99). 1669-1675.
33
 
34
 
35
36
 
37
 
38
DE GREEF,E.AND CATTHOOR, F. 1996. Reducing storage size for static control programs mapped onto parallel architectures. In Proceedings of the Dagstuhl Seminar on Loop Parallelisation (Schloss Dagstuhl, Germany, Apr.).
 
39
FEAUTRIER, P. 1991. Dataflow analysis of array and scalar references. Int. J. Parallel Program. 20, 1, 23-53.
 
40
 
41
 
42
FRANSSEN, F., BALASA, F., VAN SWAAIJ, M., CATTHOOR, F., AND MAN, H. D. 1993. Modeling multi-dimensional data and control flow. IEEE Trans. Very Large Scale Integr. Syst. 1,3 (Sept.), 319-327.
 
43
FRANSSEN, F., NACHTERGAELE, L., SAMSOM, H., CATTHOOR, F., AND MAN, H. D. 1994. Control flow optimization for fast system simulation and storage minimization. In Proceedings of the International Conference on Design and Test (Paris, Feb.). 20-24.
 
44
 
45
 
46
GHEZ, C., MIRANDA, M., VANDECAPPELLE, A., CATTHOOR, F., AND VERKEST, D. 2000. Systematic high-level address code transformations for piece-wise linear indexing: illustration on a medical imaging algorithm. In Proceedings of the IEEE Workshop on Signal Processing Systems (Lafayette, LA, Oct.). IEEE Press, Piscataway, NJ, 623-632.
47
48
 
49
 
50
GRANT, D., DENYER,P.B.,AND FINLAY, I. 1989. Synthesis of address generators. In Proceedings of the IEEE/ACM International Conference on Computer-Aided Design (ICCAD '89, Santa Clara, CA, Nov.). ACM Press, New York, NY, 116-119.
 
51
GRANT,D.M.,MEERBERGEN,J.V.,AND LIPPENS, P. E. R. 1994. Optimization of address generator hardware. In Proceedings of the 1994 Conference on European Design and Test (Paris, France, Feb.). 325-329.
 
52
 
53
 
54
55
 
56
 
57
 
58
59
60
 
61
HALAMBI, A., GRUN, P., TOMIYAMA, H., DUTT, N., AND NICOLAU, A. 1999b. Automatic software toolkit generation for embedded systems-on-chip. In Proceedings of the Conference on ICVC.
62
 
63
 
64
65
 
66
ISO/IEC MOVING PICTURE EXPERTS GROUP. 2001. The MPEG Home Page (http://www.cselt.it/ mpeg/)11.
 
67
ITOH, K., SASAKI, K., AND NAKAGOME, Y. 1995. Trends in low-power RAM circuit technologies. Proc. IEEE 83, 4 (Apr.), 524-543.
 
68
69
70
 
71
 
72
 
73
KHARE, A., PANDA,P.R.,DUTT,N.D.,AND NICOLAU, A. 1999. High-level synthesis with SDRAMs and RAMBUS DRAMs. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. E82-A, 11 (Nov.), 2347-2355.
74
 
75
KIROVSKI, D., LEE, C., POTKONJAK, M., AND MANGIONE-SMITH, W. 1999. Application-driven synthesis of memory-intensive systems-on-chip. IEEE Trans. Comput.-Aided Des. 18,9 (Sept.), 1316-1326.
 
76
77
 
78
79
 
80
 
81
 
82
 
83
KULKARNI,D.AND STUMM, M. 1995. Linear loop transformations in optimizing compilers for parallel machines. Aust. Comput. J. 27, 2 (May), 41-50.
84
85
 
86
 
87
 
88
89
90
91
92
93
 
94
MANJIAKIAN,N.AND ABDELRAHMAN, T. 1995. Fusion of loops for parallelism and locality. Tech. Rep. CSRI-315. Dept. of Computer Science, University of Toronto, Toronto, Ont., Canada.
 
95
MASSELOS, K., CATTHOOR, F., GOUTIS,C.E.,AND MAN, H. D. 1999a. A performance oriented use methodology of power optimizing code transformations for multimedia applications realized on programmable multimedia processors. In Proceedings of the IEEE Workshop on Signal Processing Systems (Taipeh, Taiwan). IEEE Computer Society Press, Los Alamitos, CA, 261-270.
96
97
 
98
99
 
100
MENG, T., GORDON, B., TSENG, E., AND HUNG, A. 1995. Portable video-on-demand in wireless communication. Proc. IEEE 83, 4 (Apr.), 659-690.
 
101
MIRANDA, M., CATTHOOR, F., AND MAN, H. D. 1994. Address equation optimization and hardware sharing for real-time signal processing applications. In Proceedings of the IEEE Workshop on VLSI Signal Processing VII (La Jolla, CA, Oct. 26-28). IEEE Press, Piscat-away, NJ, 208-217.
 
102
 
103
104
 
105
 
106
NEERACHER,M.AND RUHL, R. 1993. Automatic parallelization of linpack routines on distributed memory parallel processors. In Proceedings of the IEEE International Symposium on Parallel Processing (Newport Beach CA, Apr.). IEEE Computer Society Press, Los Alamitos, CA.
 
107
NICOLAU,A.AND NOVACK, S. 1993. Trailblazing: A hierarchical approach to percolation scheduling. In Proceedings of the International Conference on Parallel Processing: Software (Boca Raton, FL, Aug.). CRC Press, Inc., Boca Raton, FL, 120-124.
108
 
109
 
110
111
 
112
PANDA,P.R.,DUTT,N.D.,AND NICOLAU, A. 1998. Incorporating DRAM access modes into high-level synthesis. IEEE Trans. Comput.-Aided Des. 17, 2 (Feb.), 96-109.
 
113
PANDA,P.R.,DUTT,N.D.,AND NICOLAU, A. 1999a. Local memory exploration and optimization in embedded systems. IEEE Trans. Comput.-Aided Des. 18, 1 (Jan.), 3-13.
 
114
115
 
116
PARHI, K. 1989. Rate-optimal fully-static multiprocessor scheduling of data-flow signal processing programs. In Proceedings of the IEEE International Symposium on Circuits and Systems (Portland, OR, May). IEEE Press, Piscataway, NJ, 1923-1928.
 
117
PASSOS,N.AND SHA, E. 1994. Full parallelism of uniform nested loops by multi-dimensional retiming. In Proceedings of the 1994 International Conference on Parallel Processing (Aug.). CRC Press, Inc., Boca Raton, FL, 130-133.
 
118
 
119
PAUWELS, M., CATTHOOR, F., LANNEER, D., AND MAN, H. D. 1989. Type-handling in bit-true silicon compilation for dsp. In Proceedings of the European Conference on Circuit Theory and Design (Brighton, U.K., Sept.). 166-170.
 
120
 
121
 
122
QUILLERE,F.AND RAJOPADHYE, S. 1998. Optimizing memory usage in the polyhedral mode. In Proceedings of the Conference on Massively Parallel Computer Systems (Apr.).
 
123
RAMACHANDRAN, L., GAJSKI, D., AND CHAIYAKUL, V. 1993. An algorithm for array variable clustering. In Proceedings of the IEEE European Conference on Design Automation (EURO-DAC '93). IEEE Computer Society Press, Los Alamitos, CA.
124
 
125
 
126
127
128
 
129
 
130
131
 
132
SHIUE, W.-T., TADAS, S., AND CHAKRABARTI, C. 2000. Low power multi-module, multiport memory design for embedded systems. In Proceedings of the IEEE Workshop on Signal Processing Systems (Lafayette, LA, Oct.). IEEE Press, Piscataway, NJ, 529-538.
 
133
 
134
 
135
STOK,L.AND JESS, J. A. G. 1992. Foreground memory management in data path synthesis. Int. J. Circuits Theor. Appl. 20, 3, 235-255.
136
137
 
138
SYNOPSYS INC. 1997. Behavioral Compiler User Guide. Synopsys Inc, Mountain View, CA.
 
139
THIELE, L. 1989. On the design of piecewise regular processor arrays. In Proceedings of the IEEE International Symposium on Circuits and Systems (Portland, OR, May). IEEE Press, Piscataway, NJ, 2239-2242.
 
140
TOMIYAMA, H., HALAMB, A., GRUN, P., DUTT, N., AND NICOLAU, A. 1999. Architecture description languages for systems-on-chip design. In Proceedings of the 6th Asia Pacific Conference on Chip Design Languages (Fukuoka, Japan, Oct.). 109-116.
 
141
 
142
143
 
144
TSENG,C.AND SIEWIOREK, D. P. 1986. Automated synthesis of data paths in digital systems. IEEE Trans. Comput.-Aided Des. 5, 3 (July), 379-395.
145
 
146
VERBAUWHEDE, I., CATTHOOR, F., VANDEWALLE, J., AND MAN, H. D. 1989. Background memory management for the synthesis of algebraic algorithms on multi-processor dsp chips. In Proceedings of the IFIP 1989 International Conference on VLSI (IFIP VLSI '89, Munich, Aug.). IFIP, 209-218.
147
 
148
VERHAEGH, W., LIPPENS, P., AARTS, E., KORST, J., VAN MEERBERGEN, J., AND VAN DER WERF,A. 1995. Improved force-directed scheduling in high-throughput digital signal processing. IEEE Trans. Comput.-Aided Des. 14, 8 (Aug.), 945-960.
 
149
 
150
 
151
 
152
WOLFE, M. 1991. The tiny loop restructuring tool. In Proceedings of the 1991 International Conference on Parallel Processing (Aug.).
 
153
 
154
 
155
WUYTACK, S., DA SILVA,J.L.,CATTHOOR, F., JONG,G.D.,AND YKMAN-COUVREU, C. 1999b. Memory management for embedded network applications. IEEE Trans. Comput.-Aided Des. 18, 5 (May), 533-544.
 
156
 
157
158

CITED BY  82

Collaborative Colleagues:
P. R. Panda: colleagues
F. Catthoor: colleagues
N. D. Dutt: colleagues
K. Danckaert: colleagues
E. Brockmeyer: colleagues
C. Kulkarni: colleagues
A. Vandercappelle: colleagues
P. G. Kjeldsberg: colleagues