|
ABSTRACT
We present a survey of the state-of-the-art techniques used in performing data and memory-related optimizations in embedded systems. The optimizations are targeted directly or indirectly at the memory subsystem, and impact one or more out of three important cost metrics: area, performance, and power dissipation of the resulting implementation.
We first examine architecture-independent optimizations in the form of code transoformations. We next cover a broad spectrum of optimization techniques that address memory architectures at varying levels of granularity, ranging from register files to on-chip memory, data caches, and dynamic memory (DRAM). We end with memory addressing related issues.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
AHMAD,I.AND CHEN, C. Y. R. 1991. Post-processor for data path synthesis using multiport memories. In Proceedings of the IEEE/ACM International Conference on Computer-Aided Design (ICCAD '91, Santa Clara, CA, Nov. 11-14). IEEE Computer Society Press, Los Alamitos, CA, 276-279.
|
| |
3
|
Alfred V. Aho , Ravi Sethi , Jeffrey D. Ullman, Compilers: principles, techniques, and tools, Addison-Wesley Longman Publishing Co., Inc., Boston, MA, 1986
|
| |
4
|
AMARASINGHE, S., ANDERSON, J., LAM, M., AND TSENG, C.-W. 1995. An overview of the suif compiler for scalable parallel machines. In Proceedings of the SIAM Conference on Parallel Processing for Scientific Computing (San Francisco, CA, Feb.). SIAM, Philadelphia, PA.
|
| |
5
|
Raminder S. Bajwa , Mitsuru Hiraki , Hirotsugu Kojima , Douglas J. Gorny , Kenichi Nitta , Avadhani Shridhar , Koichi Seki , Katsuro Sasaki, Instruction buffering to reduce power in processors for signal processing, IEEE Transactions on Very Large Scale Integration (VLSI) Systems, v.5 n.4, p.417-424, Dec. 1997
[doi> 10.1109/92.645068]
|
| |
6
|
|
| |
7
|
BALAKRISHNAN, M., BANERJI,D.K.,MAJUMDAR,A.K.,LINDERS,J.G.,AND MAJITHIA,J. C. 1990. Allocation of multiport memories in data path synthesis. IEEE Trans. Comput.-Aided Des. 7, 4 (Apr.), 536-540.
|
| |
8
|
Florin Balasa , Francky Catthoor , Hugo De Man, Dataflow-driven memory allocation for multi-dimensional signal processing systems, Proceedings of the 1994 IEEE/ACM international conference on Computer-aided design, p.31-34, November 06-10, 1994, San Jose, California, United States
|
| |
9
|
|
| |
10
|
Prithviraj Banerjee , John A. Chandy , Manish Gupta , Eugene W. Hodges IV , John G. Holm , Antonio Lain , Daniel J. Palermo , Shankar Ramaswamy , Ernesto Su, The Paradigm Compiler for Distributed-Memory Multicomputers, Computer, v.28 n.10, p.37-47, October 1995
[doi> 10.1109/2.467577]
|
| |
11
|
|
| |
12
|
BANERJEE, U., EIGENMANN, R., NICOLAU, A., AND PADUA, D. A. 1993. Automatic program parallelization. Proc. IEEE 81, 2 (Feb.), 211-243.
|
| |
13
|
|
 |
14
|
|
| |
15
|
|
| |
16
|
L. Benini , G. De Micheli , E. Macii , D. Sciuto , C. Silvano, Address bus encoding techniques for system-level power optimization, Proceedings of the conference on Design, automation and test in Europe, p.861-867, February 23-26, 1998, Le Palais des Congrés de Paris, France
|
 |
17
|
|
 |
18
|
|
| |
19
|
|
| |
20
|
CATTHOOR, F., DANCKAERT, K., KULKARNI, C., AND OMNES, T. 2000. Data transfer and storage architecture issues and exploration in multimedia processors. In Programmable Digital Signal Processors: Architecture, Programming, and Applications, Y. H. Yu, Ed. Marcel Dekker, Inc., New York, NY.
|
| |
21
|
CATTHOOR, F., JANSSEN, M., NACHTERGAELE, L., AND MAN, H. D. 1996. System-level dataflow transformations for power reduction in image and video processing. In Proceedings of the International Conference on Electronic Circuits and Systems on Electronic Circuits and Systems (Oct.). 1025-1028.
|
| |
22
|
|
| |
23
|
CATTHOOR, F., FRANSSEN, F., WUYTACK, S., NACHTERGAELE, L., AND DE MAN, H. 1994. Global communication and memory optimizing transformations for low power systems. In Proceed-ings of the International Workshop on Low Power Design. 203-208.
|
| |
24
|
CHAITIN, G., AUSLANDER, M., CHANDRA, A., COCKE, J., HOPKINS, M., AND MARKSTEIN, P. 1981. Register allocation via coloring. Comput. Lang. 6, 1, 47-57.
|
 |
25
|
|
| |
26
|
|
 |
27
|
|
 |
28
|
José-Lorenzo Cruz , Antonio González , Mateo Valero , Nigel P. Topham, Multiple-banked register file architectures, Proceedings of the 27th annual international symposium on Computer architecture, p.316-325, June 2000, Vancouver, British Columbia, Canada
|
 |
29
|
Vinodh Cuppu , Bruce Jacob , Brian Davis , Trevor Mudge, A performance comparison of contemporary DRAM architectures, Proceedings of the 26th annual international symposium on Computer architecture, p.222-233, May 01-04, 1999, Atlanta, Georgia, United States
|
 |
30
|
Julio L. da Silva, Jr. , Francky Catthoor , Diederik Verkest , Hugo de Man, Power exploration for dynamic data types through virtual memory management refinement, Proceedings of the 1998 international symposium on Low power electronics and design, p.311-316, August 10-12, 1998, Monterey, California, United States
[doi> 10.1145/280756.280944]
|
| |
31
|
|
| |
32
|
DANCKAERT, K., CATTHOOR, F., AND MAN, H. D. 1999. Platform independent data transfer and storage exploration illustrated on a parallel cavity detection algorithm. In Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA99). 1669-1675.
|
 |
33
|
Koen Danckaert , Francky Catthoor , Hugo De Man, A preprocessing step for global loop transformations for data transfer optimization, Proceedings of the 2000 international conference on Compilers, architecture, and synthesis for embedded systems, p.34-40, November 17-19, 2000, San Jose, California, United States
[doi> 10.1145/354880.354886]
|
| |
34
|
|
| |
35
|
|
 |
36
|
J. Ph. Diguet , S. Wuytack , F. Catthoor , H. De Man, Formalized methodology for data reuse exploration in hierarchical memory mappings, Proceedings of the 1997 international symposium on Low power electronics and design, p.30-35, August 18-20, 1997, Monterey, California, United States
[doi> 10.1145/263272.263278]
|
| |
37
|
|
| |
38
|
DE GREEF,E.AND CATTHOOR, F. 1996. Reducing storage size for static control programs mapped onto parallel architectures. In Proceedings of the Dagstuhl Seminar on Loop Parallelisation (Schloss Dagstuhl, Germany, Apr.).
|
| |
39
|
FEAUTRIER, P. 1991. Dataflow analysis of array and scalar references. Int. J. Parallel Program. 20, 1, 23-53.
|
| |
40
|
|
| |
41
|
|
| |
42
|
FRANSSEN, F., BALASA, F., VAN SWAAIJ, M., CATTHOOR, F., AND MAN, H. D. 1993. Modeling multi-dimensional data and control flow. IEEE Trans. Very Large Scale Integr. Syst. 1,3 (Sept.), 319-327.
|
| |
43
|
FRANSSEN, F., NACHTERGAELE, L., SAMSOM, H., CATTHOOR, F., AND MAN, H. D. 1994. Control flow optimization for fast system simulation and storage minimization. In Proceedings of the International Conference on Design and Test (Paris, Feb.). 20-24.
|
| |
44
|
|
| |
45
|
|
| |
46
|
GHEZ, C., MIRANDA, M., VANDECAPPELLE, A., CATTHOOR, F., AND VERKEST, D. 2000. Systematic high-level address code transformations for piece-wise linear indexing: illustration on a medical imaging algorithm. In Proceedings of the IEEE Workshop on Signal Processing Systems (Lafayette, LA, Oct.). IEEE Press, Piscataway, NJ, 623-632.
|
 |
47
|
|
 |
48
|
G. Goossens , J. Vandewlle , H. De Man, Loop optimization in register-transfer scheduling for DSP-systems, Proceedings of the 26th ACM/IEEE conference on Design automation, p.826-831, June 25-28, 1989, Las Vegas, Nevada, United States
[doi> 10.1145/74382.74384]
|
| |
49
|
|
| |
50
|
GRANT, D., DENYER,P.B.,AND FINLAY, I. 1989. Synthesis of address generators. In Proceedings of the IEEE/ACM International Conference on Computer-Aided Design (ICCAD '89, Santa Clara, CA, Nov.). ACM Press, New York, NY, 116-119.
|
| |
51
|
GRANT,D.M.,MEERBERGEN,J.V.,AND LIPPENS, P. E. R. 1994. Optimization of address generator hardware. In Proceedings of the 1994 Conference on European Design and Test (Paris, France, Feb.). 325-329.
|
| |
52
|
|
| |
53
|
|
| |
54
|
Peter Grun , Florin Balasa , Nikil Dutt, Memory size estimation for multimedia applications, Proceedings of the 6th international workshop on Hardware/software codesign, p.145-149, March 15-18, 1998, Seattle, Washington, United States
|
 |
55
|
Peter Grun , Nikil Dutt , Alex Nicolau, Memory aware compilation through accurate timing extraction, Proceedings of the 37th conference on Design automation, p.316-321, June 05-09, 2000, Los Angeles, California, United States
[doi> 10.1145/337292.337428]
|
| |
56
|
|
| |
57
|
|
| |
58
|
|
 |
59
|
Sumit Gupta , Miguel Miranda , Francky Catthoor , Rajesh Gupta, Analysis of high-level address code transformations for programmable processors, Proceedings of the conference on Design, automation and test in Europe, p.9-13, March 27-30, 2000, Paris, France
[doi> 10.1145/343647.343683]
|
 |
60
|
Ashok Halambi , Peter Grun , Vijay Ganesh , Asheesh Khare , Nikil Dutt , Alex Nicolau, EXPRESSION: a language for architecture exploration through compiler/simulator retargetability, Proceedings of the conference on Design, automation and test in Europe, p.100-es, January 1999, Munich, Germany
[doi> 10.1145/307418.307549]
|
| |
61
|
HALAMBI, A., GRUN, P., TOMIYAMA, H., DUTT, N., AND NICOLAU, A. 1999b. Automatic software toolkit generation for embedded systems-on-chip. In Proceedings of the Conference on ICVC.
|
 |
62
|
Mary W. Hall , Timothy J. Harvey , Ken Kennedy , Nathaniel McIntosh , Kathryn S. McKinley , Jeffrey D. Oldham , Michael H. Paleczny , Gerald Roth, Experiences using the ParaScope Editor: an interactive parallel programming tool, ACM SIGPLAN Notices, v.28 n.7, p.33-43, July 1993
|
| |
63
|
Mary W. Hall , Jennifer M. Anderson , Saman P. Amarasinghe , Brian R. Murphy , Shih-Wei Liao , Edouard Bugnion , Monica S. Lam, Maximizing Multiprocessor Performance with the SUIF Compiler, Computer, v.29 n.12, p.84-89, December 1996
[doi> 10.1109/2.546613]
|
| |
64
|
|
 |
65
|
Chu-Yi Huang , Yen-Shen Chen , Youn-Long Lin , Yu-Chin Hsu, Data path allocation based on bipartite weighted matching, Proceedings of the 27th ACM/IEEE conference on Design automation, p.499-504, June 24-27, 1990, Orlando, Florida, United States
[doi> 10.1145/123186.123350]
|
| |
66
|
ISO/IEC MOVING PICTURE EXPERTS GROUP. 2001. The MPEG Home Page (http://www.cselt.it/ mpeg/)11.
|
| |
67
|
ITOH, K., SASAKI, K., AND NAKAGOME, Y. 1995. Trends in low-power RAM circuit technologies. Proc. IEEE 83, 4 (Apr.), 524-543.
|
| |
68
|
|
 |
69
|
|
 |
70
|
M. Kandemir , N. Vijaykrishnan , M. J. Irwin , W. Ye, Influence of compiler optimizations on system power, Proceedings of the 37th conference on Design automation, p.304-307, June 05-09, 2000, Los Angeles, California, United States
[doi> 10.1145/337292.337425]
|
| |
71
|
|
| |
72
|
|
| |
73
|
KHARE, A., PANDA,P.R.,DUTT,N.D.,AND NICOLAU, A. 1999. High-level synthesis with SDRAMs and RAMBUS DRAMs. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. E82-A, 11 (Nov.), 2347-2355.
|
 |
74
|
|
| |
75
|
KIROVSKI, D., LEE, C., POTKONJAK, M., AND MANGIONE-SMITH, W. 1999. Application-driven synthesis of memory-intensive systems-on-chip. IEEE Trans. Comput.-Aided Des. 18,9 (Sept.), 1316-1326.
|
| |
76
|
|
 |
77
|
|
| |
78
|
|
 |
79
|
David J. Kolson , Alexandru Nicolau , Nikil Dutt, Minimization of memory traffic in high-level synthesis, Proceedings of the 31st annual conference on Design automation, p.149-154, June 06-10, 1994, San Diego, California, United States
[doi> 10.1145/196244.196316]
|
| |
80
|
|
| |
81
|
|
| |
82
|
|
| |
83
|
KULKARNI,D.AND STUMM, M. 1995. Linear loop transformations in optimizing compilers for parallel machines. Aust. Comput. J. 27, 2 (May), 41-50.
|
 |
84
|
|
 |
85
|
Hae-Dong Lee , Sun-Young Hwang, A scheduling algorithm for multiport memory minimization in datapath synthesis, Proceedings of the 1995 conference on Asia Pacific design automation (CD-ROM), p.16-es, August 29-September 01, 1995, Makuhari, Massa, Chiba, Japan
[doi> 10.1145/224818.224847]
|
| |
86
|
|
| |
87
|
|
| |
88
|
|
 |
89
|
|
 |
90
|
|
 |
91
|
Clifford Liem , Pierre Paulin , Ahmed Jerraya, Address calculation for retargetable compilation and exploration of instruction-set architectures, Proceedings of the 33rd annual conference on Design automation, p.597-600, June 03-07, 1996, Las Vegas, Nevada, United States
[doi> 10.1145/240518.240631]
|
 |
92
|
|
 |
93
|
Tai Ly , David Knapp , Ron Miller , Don MacMillen, Scheduling using behavioral templates, Proceedings of the 32nd ACM/IEEE conference on Design automation, p.101-106, June 12-16, 1995, San Francisco, California, United States
[doi> 10.1145/217474.217514]
|
| |
94
|
MANJIAKIAN,N.AND ABDELRAHMAN, T. 1995. Fusion of loops for parallelism and locality. Tech. Rep. CSRI-315. Dept. of Computer Science, University of Toronto, Toronto, Ont., Canada.
|
| |
95
|
MASSELOS, K., CATTHOOR, F., GOUTIS,C.E.,AND MAN, H. D. 1999a. A performance oriented use methodology of power optimizing code transformations for multimedia applications realized on programmable multimedia processors. In Proceedings of the IEEE Workshop on Signal Processing Systems (Taipeh, Taiwan). IEEE Computer Society Press, Los Alamitos, CA, 261-270.
|
 |
96
|
K. Masselos , K. Danckaert , F. Catthoor , C. E. Goutis , H. DeMan, A methodology for power efficient partitioning of data-dominated algorithm specifications within performance constraints, Proceedings of the 1999 international symposium on Low power electronics and design, p.270-272, August 16-17, 1999, San Diego, California, United States
[doi> 10.1145/313817.313946]
|
 |
97
|
|
| |
98
|
|
 |
99
|
|
| |
100
|
MENG, T., GORDON, B., TSENG, E., AND HUNG, A. 1995. Portable video-on-demand in wireless communication. Proc. IEEE 83, 4 (Apr.), 659-690.
|
| |
101
|
MIRANDA, M., CATTHOOR, F., AND MAN, H. D. 1994. Address equation optimization and hardware sharing for real-time signal processing applications. In Proceedings of the IEEE Workshop on VLSI Signal Processing VII (La Jolla, CA, Oct. 26-28). IEEE Press, Piscat-away, NJ, 208-217.
|
| |
102
|
|
| |
103
|
|
 |
104
|
|
| |
105
|
|
| |
106
|
NEERACHER,M.AND RUHL, R. 1993. Automatic parallelization of linpack routines on distributed memory parallel processors. In Proceedings of the IEEE International Symposium on Parallel Processing (Newport Beach CA, Apr.). IEEE Computer Society Press, Los Alamitos, CA.
|
| |
107
|
NICOLAU,A.AND NOVACK, S. 1993. Trailblazing: A hierarchical approach to percolation scheduling. In Proceedings of the International Conference on Parallel Processing: Software (Boca Raton, FL, Aug.). CRC Press, Inc., Boca Raton, FL, 120-124.
|
 |
108
|
|
| |
109
|
|
| |
110
|
|
 |
111
|
|
| |
112
|
PANDA,P.R.,DUTT,N.D.,AND NICOLAU, A. 1998. Incorporating DRAM access modes into high-level synthesis. IEEE Trans. Comput.-Aided Des. 17, 2 (Feb.), 96-109.
|
| |
113
|
PANDA,P.R.,DUTT,N.D.,AND NICOLAU, A. 1999a. Local memory exploration and optimization in embedded systems. IEEE Trans. Comput.-Aided Des. 18, 1 (Jan.), 3-13.
|
| |
114
|
|
 |
115
|
|
| |
116
|
PARHI, K. 1989. Rate-optimal fully-static multiprocessor scheduling of data-flow signal processing programs. In Proceedings of the IEEE International Symposium on Circuits and Systems (Portland, OR, May). IEEE Press, Piscataway, NJ, 1923-1928.
|
| |
117
|
PASSOS,N.AND SHA, E. 1994. Full parallelism of uniform nested loops by multi-dimensional retiming. In Proceedings of the 1994 International Conference on Parallel Processing (Aug.). CRC Press, Inc., Boca Raton, FL, 130-133.
|
| |
118
|
|
| |
119
|
PAUWELS, M., CATTHOOR, F., LANNEER, D., AND MAN, H. D. 1989. Type-handling in bit-true silicon compilation for dsp. In Proceedings of the European Conference on Circuit Theory and Design (Brighton, U.K., Sept.). 166-170.
|
| |
120
|
|
| |
121
|
|
| |
122
|
QUILLERE,F.AND RAJOPADHYE, S. 1998. Optimizing memory usage in the polyhedral mode. In Proceedings of the Conference on Massively Parallel Computer Systems (Apr.).
|
| |
123
|
RAMACHANDRAN, L., GAJSKI, D., AND CHAIYAKUL, V. 1993. An algorithm for array variable clustering. In Proceedings of the IEEE European Conference on Design Automation (EURO-DAC '93). IEEE Computer Society Press, Los Alamitos, CA.
|
 |
124
|
|
| |
125
|
|
| |
126
|
|
 |
127
|
Luc Séméria , Koichi Sato , Giovanni De Micheli, Resolution of dynamic memory allocation and pointers for the behavioral synthesis form C, Proceedings of the conference on Design, automation and test in Europe, p.312-319, March 27-30, 2000, Paris, France
[doi> 10.1145/343647.343788]
|
 |
128
|
Barry Shackleford , Mitsuhiro Yasuda , Etsuko Okushi , Hisao Koizumi , Hiroyuki Tomiyama , Hiroto Yasuura, Memory-CPU size optimization for embedded system designs, Proceedings of the 34th annual conference on Design automation, p.246-251, June 09-13, 1997, Anaheim, California, United States
[doi> 10.1145/266021.266083]
|
| |
129
|
|
| |
130
|
|
 |
131
|
Wen-Tsong Shiue , Chaitali Chakrabarti, Memory exploration for low power, embedded systems, Proceedings of the 36th ACM/IEEE conference on Design automation, p.140-145, June 21-25, 1999, New Orleans, Louisiana, United States
[doi> 10.1145/309847.309902]
|
| |
132
|
SHIUE, W.-T., TADAS, S., AND CHAKRABARTI, C. 2000. Low power multi-module, multiport memory design for embedded systems. In Proceedings of the IEEE Workshop on Signal Processing Systems (Lafayette, LA, Oct.). IEEE Press, Piscataway, NJ, 529-538.
|
| |
133
|
Peter Slock , Sven Wuytack , Francky Catthoor , Gjalt de Jong, Fast and extensive system-level memory exploration for ATM applications, Proceedings of the 10th international symposium on System synthesis, p.74-81, September 17-19, 1997, Antwerp, Belgium
|
| |
134
|
|
| |
135
|
STOK,L.AND JESS, J. A. G. 1992. Foreground memory management in data path synthesis. Int. J. Circuits Theor. Appl. 20, 3, 235-255.
|
 |
136
|
|
 |
137
|
|
| |
138
|
SYNOPSYS INC. 1997. Behavioral Compiler User Guide. Synopsys Inc, Mountain View, CA.
|
| |
139
|
THIELE, L. 1989. On the design of piecewise regular processor arrays. In Proceedings of the IEEE International Symposium on Circuits and Systems (Portland, OR, May). IEEE Press, Piscataway, NJ, 2239-2242.
|
| |
140
|
TOMIYAMA, H., HALAMB, A., GRUN, P., DUTT, N., AND NICOLAU, A. 1999. Architecture description languages for systems-on-chip design. In Proceedings of the 6th Asia Pacific Conference on Chip Design Languages (Fukuoka, Japan, Oct.). 109-116.
|
| |
141
|
H. Tomiyama , T. Ishihara , A. Inoue , H. Yasuura, Instruction scheduling for power reduction in processor-based system design, Proceedings of the conference on Design, automation and test in Europe, p.855-860, February 23-26, 1998, Le Palais des Congrés de Paris, France
|
| |
142
|
|
 |
143
|
|
| |
144
|
TSENG,C.AND SIEWIOREK, D. P. 1986. Automated synthesis of data paths in digital systems. IEEE Trans. Comput.-Aided Des. 5, 3 (July), 379-395.
|
 |
145
|
Arnout Vandecappelle , Miguel Miranda , Erik Brockmeyer , Francky Catthoor , Diederik Verkest, Global multimedia system design exploration using accurate memory organization feedback, Proceedings of the 36th ACM/IEEE conference on Design automation, p.327-332, June 21-25, 1999, New Orleans, Louisiana, United States
[doi> 10.1145/309847.309945]
|
| |
146
|
VERBAUWHEDE, I., CATTHOOR, F., VANDEWALLE, J., AND MAN, H. D. 1989. Background memory management for the synthesis of algebraic algorithms on multi-processor dsp chips. In Proceedings of the IFIP 1989 International Conference on VLSI (IFIP VLSI '89, Munich, Aug.). IFIP, 209-218.
|
 |
147
|
Ingrid M. Verbauwhede , Chris J. Scheers , Jan M. Rabaey, Memory estimation for high level synthesis, Proceedings of the 31st annual conference on Design automation, p.143-148, June 06-10, 1994, San Diego, California, United States
[doi> 10.1145/196244.196313]
|
| |
148
|
VERHAEGH, W., LIPPENS, P., AARTS, E., KORST, J., VAN MEERBERGEN, J., AND VAN DER WERF,A. 1995. Improved force-directed scheduling in high-throughput digital signal processing. IEEE Trans. Comput.-Aided Des. 14, 8 (Aug.), 945-960.
|
| |
149
|
|
| |
150
|
|
| |
151
|
|
| |
152
|
WOLFE, M. 1991. The tiny loop restructuring tool. In Proceedings of the 1991 International Conference on Parallel Processing (Aug.).
|
| |
153
|
|
| |
154
|
|
| |
155
|
WUYTACK, S., DA SILVA,J.L.,CATTHOOR, F., JONG,G.D.,AND YKMAN-COUVREU, C. 1999b. Memory management for embedded network applications. IEEE Trans. Comput.-Aided Des. 18, 5 (May), 533-544.
|
| |
156
|
|
| |
157
|
|
 |
158
|
|
CITED BY 82
|
|
|
|
|
|
|
|
|
|
|
|
|
|
M. Miranda , C. Ghez , C. Kulkarni , F. Catthoor , D. Verkest, Systematic speed-power memory data-layout exploration for cache controlled embedded multimedia applications, Proceedings of the 14th international symposium on Systems synthesis, September 30-October 03, 2001, Montréal, P.Q., Canada
|
|
|
|
|
|
Takanori Okuma , Yun Cao , Masanori Muroyama , Hiroto Yasuura, Reducing access energy of on-chip data memory considering active data bitwidth, Proceedings of the 2002 international symposium on Low power electronics and design, August 12-14, 2002, Monterey, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Chidamber Kulkarni , Matthias Gries , Christian Sauer , Kurt Keutzer, Programming challenges in network processor deployment, Proceedings of the 2003 international conference on Compilers, architecture and synthesis for embedded systems, October 30-November 01, 2003, San Jose, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Hua Wang , Antonis Papanikolaou , Miguel Miranda , Francky Catthoor, A global bus power optimization methodology for physical design of memory dominated systems by coupling bus segmentation and activity driven block placement, Proceedings of the 2004 conference on Asia South Pacific design automation: electronic design and solution fair, p.759-761, January 27-30, 2004, Yokohama, Japan
|
|
|
|
|
|
|
|
|
Gwenolé Corre , Eric Senn , Pierre Bomel , Nathalie Julien , Eric Martin, Memory accesses management during high level synthesis, Proceedings of the 2nd IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis, September 08-10, 2004, Stockholm, Sweden
|
|
|
|
|
|
|
|
|
|
|
|
Bhuvan Middha , Matthew Simpson , Rajeev Barua, MTSS: multi task stack sharing for embedded systems, Proceedings of the 2005 international conference on Compilers, architectures and synthesis for embedded systems, September 24-27, 2005, San Francisco, California, USA
|
|
|
Jungeun Kim , Taewhan Kim, Memory access optimization through combined code scheduling, memory allocation, and array binding in embedded system design, Proceedings of the 42nd annual conference on Design automation, June 13-17, 2005, San Diego, California, USA
|
|
|
|
|
|
Matthew Simpson , Bhuvan Middha , Rajeev Barua, Segment protection for embedded systems using run-time checks, Proceedings of the 2005 international conference on Compilers, architectures and synthesis for embedded systems, September 24-27, 2005, San Francisco, California, USA
|
|
|
Poletti Francesco , Paul Marchal , David Atienza , Luca Benini , Francky Catthoor , Jose M. Mendias, An integrated hardware/software approach for run-time scratchpad management, Proceedings of the 41st annual conference on Design automation, June 07-11, 2004, San Diego, CA, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Nikolaos Kroupis , Nikolaos Zervas , Minas Dasygenis , Konstantinos Tatas , Antonios Argyriou , Dimitrios Soudris , Antonios Thanailakis, Behavioral-Level Performance and Power Exploration of Data-Intensive Applications Mapped on Programmable Processors, Journal of VLSI Signal Processing Systems, v.44 n.1-2, p.153-171, August 2006
|
|
|
|
|
|
|
|
|
Keoncheol Shin , Jungeun Kim , Seonggun Kim , Hwansoo Han, Restructuring field layouts for embedded memory systems, Proceedings of the conference on Design, automation and test in Europe: Proceedings, March 06-10, 2006, Munich, Germany
|
|
|
|
|
|
|
|
|
Stylianos Mamagkakis , Alexandros Bartzas , Georgios Pouiklis , David Atienza , Francky Catthoor , Dimitrios Soudris , Antonios Thanailakis, Systematic methodology for exploration of performance - Energy trade-offs in network applications using Dynamic Data Type refinement, Journal of Systems Architecture: the EUROMICRO Journal, v.53 n.7, p.417-436, July, 2007
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Kristof Denolf , Adrian Chirila-Rus , Paul Schumacher , Robert Turney , Kees Vissers , Diederik Verkest , Henk Corporaal, A systematic approach to design low-power video codec cores, EURASIP Journal on Embedded Systems, v.2007 n.1, p.42-42, January 2007
|
|
|
|
|
|
|
|
|
A. Milidonis , N. Alachiotis , V. Porpodas , H. Michail , A. P. Kakarountas , C. E. Goutis, Interactive presentation: A decoupled architecture of processors with scratch-pad memory hierarchy, Proceedings of the conference on Design, automation and test in Europe, April 16-20, 2007, Nice, France
|
|
|
|
|
|
|
|
|
|
|
|
M. Sanchez-Elez , M. Fernandez , M. Anido , H. Du , N. Bagherzadeh , R. Hermida, Low Energy Data Management for Different On-Chip Memory Levels in Multi-Context Reconfigurable Architectures, Proceedings of the conference on Design, Automation and Test in Europe, p.10036, March 03-07, 2003
|
|
|
|
|
|
David Atienza , Stylianos Mamagkakis , Francky Catthoor , Jose M. Mendias , Dimitris Soudris, Dynamic Memory Management Design Methodology for Reduced Memory Footprint in Multimedia and Wireless Network Applications, Proceedings of the conference on Design, automation and test in Europe, p.10532, February 16-20, 2004
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
F. Balasa , P. G. Kjeldsberg , A. Vandecappelle , M. Palkovic , Q. Hu , H. Zhu , F. Catthoor, Storage Estimation and Design Space Exploration Methodologies for the Memory Management of Signal Processing Applications, Journal of Signal Processing Systems, v.53 n.1-2, p.51-71, November 2008
|
|
|
|
|
|
|
|
|
|
|
|
David Atienza , Praveen Raghavan , José L. Ayala , Giovanni De Micheli , Francky Catthoor , Diederik Verkest , Marisa López-Vallejo, Joint hardware-software leakage minimization approach for the register file of VLIW embedded architectures, Integration, the VLSI Journal, v.41 n.1, p.38-48, January, 2008
|
|
|
Kang Zhao , Jinian Bian , Sheqin Dong , Yang Song , Satoshi Goto, HyMacs: hybrid memory access optimization based on custom-instruction scheduling, Proceedings of the 18th ACM Great Lakes symposium on VLSI, May 04-06, 2008, Orlando, Florida, USA
|
|
|
|
|
|
|
|
|
|
|
|
Muthu Manikandan Baskaran , Uday Bondhugula , Sriram Krishnamoorthy , J. Ramanujam , Atanas Rountev , P. Sadayappan, Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories, Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, February 20-23, 2008, Salt Lake City, UT, USA
|
|
|
|
|
|
|
|
|
|
|
|
J. Ignacio Hidalgo , José L. Risco-Martín , David Atienza , Juan Lanchares, Analysis of multi-objective evolutionary algorithms to optimize dynamic data types in embedded systems, Proceedings of the 10th annual conference on Genetic and evolutionary computation, July 12-16, 2008, Atlanta, GA, USA
|
|
|
|
|
|
José L. Risco-Martín , Saurabh Mittal , David Atienza , J. Ignacio Hidalgo , Juan Lanchares, Optimization of dynamic data types in embedded systems using DEVS/SOA-based modeling and simulation, Proceedings of the 3rd international conference on Scalable information systems, June 04-06, 2008, Vico Equense, Italy
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
José L. Risco-Martín , J. Ignacio Hidalgo , David Atienza , Juan Lanchares , Oscar Garnica, Mixed heuristic and mathematical programming using reference points for dynamic data types optimization in multimedia embedded systems, Proceedings of the 11th Annual conference on Genetic and evolutionary computation, July 08-12, 2009, Montreal, Québec, Canada
|
|
|
|
INDEX TERMS
Primary Classification:
B.
Hardware
B.3
MEMORY STRUCTURES
Additional Classification:
B.
Hardware
B.5
REGISTER-TRANSFER-LEVEL IMPLEMENTATION
B.5.1
Design
Subjects:
Memory design
B.5.2
Design Aids
Subjects:
Optimization
B.7
INTEGRATED CIRCUITS
B.7.1
Types and Design Styles
Subjects:
Memory technologies
D.
Software
D.3
PROGRAMMING LANGUAGES
D.3.4
Processors
Subjects:
Optimization
General Terms:
Algorithms,
Design,
Experimentation,
Performance
Keywords:
DRAM,
SRAM,
address generation,
allocation,
architecture exploration,
code transformation,
data cache,
data optimization,
high-level synthesis,
memory architecture customization,
memory power dissipation,
register file,
size estimation,
survey
|