| Exploiting shared scratch pad memory space in embedded multiprocessor systems |
| Full text |
Pdf
(255 KB)
|
| Source
|
Annual ACM IEEE Design Automation Conference
archive
Proceedings of the 39th annual Design Automation Conference
table of contents
New Orleans, Louisiana, USA
SESSION: Memory management and address optimization in embedded systems
table of contents
Pages: 219 - 224
Year of Publication: 2002
ISBN ~ ISSN:0738-100X , 1-58113-461-4
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 7, Downloads (12 Months): 56, Citation Count: 9
|
|
|
ABSTRACT
In this paper, we present a compiler strategy to optimize data accesses in regular array-intensive applications running on embedded multiprocessor environments. Specifically, we propose an optimization algorithm that targets the reduction of extra off-chip memory accesses caused by inter-processor communication. This is achieved by increasing the application-wide reuse of data that resides in the scratch-pad memories of processors. Our experimental results obtained on four array-intensive image processing applications indicate that exploiting inter-processor data sharing can reduce the energy-delay product by as much as 33.8% (and 24.3% on average) on a four-processor embedded system. The results also show that the proposed strategy is robust in the sense that it gives consistently good results over a wide range of several architectural parameters.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Alfred V. Aho , Ravi Sethi , Jeffrey D. Ullman, Compilers: principles, techniques, and tools, Addison-Wesley Longman Publishing Co., Inc., Boston, MA, 1986
|
| |
2
|
S. Amarasinghe, J. Anderson, M. Lam, and C. Tseng. The SUIF compiler for scalable parallel machines. In Proc. 7th SIAM Conference on Parallel Processing for Scientific Computing, 1995.
|
| |
3
|
|
| |
4
|
|
| |
5
|
Blue Wave Systems. http://www.bluews.com/
|
 |
6
|
Rajesh Bordawekar , Alok Choudhary , J. Ramanujam, Automatic optimization of communication in compiling out-of-core stencil codes, Proceedings of the 10th international conference on Supercomputing, p.366-373, May 25-28, 1996, Philadelphia, Pennsylvania, United States
[doi> 10.1145/237578.237638]
|
| |
7
|
|
| |
8
|
|
| |
9
|
Dinero IV Trace-Driven Uniprocessor Cache Simulator. URL: http://www.cs.wisc.edu/~markhill/DineroIV/
|
| |
10
|
|
| |
11
|
|
| |
12
|
R. Gonzales and M. Horowitz. Energy dissipation in general purpose microprocessors. IEEE Journal of Solid-State Circuits, pages 1277--1284, Sep. 1996
|
 |
13
|
M. Kandemir , N. Vijaykrishnan , M. J. Irwin , W. Ye, Influence of compiler optimizations on system power, Proceedings of the 37th conference on Design automation, p.304-307, June 05-09, 2000, Los Angeles, California, United States
[doi> 10.1145/337292.337425]
|
 |
14
|
M. Kandemir , J. Ramanujam , J. Irwin , N. Vijaykrishnan , I. Kadayif , A. Parikh, Dynamic management of scratch-pad memory space, Proceedings of the 38th conference on Design automation, p.690-695, June 2001, Las Vegas, Nevada, United States
[doi> 10.1145/378239.379049]
|
 |
15
|
David J. Kolson , Alexandru Nicolau , Nikil Dutt, Minimization of memory traffic in high-level synthesis, Proceedings of the 31st annual conference on Design automation, p.149-154, June 06-10, 1994, San Diego, California, United States
[doi> 10.1145/196244.196316]
|
| |
16
|
Mobile Computing Devices: A New Era in Personal Computing, August 2000. Computer Market Dynamics. Report Number CMC00-005MC.
|
| |
17
|
|
| |
18
|
|
 |
19
|
Wen-Tsong Shiue , Chaitali Chakrabarti, Memory exploration for low power, embedded systems, Proceedings of the 36th ACM/IEEE conference on Design automation, p.140-145, June 21-25, 1999, New Orleans, Louisiana, United States
[doi> 10.1145/309847.309902]
|
 |
20
|
N. Vijaykrishnan , M. Kandemir , M. J. Irwin , H. S. Kim , W. Ye, Energy-driven integrated hardware-software optimizations using SimplePower, Proceedings of the 27th annual international symposium on Computer architecture, p.95-106, June 2000, Vancouver, British Columbia, Canada
|
| |
21
|
|
| |
22
|
|
| |
23
|
Y. Zhang, Y. Chen, W. Ye, and M. J. Irwin. System-level interconnect power modeling. In Proc. the 11th International ASIC Conference, 1998.
|
CITED BY 9
|
|
Federico Angiolini , Luca Benini , Alberto Caprara, Polynomial-time algorithm for on-chip scratchpad memory partitioning, Proceedings of the 2003 international conference on Compilers, architecture and synthesis for embedded systems, October 30-November 01, 2003, San Jose, California, USA
|
|
|
Federico Angiolini , Francesco Menichelli , Alberto Ferrero , Luca Benini , Mauro Olivieri, A post-compiler approach to scratchpad mapping of code, Proceedings of the 2004 international conference on Compilers, architecture, and synthesis for embedded systems, September 22-25, 2004, Washington DC, USA
|
|
|
O. Ozturk , M. Kandemir , I. Demirkiran , G. Chen , M. J. Irwin, Data compression for improving SPM behavior, Proceedings of the 41st annual conference on Design automation, June 07-11, 2004, San Diego, CA, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|