Binary translation to improve energy efficiency through post-pass register re-allocation
Source
International Conference On Embedded Software
Proceedings of the 4th ACM international conference on Embedded software
Pisa, Italy
SESSION: Energy-aware systems
Pages: 74 - 85
Year of Publication: 2004
ISBN: 1-58113-860-1
Authors
Kun Zhang, Georgia Institute of Technology, Atlanta, GA
Tao Zhang, Georgia Institute of Technology, Atlanta, GA
Santosh Pande, Georgia Institute of Technology, Atlanta, GA
ABSTRACT
Energy efficiency is rapidly becoming a first-class optimization parameter for modern systems. Caches are critical to overall performance, and thus modern processors (both high- and low-end) tend to deploy large caches with a high degree of associativity. Because of its large size, the cache accounts for a significant percentage of total system power. One important way to reduce cache power consumption is to reduce dynamic activity in the cache by reducing dynamic load/store counts. In this work, we focus on programs that are available only as binaries and that need to be improved for energy efficiency. To adapt these programs for energy-constrained devices, we propose a feedback-directed post-pass solution that performs register re-allocation to reduce dynamic load/store counts and improve energy efficiency. Our approach assumes zero knowledge of the original code generator or compiler and performs a post-pass register allocation to obtain a more power-efficient binary. We attempt to find dead as well as unused registers in the binary and then re-allocate them on hot paths to reduce dynamic load/store counts. The static code size increase due to our framework is shown to be minimal. Our experiments on SPEC2000 and MediaBench show that our technique is effective: the reduction in dynamic spill loads/stores in the data cache ranges from 0% to 26.4%. Overall, our approach improves the energy-delay product of the program.
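The core idea in the abstract (find registers the binary leaves free, then use them to replace the most frequently executed spill loads/stores) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The `Instr` record, the `slot` and `count` fields, and the function names are hypothetical, and the sketch only considers registers that are never referenced in a function; finding dead registers, as the abstract also mentions, would additionally need liveness analysis, which is omitted here. It also assumes each spill slot is accessed only by the listed loads and stores.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Instr:
    """Hypothetical decoded instruction with a profiled execution count."""
    op: str                          # "load", "store", "move", "add", ...
    defs: Tuple[str, ...] = ()       # registers written
    uses: Tuple[str, ...] = ()       # registers read
    slot: Optional[str] = None       # stack slot touched by a spill load/store
    count: int = 0                   # dynamic execution count from profiling


def unused_registers(instrs, all_regs):
    """Registers that no instruction in the function reads or writes."""
    touched = {r for i in instrs for r in i.defs + i.uses}
    return [r for r in all_regs if r not in touched]


def hot_spill_slots(instrs):
    """Spill slots ordered by total dynamic load/store count, hottest first."""
    heat = Counter()
    for i in instrs:
        if i.op in ("load", "store") and i.slot is not None:
            heat[i.slot] += i.count
    return [slot for slot, _ in heat.most_common()]


def reallocate(instrs, all_regs):
    """Give the hottest slots a free register; turn their memory ops into moves."""
    mapping = dict(zip(hot_spill_slots(instrs), unused_registers(instrs, all_regs)))
    out, saved = [], 0
    for i in instrs:
        reg = mapping.get(i.slot)
        if reg is None or i.op not in ("load", "store"):
            out.append(i)                       # untouched instruction
        elif i.op == "load":
            out.append(Instr("move", i.defs, (reg,), None, i.count))
            saved += i.count                    # one dynamic load avoided per execution
        else:
            out.append(Instr("move", (reg,), i.uses, None, i.count))
            saved += i.count                    # one dynamic store avoided per execution
    return out, saved


# Toy function: slot s0 is on a hot path, slot s1 is cold, r3 is never used.
code = [
    Instr("load", defs=("r1",), slot="s0", count=1000),
    Instr("add", defs=("r1",), uses=("r1", "r2"), count=1000),
    Instr("store", uses=("r1",), slot="s0", count=1000),
    Instr("load", defs=("r2",), slot="s1", count=10),
]
new_code, saved = reallocate(code, ["r1", "r2", "r3"])
```

In this toy input only one register is free, so only the hot slot `s0` is promoted and the cold load from `s1` stays in memory. Ranking slots by profiled counts is what makes the pass feedback-directed in this sketch; a real binary rewriter would also have to respect calling conventions and verify that the chosen register is free along every path that reaches the rewritten instructions.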