ACM Home Page
Please provide us with feedback. Feedback
Binary translation to improve energy efficiency through post-pass register re-allocation
Full text PdfPdf (191 KB)
Source International Conference On Embedded Software archive
Proceedings of the 4th ACM international conference on Embedded software table of contents
Pisa, Italy
SESSION: Energy-aware systems table of contents
Pages: 74 - 85  
Year of Publication: 2004
ISBN:1-58113-860-1
Authors
Kun Zhang  Georgia Institute of Technology, Atlanta, GA
Tao Zhang  Georgia Institute of Technology, Atlanta, GA
Santosh Pande  Georgia Institute of Technology, Atlanta, GA
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 34,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1017753.1017769
What is a DOI?

ABSTRACT

Energy efficiency is rapidly becoming a first class optimization parameter for modern systems. Caches are critical to the overall performance and thus, modern processors (both high and low-end) tend to deploy a cache with large size and high degree of associativity. Due a large size cache power takes up a significant percentage of total system power. One important way to reduce cache power consumption is to reduce the dynamic activities in the cache by reducing the dynamic load-store counts. In this work, we focus on programs that are only available as binaries which need to be improved for energy efficiency. For adapting these programs for energy-constrained devices, we propose a feed-back directed post-pass solution that tries to do register re-allocation to reduce dynamic load/store counts and to improve energy-efficiency. Our approach is based on zero knowledge of original code generator or compiler and performs a post-pass register allocation to get a more power-efficient binary. We attempt to find out the dead as well as unused registers in the binary and then re-allocate them on hot paths to reduce dynamic load/store counts. It is shown that the static code size increase due to our framework is very minimal. Our experiments on SPEC2000 and MediaBench show that our technique is effective. We have seen dynamic spill loads/stores reduction in the data-cache ranging from 0% to 26.4%. Overall, our approach improves the energy-delay product of the program.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
4
 
5
 
6
T. J. Harvey, Reducing the Impact of Spill Code, Master's Thesis, Rice University, May 1998.
7
 
8
Premkishore Shivakumar and Norman P. Jouppi. CACTI 3.0: An Integrated Cache Timing, Power, and Area Model. WRL research report 2001/2.
9
10
 
11
 
12
 
13
 
14
Don Heller. Rabbit: A Performance Counters Library for Intel/AMD Processors and Linux. http://www.scl.ameslab.gov/Projects/Rabbit/
15
 
16
Simon Segars. Low Power Design Techniques for Microprocessors. Conference Presentation on IEEE International Solid-State Circuits Conference (ISSCC), Feb. 2001.
 
17
 
18
Mach-Suif Backend Compiler, The Machine-Suif 2.1 compiler documentation set. Harvard University, Sep. 2000, http://ececs.harvard.edu/hube/research/machsuif.html.
19
20
21
22
 
23


Collaborative Colleagues:
Kun Zhang: colleagues
Tao Zhang: colleagues
Santosh Pande: colleagues