ACM Home Page
Please provide us with feedback. Feedback
Efficient and scalable compiler-directed energy optimization for realtime applications
Full text PdfPdf (250 KB)
Source
ACM Transactions on Design Automation of Electronic Systems (TODAES) archive
Volume 12 ,  Issue 3  (August 2007) table of contents
Article No. 27  
Year of Publication: 2007
ISSN:1084-4309
Authors
Po-Kuan Huang  University of California, Davis, CA
Soheil Ghiasi  University of California, Davis, CA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 64,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1255456.1255464
What is a DOI?

ABSTRACT

With continuing shrinkage of technology feature sizes, the share of leakage in total energy consumption of digital systems continues to grow. Coordinated supply voltage and body bias throttling enables the compiler to better optimize the total energy consumption of the system in future technology nodes. We present a compilation technique that targets realtime applications running on embedded processors with combined dynamic voltage scaling (DVS) and adaptive body biasing (ABB) capabilities. Considering the delay and energy penalty of switching between operating modes of the processor, our compiler judiciously inserts mode-switch instructions in selected locations of the code and generates executable binary that is guaranteed to meet the deadline constraint. More importantly, our algorithm runs very fast and comes reasonably close to the theoretical limit of energy optimization using DVS+ABB. At 65nm technology, we improve the energy dissipation of the generated code by an average of 33.20% under deadline constraints. While our technique's improvement in energy dissipation over conventional DVS is marginal (6.91%) at 130nm, the average improvement continues to grow to 13.19%, 22.97%, and 33.21% for 90nm, 65nm, and 45nm technology nodes, respectively. Compared to a recent ILP-based competitor, we improve the runtime by more than three orders of magnitude, while producing improved results.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
 
4
Berkeley-model. http://www-device.eecs.berkeley.edu/~ptm/introduction.html.
 
5
Burd, T., Pering, T., Stratakos, A., and Brodersen, R. 2000. A dynamic voltage scaled microprocessor system. IEEE J. Solid-State Circ. 35, 11 (Nov.), 1571--1580.
 
6
 
7
 
8
9
10
 
11
Kao, J., Miyazaki, M., and Chandrakasan, A. 2002. A 175-mv multiply-accumulate unit using an adaptive supply voltage and body bias architecture. J. Solid-State Circu. 37, 11 (Nov.), 1545--1554.
12
13
14
 
15
Longrun. 2007. http://www.transmeta.com/technology/architecture/longrun.html.
16
17
 
18
 
19
 
20
Smith, M. and Holloway, G. 2002. An introduction to machine suif and its portable libraries for analysis and optimization. Tech. Rep., Division of Engineering and Applied Sciences, Harvard University.
21
22
 
23
Xscale. 2007. http://www.intel.com/design/intelxscale.
 
24
 
25
26
 
27

Collaborative Colleagues:
Po-Kuan Huang: colleagues
Soheil Ghiasi: colleagues