ACM Home Page
Please provide us with feedback. Feedback
Static analysis of processor stall cycle aggregation
Full text PdfPdf (762 KB)
Source
International Conference on Hardware Software Codesign archive
Proceedings of the 6th IEEE/ACM/IFIP international conference on Hardware/Software codesign and system synthesis table of contents
Atlanta, GA, USA
SESSION: Application specific processor systems table of contents
Pages 25-30  
Year of Publication: 2008
ISBN:978-1-60558-470-6
Authors
Jongeun Lee  Arizona State University, Tempe, AZ, USA
Aviral Shrivastava  Arizona State University, Tempe, AZ, USA
Sponsors
SIGDA: ACM Special Interest Group on Design Automation
SIGBED: ACM Special Interest Group on Embedded Systems
ACM: Association for Computing Machinery
SIGMICRO: ACM Special Interest Group on Microarchitectural Research and Processing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 13,   Downloads (12 Months): 76,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1450135.1450143
What is a DOI?

ABSTRACT

Processor Idle Cycle Aggregation (PICA) is a promising approach for low power execution of processors, in which small memory stalls are aggregated to create a large one, and the processor is switched to low-power mode in it. We extend the previous proposed approach in two dimensions. i) We develop static analysis for the PICA technique and present optimum parameters for five common types of loops based on steady-state analysis. ii) We show that software only control is unable to guarantee its correctness in a varying runtime environment, potentially causing deadlocks. We enhance the robustness of PICA with minimal hardware extension, ensuring correct execution for any loops and parameters, which greatly facilitates exploration based parameter optimization. The combined use of our static analysis and exploration based fine-tuning makes the PICA technique applicable, to any memory-bound loop, with energy reduction. We validate our analytical models against simulation based optimization and also show through our experiments on embedded application benchmarks, that our technique can be applied to a wide range of loops with average 20% energy reductions compared to executions without PICA.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
K. Choi et al. Fine-grained dynamic voltage and frequency scaling for precise energy and performance tradeoff based on the ratio of off-chip access to on-chip computation times. IEEE Trans. CAD, 24(1):18--28, 2005.
2
3
 
4
5
 
6
Intel Corporation. Intel XScale® Core: Developer's Manual. {Online}. Available: http://www.intel.com/design/intelxscale/273473.htm.
 
7
Intel Corporation. Intel 80200 Processor based on Intel XScale Microarchitecture. {Online}. Available: http://www.intel.com/design/iio/manuals/273411.htm.
 
8
J. Rabaey and M. Pedram, editors. Low Power Design Methodologies. Kluwer Academic Publishers, 1996.
9
10
 
11
 
12
13
14
 
15
V. Zivojnovic et al. DSPstone: A DSP-oriented benchmarking methodology. In ICSPAT, 1994.
 
16
SPEC 95. Standard Performance Evaluation Corporation. http://www.spec.org/.

Collaborative Colleagues:
Jongeun Lee: colleagues
Aviral Shrivastava: colleagues