ACM Home Page
Please provide us with feedback. Feedback
Software controlled memory layout reorganization for irregular array access patterns
Full text PdfPdf (521 KB)
Source
International Conference on Compilers, Architecture and Synthesis for Embedded Systems archive
Proceedings of the 2007 international conference on Compilers, architecture, and synthesis for embedded systems table of contents
Salzburg, Austria
SESSION: Memory systems table of contents
Pages: 179 - 188  
Year of Publication: 2007
ISBN:978-1-59593-826-8
Authors
Doosan Cho  Seoul National University
Ilya Issenin  University of California: Irvine
Nikil Dutt  University of California: Irvine
Jonghee W. Yoon  Seoul National University
Yunheung Paek  Seoul National University
Sponsors
ACM: Association for Computing Machinery
SIGBED: ACM Special Interest Group on Embedded Systems
SIGMICRO: ACM Special Interest Group on Microarchitectural Research and Processing
SIGDA: ACM Special Interest Group on Design Automation
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 68,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1289881.1289915
What is a DOI?

ABSTRACT

Many embedded array-intensive applications have irregular access patterns that are not amenable to static analysis for extraction of access patterns, and thus prevent efficient use of a Scratch Pad Memory (SPM) hierarchy for performance and power improvement. We present a profiling based strategy that generates a memory access trace which can be used to identify data elements with fine granularity that can profitably be placed in the SPMs to maximize performance and energy gains. We developed an entire toolchain that allows incorporation of the code required to profitably move data to SPMs; visualization of the extracted access pattern after profiling; and evaluation/exploration of the generated application code to steer mapping of data to the SPM to yield performance and energy benefits.We present a heuristic approach that efficiently exploits the SPM using the profiler-driven access pattern behaviors. Experimental results on EEMBC and other industrial codes obtained with our framework show that we are able to achieve 36% energy reduction and reduce execution time by up to 22% compared to a cache based system.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
3
 
4
D. Burger and T. M. Austin. The SimpleScalar tool set, version 2.0. InTechnical Report 1342, University of Wisconsin-Madison, CS Department, June 1997.
5
 
6
 
7
M. Co, Dee A. B. Weikle, and K. Skadron A Break-Even Formulation for Evaluating Branch Predictor Energy Efficiency. In Proceedings of Complexity-Effective Design, 2005.
8
 
9
 
10
11
 
12
I. Issenin, and N. Dutt. Data Reuse Driven Memory and Network-on-Chip Co-Synthesis. In Proceedings of IESS, 2007.
13
14
 
15
16
 
17
 
18
S. Lin, D. Costello. Error Control Coding: Fundamentals and Applications. Book, Practice Hall, October 1982.
19
 
20
21
 
22
 
23
 
24
P. Shivakumar, N. Jouppi. CACTI 3.0: An Integrated Cache Timing, Power, and Area Model. WRL Technical Report, August 2001.
25
26
27
28
 
29
EDN Embedded Microprocessor Benchmark Consortium, www.eembc.org.
 
30
Intel Corporation, http://developer.intel.com/design/intelxscale/index.htm.


Collaborative Colleagues:
Doosan Cho: colleagues
Ilya Issenin: colleagues
Nikil Dutt: colleagues
Jonghee W. Yoon: colleagues
Yunheung Paek: colleagues