| APEX: access pattern based memory architecture exploration |
| Full text |
Pdf
(377 KB)
|
| Source
|
International Symposium on Systems Synthesis
archive
Proceedings of the 14th international symposium on Systems synthesis
table of contents
Montréal, P.Q., Canada
Session: Memory optimization methodologies
table of contents
Pages: 25 - 32
Year of Publication: 2001
ISBN:1-58113-418-5
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 3, Downloads (12 Months): 22, Citation Count: 6
|
|
|
ABSTRACT
Memory accesses represent a major bottleneck in embedded systems power and performance. Traditionally, designers tried to alleviate this problem by relying on a simple cache hierarchy, or a limited use of special purpose memory modules such as stream buffers. Although real-life applications contain a large number of memory references to a diverse set of data structures, a significant percentage of all memory accesses in the application are generated from a few memory instructions that exhibit predictable, well-known access patterns; this creates an opportunity for memory customization, targeting the needs of these access patterns. We present APEX, an approach that extracts, analyzes and clusters the most active access patterns in the application, and aggressively customizes the memory architecture to match the needs of the application, exploring a wide range of cost, performance and power designs. We use a heuristic to prune the design space, guiding the exploration towards the best cost/gain ratios. We present experiments on a set of large real-life benchmarks, showing significant performance improvements for varied cost and power characteristics, allowing the designer to best target the system goals.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
Tzi cker Chiueh.Sunder:A programmable hardware prefetch architecture for numerical loops.In Conference on High Performance Networking and Computing 1994.
|
| |
4
|
|
| |
5
|
|
| |
6
|
P.Grun,N.Dutt,and A.Nicolau.Exploring memory architecture through access pattern analysis and clustering. Technical report,University of California,Irvine,2001.
|
 |
7
|
Patrick Hicks , Matthew Walnock , Robert Michael Owens, Analysis of power consumption in memory hierarchies, Proceedings of the 1997 international symposium on Low power electronics and design, p.239-242, August 18-20, 1997, Monterey, California, United States
[doi> 10.1145/263272.263342]
|
| |
8
|
|
 |
9
|
|
| |
10
|
C.Kulkarni.Cache optimization for Multimedia Applications PhD thesis,IMEC,2001.
|
| |
11
|
|
 |
12
|
|
| |
13
|
|
| |
14
|
|
 |
15
|
R. H. Patterson , G. A. Gibson , E. Ginting , D. Stodolsky , J. Zelenka, Informed prefetching and caching, Proceedings of the fifteenth ACM symposium on Operating systems principles, p.79-95, December 03-06, 1995, Copper Mountain, Colorado, United States
|
| |
16
|
S.Przybylski.Sorting out the new DRAMs.In Hot Chips Tutorial Stanford,CA,1997.
|
| |
17
|
G.Reinman and N.Jouppi.An integrated cache timing and power model.In Summer Internship Report, COMPAQ Western Research Lab, Palo-Alto 1999.
|
 |
18
|
Amir Roth , Andreas Moshovos , Gurindar S. Sohi, Dependence based prefetching for linked data structures, Proceedings of the eighth international conference on Architectural support for programming languages and operating systems, p.115-126, October 02-07, 1998, San Jose, California, United States
|
| |
19
|
Synopsys Design Compiler.www.synopsys.com
|
 |
20
|
Alexander V. Veidenbaum , Weiyu Tang , Rajesh Gupta , Alexandru Nicolau , Xiaomei Ji, Adapting cache line size to application behavior, Proceedings of the 13th international conference on Supercomputing, p.145-154, June 20-25, 1999, Rhodes, Greece
[doi> 10.1145/305138.305188]
|
 |
21
|
N. Vijaykrishnan , M. Kandemir , M. J. Irwin , H. S. Kim , W. Ye, Energy-driven integrated hardware-software optimizations using SimplePower, Proceedings of the 27th annual international symposium on Computer architecture, p.95-106, June 2000, Vancouver, British Columbia, Canada
|
| |
22
|
|
|