| Word-interleaved cache: an energy efficient data cache architecture |
| Full text |
Pdf
(2.21 MB)
|
Source
|
International Symposium on Low Power Electronics and Design
archive
Proceeding of the thirteenth international symposium on Low power electronics and design
table of contents
Bangalore, India
SESSION: Memory systems & special-purpose hardware
table of contents
Pages 265-270
Year of Publication: 2008
ISBN:978-1-60558-109-5
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 13, Downloads (12 Months): 110, Citation Count: 0
|
|
|
ABSTRACT
We propose a novel energy-efficient data cache architecture, namely, word-interleaved (WI) cache. In theWI cache, a cache block is distributed uniformly among the different cache ways and each line of a cache way holds some words of the block. This distribution provides an opportunity to activate/deactivate the cache ways based on the requested address's offset, thus minimizing the overall cache access energy. For a 4-way set associative cache of size 16KB and blocksize 32B, the proposed technique accomplishes dynamic energy savings of 54.2% without considering fast hits and 62.3% when fast hits are considered, with small performance degradation and negligible area overhead.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
SimpleScalar toolset. http://www.simplescalar.com.
|
| |
2
|
SPEC 2000 Benchmark. http://www.spec.org.
|
| |
3
|
|
| |
4
|
|
| |
5
|
|
| |
6
|
|
| |
7
|
|
 |
8
|
|
 |
9
|
Kanad Ghose , Milind B. Kamble, Reducing power in superscalar processor caches using subbanking, multiple line buffers and bit-line segmentation, Proceedings of the 1999 international symposium on Low power electronics and design, p.70-75, August 16-17, 1999, San Diego, California, United States
[doi> 10.1145/313817.313860]
|
| |
10
|
|
| |
11
|
Atsushi Hasegawa , Ikuya Kawasaki , Kouji Yamada , Shinichi Yoshioka , Shumpei Kawasaki , Prasenjit Biswas, SH3: High Code Density, Low Power, IEEE Micro, v.15 n.6, p.11-19, December 1995
[doi> 10.1109/40.476254]
|
| |
12
|
|
 |
13
|
Michael Huang , Jose Renau , Seung-Moon Yoo , Josep Torrellas, L1 data cache decomposition for energy efficiency, Proceedings of the 2001 international symposium on Low power electronics and design, p.10-15, August 2001, Huntington Beach, California, United States
[doi> 10.1145/383082.383086]
|
 |
14
|
Koji Inoue , Tohru Ishihara , Kazuaki Murakami, Way-predicting set-associative cache for high performance and low energy consumption, Proceedings of the 1999 international symposium on Low power electronics and design, p.273-275, August 16-17, 1999, San Diego, California, United States
[doi> 10.1145/313817.313948]
|
| |
15
|
|
 |
16
|
|
 |
17
|
|
| |
18
|
Johnson Kin , Munish Gupta , William H. Mangione-Smith, The filter cache: an energy efficient memory structure, Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture, p.184-193, December 01-03, 1997, Research Triangle Park, North Carolina, United States
|
 |
19
|
|
| |
20
|
J. Montanaro, et.al., "A 160 MHz, 32b 0.5W CMOS RISC microprocessor". International Solid-State Circuits Conference, 1996, pp. 214--215.
|
 |
21
|
|
| |
22
|
P. Petrov and A. Orailoglu, "Tag compression for low power in dynamically customizable embedded processors". IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 23(7), 2004, pp. 1031--1047.
|
| |
23
|
D. Tarjan, S. Thoziyoor and N. Jouppi, "CACTI 4.0: An Integrated Cache Timing, Power and Area Model", Technical report, HP Laboratories Palo Alto, June 2006.
|
 |
24
|
|
 |
25
|
|
 |
26
|
|
| |
27
|
|
|