ACM Home Page
Please provide us with feedback. Feedback
Stack oriented data cache filtering
Full text PdfPdf (599 KB)
Source
International Conference on Hardware Software Codesign archive
Proceedings of the 7th IEEE/ACM international conference on Hardware/software codesign and system synthesis table of contents
Grenoble, France
SESSION: Embedded system optimization across memory hierarchy table of contents
Pages 257-266  
Year of Publication: 2009
ISBN:978-1-60558-628-1
Authors
Rodrigo González-Alberquilla  Universidad Complutense de Madrid, Madrid, Spain
Fernando Castro  Universidad Complutense de Madrid, Madrid, Spain
Luis Piñuel  Universidad Complutense de Madrid, Madrid, Spain
Francisco Tirado  Universidad Complutense de Madrid, Madrid, Spain
Sponsors
ACM: Association for Computing Machinery
SIGBED: ACM Special Interest Group on Embedded Systems
SIGMICRO: ACM Special Interest Group on Microarchitectural Research and Processing
SIGDA: ACM Special Interest Group on Design Automation
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 15,   Downloads (12 Months): 15,   Citation Count: 0
Additional Information:

abstract   references   index terms  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1629435.1629472
What is a DOI?

ABSTRACT

The L1 data cache is one of the most frequently accessed structures in the processor. Because of this and its moderate size it is a major consumer of power. In order to reduce its power consumption, in this paper a small filter structure that exploits the special features of the references to the stack region is proposed. This filter, which acts as a top -non-inclusive- level of the data memory hierarchy, consists of a register set that keeps the data stored in the neighborhood of the top of the stack. Our simulation results show that using a small Stack Filter (SF) of only a few registers, 15% to 30% data cache power savings can be achieved on average, with a negligible performance penalty.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Viredaz, M.A., Wallach, D.A.: Power evaluation of a handheld computer. IEEE Micro 23(1) (January 2003) 66--74
 
2
Montanaro, J., et al.: A 160 Mhz, 32-b, 0.5W CMOS RISC Microprocessor. Digital Technology Journal 9(1) (1997) 49--62
 
3
Ghose, K., Kamble, M.B.: Reducing Power in Superescalar Processors Caches using Su-banking, Multiple Line Buffers and Bit-Line Segmentation. In: International Symposium on Low-Power Electronics and Design. (August 1999) 70--75
 
4
Su, C.L., Despain, A.M.: Cache Designs for Energy-Efficiency. In: Hawaii International Conference on Systems Sciences. (January 1995) 306--314
 
5
Racunas, P., Patt, Y.N.: Partitioned First-Level Cache Design for Clustered Microarchitectures. In: International Conference on Supercomputing, San Francisco, California (June 2003) 22---31
 
6
Kin, J., Gupta, M., Mangione-Smith, W.: The Filter Cache: An Energy Efficient Memory Structure. In: International Symposium on Microarchitecture, Research Triangle Park, North Carolina (December1997) 184--193
 
7
Albonesi, D.: Selective Cache Ways: On-Demand Cache Resource Allocation. Journal of Instruction-Level Parallelism 2 (2000)
 
8
Lee, L.H., Moyer, B., Arends, J.: Instruction Fetch Energy Reduction Using Loop Caches for Embedded Applications with Small Tight Loops. In: International Symposium on Low-Power Electronics and Design, San Diego, California (August 1999) 267--269
 
9
Lee, H., Smelyanskiy, M., Newburn, C., Tyson, G.: Stack Value File: Custom Microarchitecture for the Stack. In: International Symposium on High-Performance Computer Architecture, Monterey, Mexico (January 2001) 5--14
 
10
Cho, S., Yew, P., Lee, G.: Decoupling Local Variable Accesses in a Wide-Issue Superscalar Processor. In: International Symposium on Computer Architecture, Atlanta, Georgia (May 1999) 100--110
 
11
Ward, Halstead: Computation Structures. Kluwer Academics (2002)
 
12
Ballapuram, C.S., Sharif, A., Lee, H.S.: Exploiting Access Semantics and Program Behavior to Reduce Snoop Power in Chip Multiprocessors. In:
 
13
Architectural Support for Programming Languages and Operating Systems, Seattle, Washington (March 2008) 60--69
 
14
Mudge, T., Austin, T., Grunwald, D., et al.: A SimpleScalar-Arm Power Modeling Project http://www.eecs.umich.edu/Üpanalyzer.
 
15
Austin, T., Larson, E., Ernst, D.: SimpleScalar: An Infrastructure for Computer System Modeling. IEEE Computer 39(2) (February 2002) 59--67
 
16
Guthaus, M.R., Ringenberg, J.S., Ernst, D., Austin, T.M., Mudge, T., Brown, R.B.: Mibench:a Free, Commercially Representative Embedded Benchmark Suite. In: Workload Characterization, 2001. WWC-4. 2001 IEEE International Workshop, Washington (December 2001) 3--14
 
17
Kim, N.S., Austin, T., Mudge, T., Grunwald, D.: Challenges for Architectural Level Power Modeling. Kluwer Academics (2002)
 
18
Thoziyoor, S., Muralimanohar, N., Ahn, J., Jouppi, N.: CACTI 5.1. Technical report, HP Laboratories, Palo Alto (April 2008)
 
19
Ditzel, D.R., McLellan, H.R.: Register Allocation for Free: The C Machine Stack Cache. SIGARCH Computer Architecture News 10(2) (1982) 48--56
 
20
Huang, M., Renau, J., Torrellas, J.: L1 Data Cache Decomposition for Energy Efficiency. In: International Symposium on Low-Power Electronics and Design, Huntington Beach, California (August 2001) 10--15
 
21
Geiger, M., McKee, S., Tyson, G.: Beyond Basic Region Caching: Specilalizing Cache Structures for High Performance and Energy Consevation. Hipeac 2005 (2005)
 
22
Hemsath, A., Morton, R., Sjodin, J.: Implementing a Stack Cache. Technical report, Rice University (June 2007)