Coupling compiler-enabled and conventional memory accessing for energy efficiency
Source: ACM Transactions on Computer Systems (TOCS)
Volume 22, Issue 2 (May 2004)
Pages: 180-213
Year of Publication: 2004
ISSN: 0734-2071
Authors
Raksit Ashok, BlueRISC Inc., Hadley, MA
Saurabh Chheda, BlueRISC Inc., Hadley, MA
Csaba Andras Moritz, University of Massachusetts, Amherst, MA
Publisher
ACM, New York, NY, USA
DOI: http://doi.acm.org/10.1145/986533.986535

ABSTRACT

This article presents Cool-Mem, a family of memory system architectures that integrate conventional memory system mechanisms, energy-aware address translation, and compiler-enabled cache disambiguation techniques to reduce energy consumption in general-purpose architectures. The solutions exploit interlayer tradeoffs among the architecture, compiler, and operating system layers. Cool-Mem reduces power by statically matching memory operations with energy-efficient cache and virtual memory access mechanisms. It combines statically speculative cache access modes; a dynamic content-addressable-memory-based (CAM-based) Tag-Cache used as a backup for statically mispredicted accesses; different conventional multilevel associative cache organizations; embedded protection checking along all cache access mechanisms; and architectural organizations that reduce the power consumed by address translation in virtual memory. Because it is based on speculative static information, a superset of the predictable program information available at compile time, our approach removes the burden of provable correctness from the compiler analysis passes that extract static information. This makes Cool-Mem highly practical and applicable to large and complex applications, with no limitations arising from complexity issues in our compiler passes or from the presence of precompiled static libraries. In an extensive evaluation on SPEC2000 and Mediabench applications, we obtain total processor energy savings of 6% to 19%, with performance ranging from a 1.5% degradation to a 6% improvement. We have also compared Cool-Mem with several prior approaches and found that it performs better in almost all cases.


