|
ABSTRACT
In this paper, we present a circuit technique that supports a super-drowsy mode with a single-V DD . In addition, we perform a detailed working set analysis for various cache line update policies for placing lines in a drowsy state. The analysis presents a policy for an instruction cache and shows it is as good as or better than more complex schemes proposed in the past. Furthermore, as an alternative to using high-threshold devices to reduce the bitline leakage through access transistors in drowsy caches, we propose a gated bitline precharge technique. A single threshold process is now sufficient. The gated precharge employs a simple but effective predictor that almost completely hides any performance loss incurred by the transitions between sub-banks. A 64-entry predictor with 3 bits per entry reduces the run-time increase by 78%, which is as effective as previous proposals that used content addressable predictors with 40 bits per entry. Overall, the combination of the proposed techniques reduces the leakage power by 72% with negligible (0.4%) run-time increase.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
T. Douseki, et al., "A 0.5-1V MTCMOS/SIMOX SRAM macro with multi-V TH memory cells," IEEE SOI Conf., 2000.
|
 |
2
|
Koji Nii , Hiroshi Makino , Yoshiki Tujihashi , Chikayoshi Morishima , Yasushi Hayakawa , Hiroyuki Nunogami , Takahiko Arakawa , Hisanori Hamano, A low power SRAM using auto-backgate-controlled MT-CMOS, Proceedings of the 1998 international symposium on Low power electronics and design, p.293-298, August 10-12, 1998, Monterey, California, United States
[doi> 10.1145/280756.280939]
|
| |
3
|
Faith Hamzaoglu , Yibin Ye , Ali Keshavarzi , Kevin Zhang , Siva Narendra , Shekhar Borkar , Mircea Stan , Vivek De, Analysis of dual-Vt SRAM cells with full-swing single-ended bit line sensing for on-chip cache, IEEE Transactions on Very Large Scale Integration (VLSI) Systems, v.10 n.2, p.91-95, April 2002
[doi> 10.1109/92.994983]
|
 |
4
|
|
| |
5
|
K. Flautner, et al., "Drowsy caches," ISCA, 2002.
|
| |
6
|
N. Kim, et al., "Drowsy instruction caches," MICRO, 2002.
|
| |
7
|
|
 |
8
|
J. S. Hu , A. Nadgir , N. Vijaykrishnan , M. J. Irwin , M. Kandemir, Exploiting program hotspots and code sequentiality for instruction cache leakage management, Proceedings of the 2003 international symposium on Low power electronics and design, August 25-27, 2003, Seoul, Korea
[doi> 10.1145/871506.871606]
|
 |
9
|
|
| |
10
|
|
| |
11
|
|
CITED BY 10
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Bo Zhai , Ronald G. Dreslinski , David Blaauw , Trevor Mudge , Dennis Sylvester, Energy efficient near-threshold chip multi-processing, Proceedings of the 2007 international symposium on Low power electronics and design, August 27-29, 2007, Portland, OR, USA
|
|
|
|
|
|
Ismail Kadayif , Ayhan Zorlubas , Selcuk Koyuncu , Olcay Kabal , Davut Akcicek , Yucel Sahin , Mahmut Kandemir, Capturing and optimizing the interactions between prefetching and cache line turnoff, Microprocessors & Microsystems, v.32 n.7, p.394-404, October, 2008
|
|
|
Ronald G. Dreslinski , Gregory K. Chen , Trevor Mudge , David Blaauw , Dennis Sylvester , Krisztian Flautner, Reconfigurable energy efficient near threshold cache architectures, Proceedings of the 2008 41st IEEE/ACM International Symposium on Microarchitecture, p.459-470, November 08-12, 2008
|
|