ABSTRACT
With large-scale integration and increasing power densities, thermal management has become an important tool for maintaining performance and reliability in modern process technologies. At the core of dynamic thermal management schemes lies the accurate reading of on-die temperatures. Careful planning and embedding of thermal monitoring mechanisms into high-performance systems is therefore crucial. In this paper, we propose three techniques to create sensor infrastructures for monitoring the maximum temperature on a multicore system. We begin by extending a nonuniform sensor placement methodology proposed in the literature to handle chip multiprocessors (CMPs) and show its limitations. We then analyze a grid-based approach in which the sensors are placed on a static grid covering each core, and show that the sensor readings can differ from the actual maximum core temperature by as much as 12.6°C when using 16 sensors per core. With the same number of sensors, as many as 10.6% of the thermal emergencies are not captured. Based on this observation, we first develop an interpolation scheme, which estimates the maximum core temperature by interpolating the readings collected at the static grid points. We show that the interpolation scheme improves the measurement accuracy and emergency coverage compared to grid-based placement when using the same number of sensors. Second, we present a dynamic scheme in which only a subset of the sensor readings is collected to predict the maximum temperature of each core. Our results indicate that we can reduce the number of active sensors by as much as 50% while maintaining measurement accuracy and emergency coverage similar to the case where the entire sensor set on the grid is sampled at all times.
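The two figures of merit quoted above for the static grid (worst-case gap between the hottest sensor reading and the true maximum core temperature, and the fraction of thermal emergencies that go undetected) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the names `grid_sensor_sites` and `evaluate_grid`, the choice to place each sensor at the center of its grid cell, the representation of a thermal map as a 2D list of temperatures in °C, and the definition of an emergency as the true maximum reaching a given threshold.

```python
from typing import List, Sequence, Tuple

ThermalMap = List[List[float]]  # temperatures in deg C, indexed [row][col]


def grid_sensor_sites(rows: int, cols: int, n: int) -> List[Tuple[int, int]]:
    """Return n*n sensor sites, one at the center of each cell of an
    n x n tiling of a rows x cols thermal map (assumed placement rule)."""
    return [
        (int((i + 0.5) * rows / n), int((j + 0.5) * cols / n))
        for i in range(n)
        for j in range(n)
    ]


def evaluate_grid(
    maps: Sequence[ThermalMap],
    sites: Sequence[Tuple[int, int]],
    threshold: float,
) -> Tuple[float, float]:
    """Compare the hottest sensor reading with the true maximum.

    Returns (worst_error, miss_rate):
      worst_error: largest (true max - hottest sensor reading) over all maps
      miss_rate:   fraction of emergencies (true max >= threshold) in which
                   no sensor reads at or above the threshold
    """
    worst_error = 0.0
    emergencies = 0
    missed = 0
    for tmap in maps:
        true_max = max(max(row) for row in tmap)
        sensed_max = max(tmap[r][c] for r, c in sites)
        worst_error = max(worst_error, true_max - sensed_max)
        if true_max >= threshold:
            emergencies += 1
            if sensed_max < threshold:
                missed += 1
    miss_rate = missed / emergencies if emergencies else 0.0
    return worst_error, miss_rate
```

With 16 sensors per core, `grid_sensor_sites(rows, cols, 4)` would give the 4 x 4 layout under this placement assumption. An interpolation or subset-sampling scheme could be scored with the same function by replacing the `sensed_max` line with that scheme's estimate of the maximum.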