ACM Home Page
Please provide us with feedback. Feedback
Thread-Level Speculation on a CMP can be energy efficient
Full text PdfPdf (370 KB)
Source International Conference on Supercomputing archive
Proceedings of the 19th annual international conference on Supercomputing table of contents
Cambridge, Massachusetts
SESSION: Session 6: threads table of contents
Pages: 219 - 228  
Year of Publication: 2005
ISBN:1-59593-167-8
Authors
Jose Renau  University of California, Santa Cruz
Karin Strauss  University of Illinois at Urbana-Champaign
Luis Ceze  University of Illinois at Urbana-Champaign
Wei Liu  University of Illinois at Urbana-Champaign
Smruti Sarangi  University of Illinois at Urbana-Champaign
James Tuck  University of Illinois at Urbana-Champaign
Josep Torrellas  University of Illinois at Urbana-Champaign
Sponsor
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 45,   Citation Count: 0
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1088149.1088178
What is a DOI?

ABSTRACT

Chip Multiprocessors (CMP) with Thread-Level Speculation (TLS) have become the subject of intense research. However, TLS is suspected of being too energy inefficient to compete against conventional processors. In this paper, we refute this claim. To do so, we first identify the main sources of dynamic energy consumption in TLS. Then, we present simple energy-saving optimizations that cut the energy cost of TLS by over 60% on average with minimal performance impact. The resulting TLS CMP, populated with four 3-issue cores, speeds-up full SPECint 2000 codes by 1.27 on average, while keeping the fraction of the chip's energy consumption due to TLS to only 20%. Compared to a 6-issue superscalar at the same frequency, the TLS CMP is on average faster, while consuming only 85% of its total on-chip power.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
International Technology Roadmap for Semiconductors. Semiconductor Industry Association, 2002.
 
2
3
4
5
 
6
 
7
SSA for Trees - GNU Project, May 2003. "http://www.gccsummit. org/2003/view_abstract.php?talk=2".
 
8
9
 
10
 
11
12
13
 
14
A. J. Martin, M. Nystroem, and P. Penzes. ET2: A Metric for Time and Energy Efficiency of Computation. Technical Report CSTR:2001.007, California Institute of Technology, December 2001.
15
 
16
V. Petric and A. Roth. Energy-Effectiveness of Pre-Execution and Energy-Aware P-Thread Selection. Technical Report MS-CIS-03-34, University of Pennsylvania, November 2003.
17
 
18
19
 
20
P. Shivakumar and N. Jouppi. CACTI 3.0: An Integrated Cache Timing, Power and Area Model. Technical Report 2001/2, Compaq Computer Corporation, August 2001.
21
22
 
23
24
 
25
M. Tremblay. MAJC: Microprocessor Architecture for Java Computing. Hot Chips, August 1999.
 
26
 
27
J. Tuck. A Novel Compiler Framework for a Chip-Multiprocessor Architecture with Thread-Level Speculation. Master's thesis, University of Illinois at Urbana-Champaign, 2004.
 
28
 
29
Y. Zhang, D. Parikh, K. Sankaranarayanan, K. Skadron, and M. Stan. HotLeakage: A Temperature-Aware Model of Subthreshold and Gate Leakage for Architects. Technical Report CS-2003-05, University of Virginia, Department of Computer Science, March 2003.

Collaborative Colleagues:
Jose Renau: colleagues
Karin Strauss: colleagues
Luis Ceze: colleagues
Wei Liu: colleagues
Smruti Sarangi: colleagues
James Tuck: colleagues
Josep Torrellas: colleagues