ACM Home Page
Please provide us with feedback. Feedback
Dynamically configurable shared CMP helper engines for improved performance
Full text PdfPdf (290 KB)
Source ACM SIGARCH Computer Architecture News archive
Volume 33 ,  Issue 4  (November 2005) table of contents
Special issue: dasCMP'05
SPECIAL ISSUE: Special issue: dasCMP'05 table of contents
Pages: 70 - 79  
Year of Publication: 2005
ISSN:0163-5964
Authors
Anahita Shayesteh  University of California, Los Angeles
Glenn Reinman  University of California, Los Angeles
Norman Jouppi  HP Labs, Palo Alto
Suleyman Sair  North Carolina State University
Tim Sherwood  University of California, Santa Barbara
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 27,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1105734.1105744
What is a DOI?

ABSTRACT

Technology scaling trends have forced designers to consider alternatives to deeply pipelining aggressive cores with large amounts of performance accelerating hardware. One alternative is a small, simple core that can be augmented with latency tolerant helper engines. As the demands placed on the processor core varies between applications, and even between phases of an application, the benefit seen from any set of helper engines will vary tremendously. If there is a single core, these auxiliary structures can be turned on and off dynamically to tune the energy/performance of the machine to the needs of the running application.As more of the processor is broken down into helper engines, and as we add more and more cores onto a single chip which can potentially share helpers, the decisions that are made about these structures become increasingly important. In this paper we describe the need for methods that effectively manage these helper engines. Our counter-based approach can dynamically turn off 3 helpers on average, while staying within 2% of the performance when running with all helpers. In a multicore environment, our intelligent and flexible sharing of helper engines, provides an average 24% speedup over static sharing in conjoined cores. Furthermore we show benefit from constructively sharing helper engines among multiple cores running the same application.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
 
4
D. C. Burger and T. M. Austin. The simplescalar tool set, version 2.0. Technical Report CS-TR-97-1342, U. of Wisconsin, Madison, June 1997.
 
5
R. Dolbeau and A. Seznec. Cash: Revisiting hardware sharing in single-chip parallel processor. Technical Report IRISA Report 1491, IRISA, November 2002.
 
6
7
 
8
 
9
 
10
 
11
12
13
14
15
 
16
P. Shivakumar and Norman P. Jouppi. Cacti 3.0: An integrated cache timing, power, and area model. In Technical Report, 2001.
 
17
18
19
20
 
21
22
 
23
24


Collaborative Colleagues:
Anahita Shayesteh: colleagues
Glenn Reinman: colleagues
Norman Jouppi: colleagues
Suleyman Sair: colleagues
Tim Sherwood: colleagues