ACM Home Page
Please provide us with feedback. Feedback
Coherence decoupling: making use of incoherence
Full text PdfPdf (174 KB)
Source Architectural Support for Programming Languages and Operating Systems archive
Proceedings of the 11th international conference on Architectural support for programming languages and operating systems table of contents
Boston, MA, USA
SESSION: Architecture table of contents
Pages: 97 - 106  
Year of Publication: 2004
ISBN:1-58113-804-0
Also published in ...
Authors
Jaehyuk Huh  The University of Texas at Austin
Jichuan Chang  University of Wisconsin-Madison
Doug Burger  The University of Texas at Austin
Gurindar S. Sohi  University of Wisconsin-Madison
Sponsors
SIGPLAN: ACM Special Interest Group on Programming Languages
SIGOPS: ACM Special Interest Group on Operating Systems
SIGARCH: ACM Special Interest Group on Computer Architecture
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 72,   Citation Count: 10
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1024393.1024406
What is a DOI?

ABSTRACT

This paper explores a new technique called coherence decoupling, which breaks a traditional cache coherence protocol into two protocols: a Speculative Cache Lookup (SCL) protocol and a safe, backing coherence protocol. The SCL protocol produces a speculative load value, typically from an invalid cache line, permitting the processor to compute with incoherent data. In parallel, the coherence protocol obtains the necessary coherence permissions and the correct value. Eventually, the speculative use of the incoherent data can be verified against the coherent data. Thus, coherence decoupling can greatly reduce --- if not eliminate --- the effects of false sharing. Furthermore, coherence decoupling can also reduce latencies incurred by true sharing. SCL protocols reduce those latencies by speculatively writing updates into invalid lines, thereby increasing the accuracy of speculation, without complicating the simple, underlying coherence protocol that guarantees correctness.The performance benefits of coherence decoupling are evaluated using a full-system simulator and a mix of commercial and scientific benchmarks. Our results show that 40% to 90% of all coherence misses can be speculated correctly, and therefore their latencies partially or fully hidden. This capability results in performance improvements ranging from 3% to over 16%, in most cases where the latencies of coherence misses have an effect on performance.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
3
4
5
6
 
7
 
8
9
10
 
11
P. N. Glaskowsky. IBM Raises Curtain on Power5. Microprocessor Report, Oct. 14 2003.
12
 
13
14
 
15
IEEE. IEEE Standard for Scalable Coherent Interface (SCI), 1992. IEEE 1596--1992.
 
16
T. Karkhanis and J. Smith. A day in the life of a cache miss. In Proceedings of 2nd Annual Workshop On Memory Performance Issues, May 2002.
 
17
 
18
S. Kaxiras and C. Young. Coherence communication prediction in shared-memory multiprocessors. In Proceedings of the 6th Int. Symp. on High Performance Computer Architecture, pages 156--167, Feb. 2000.
19
20
21
22
 
23
24
25
26
27
 
28
29
30
 
31
32
33
34
 
35
36
 
37
38
39
 
40
41
42
 
43
 
44

CITED BY  10

Collaborative Colleagues:
Jaehyuk Huh: colleagues
Jichuan Chang: colleagues
Doug Burger: colleagues
Gurindar S. Sohi: colleagues