ACM Home Page
Please provide us with feedback. Feedback
The case for simple, visible cache coherency
Full text PdfPdf (287 KB)
Source Architectural Support for Programming Languages and Operating Systems archive
Proceedings of the 2008 ACM SIGPLAN workshop on Memory systems performance and correctness: held in conjunction with the Thirteenth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '08) table of contents
Seattle, Washington
SESSION: Software analysis table of contents
Pages 31-35  
Year of Publication: 2008
ISBN:978-1-60558-049-4
Authors
Robert Kunz  Ambarella, Inc.
Mark Horowitz  Stanford University
Sponsor
SIGPLAN: ACM Special Interest Group on Programming Languages
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 13,   Downloads (12 Months): 71,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1353522.1353532
What is a DOI?

ABSTRACT

The shared memory research community has proposed many complex communication protocols that aim to eliminate specific performance bottlenecks, while still providing an easy-to-use communication interface. Although tailored protocols can eliminate some bottlenecks that arise in real applications, removing the cause of the bottleneck through software optimizations and bug fixes is cheaper to implement, faster to fix (once found), and requires no additional support by the hardware beyond a simple shared memory interface. In fact, in our experience, the choice of coherence protocol is much less important than providing an efficient hardware feedback that indentifies the source of the problem. Future cache-coherence research should focus efforts on illuminating memory system behavior, providing smarter tools to identify bottlenecks, and helping to eliminate them through software optimizations.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
3
4
 
5
6
 
7
 
8
 
9
R. Kalla, B. Sinharoy, and J. Tendler. Simultaneous Multithreading Implementations in POWER5---IBM's Next Generation POWER Microprocessor. In Hot Chips 15, August 2003.
 
10
S. Kapil. Gemini: A Power-efficient Chip Multi-Threaded (CMT) UltraSPARC Processor. In Hot Chips 15, August 2003.
 
11
P. Kongetiraer. A 32-way Multithreaded SPARC processor. In Hot Chips 16, August 2004.
 
12
R. Kunz. Performance Bottlenecks on Large-Scale Shared-Memory Multiprocessors. PhD thesis, Stanford University, 2005.
13
14
15
16
 
17
 
18
A. Nowatzyk et al. S3.mp: A Multiprocessor in a Matchbox. In Proceedings of PASA, June 1993.
19
 
20
21
22
23

Collaborative Colleagues:
Robert Kunz: colleagues
Mark Horowitz: colleagues