Application-specific protocols for user-level shared memory
Source
Conference on High Performance Networking and Computing
Proceedings of the 1994 ACM/IEEE conference on Supercomputing
Washington, D.C.
SESSION: Session 13: scheduling and operating systems
Pages: 380-389
Year of Publication: 1994
ISSN: 1063-9535, ISBN: 0-8186-6605-6
Authors
Babak Falsafi  University of Wisconsin-Madison, Madison, WI
Alvin R. Lebeck  University of Wisconsin-Madison, Madison, WI
Steven K. Reinhardt  University of Wisconsin-Madison, Madison, WI
Ioannis Schoinas  University of Wisconsin-Madison, Madison, WI
Mark D. Hill  University of Wisconsin-Madison, Madison, WI
James R. Larus  University of Wisconsin-Madison, Madison, WI
Anne Rogers  University of Wisconsin-Madison, Madison, WI
David A. Wood  University of Wisconsin-Madison, Madison, WI
Sponsors
IEEE-CS\DATC : IEEE Computer Society
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/602770.602838

ABSTRACT

Recent distributed shared memory (DSM) systems and proposed shared-memory machines have implemented some or all of their cache coherence protocols in software. One way to exploit the flexibility of this software is to tailor a coherence protocol to match an application's communication patterns and memory semantics. This paper presents evidence that this approach can lead to large performance improvements. It shows that application-specific protocols substantially improved the performance of three application programs--appbt, em3d, and barnes--over carefully tuned transparent shared memory implementations. The speed-ups were obtained on Blizzard, a fine-grained DSM system running on a 32-node Thinking Machines CM-5.
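
To illustrate why matching a protocol to a communication pattern can help, the following is a minimal sketch of a toy message-count model for a producer-consumer pattern. It is not the paper's protocol code and does not reproduce its measurements. The function name count_messages, the two protocol labels, and the per-operation message costs (two messages to invalidate a remote copy, two to fetch a block, one to push an update) are all assumptions made for this example.

```python
def count_messages(protocol: str, blocks: int, iterations: int) -> int:
    """Count messages in a toy producer-consumer exchange (hypothetical model).

    Each iteration, one producer node writes every block and one consumer
    node then reads every block. All message costs are assumed values.
    """
    messages = 0
    # Tracks whether the consumer currently holds a valid copy of each block.
    consumer_has_copy = [False] * blocks

    for _ in range(iterations):
        # Producer phase: write every block.
        for b in range(blocks):
            if protocol == "invalidate":
                # Generic write-invalidate: revoke the consumer's copy first.
                if consumer_has_copy[b]:
                    messages += 2  # invalidate + acknowledgement (assumed)
                    consumer_has_copy[b] = False
            elif protocol == "update":
                # Tailored protocol: push the new value to the known consumer.
                messages += 1  # single update message (assumed)
                consumer_has_copy[b] = True
            else:
                raise ValueError(f"unknown protocol: {protocol}")

        # Consumer phase: read every block.
        for b in range(blocks):
            if not consumer_has_copy[b]:
                messages += 2  # read request + data reply (assumed)
                consumer_has_copy[b] = True

    return messages


if __name__ == "__main__":
    for name in ("invalidate", "update"):
        print(name, count_messages(name, blocks=10, iterations=100))
```

Under these assumed costs, the update variant sends one message per block per iteration, while the invalidate variant sends about four. The sketch counts messages only; it ignores latency, contention, and block granularity, so it says nothing about the speed-ups measured on Blizzard.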

