|
ABSTRACT
Scalable shared-memory multiprocessors are the subject of much current research, but little is known about the performance behavior of these machines. This paper studies the performance effects of two machine characteristics and two program characteristics that seem to be major factors in determining the performance of a hierarchical shared-memory machine. We develop an analytical model of the traffic in a machine loosely based on Stanford's DASH multiprocessor and use program parameters extracted from multiprocessor traces to study its performance. It is shown that both locality in the data reference stream and the amount of data sharing in a program have an important impact on performance. Although less obvious, the bandwidth within each cluster in the hierarchy also has a significant performance effect. Optimizations that improve the intracluster cache coherence protocol or increase the bandwidth within a cluster can be quite effective.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
A. Agarwal , R. Simoni , J. Hennessy , M. Horowitz, An evaluation of directory schemes for cache coherence, Proceedings of the 15th Annual International Symposium on Computer architecture, p.280-298, May 30-June 02, 1988, Honolulu, Hawaii, United States
|
| |
2
|
L.M. Censier and E Feautrier. A New Solution to Coherence Problems in Multicache Systems. In IEEE Trans. on Computers, pages 1112- 1118, December 1978.
|
 |
3
|
|
 |
4
|
|
| |
5
|
A. Gottlieb, R. Grishman, C. Kruskal, K. MeAuliffe, L. Rudolph, and M. Snir. The NYU Ultracomputer - Designing an MIMD Shared Memory Parallel Computer. In IEEE Trans. on Computers, pages 175- 189, February 1983.
|
| |
6
|
T. Joe. Simulation Analysis for the Parameter Sensitivity of a Hybrid Interconnect Scheme for a Multiprocessor System. EE390 Report, Stanford University, May 1988.
|
| |
7
|
|
| |
8
|
|
| |
9
|
D. Lenoski, K. Gharachorloo, J. Laudon, A. Gupta, J. Hennessy, M. Horowitz, and M. Lam. Design of Scalable Shared-Memory Multiprocessors: The DASH Approach. In Procee&'ngs of the 35th IEEE Computer Society International Conference - COMPCON 90, 1990, to appear.
|
| |
10
|
D. Lenoski, J. Laudon, K. Gharachorloo, A. Gupta, and J. Hennessy. The Directory-Based Cache Coherence Protocol for the DASH Multiprocessor. Technical Report CSL-TR-89-404, Stanford University, December 1989.
|
 |
11
|
|
| |
12
|
|
| |
13
|
H. E. Mizrahi, J. L. Baer, E. D. Lazowska, andJ. Zahorjan. Extending the Memory Hierarchy into Multiprocessor Interconnection Networks: A Performance Analysis. In Proceedings of the 1989 International Conference on Parallel Processing, volume I, pages 41-50, August 1989.
|
| |
14
|
A. Norton and G. F. Pfister. A Methodology for Predicting Multiprocessor Performance. In Proceedings of the 1985 International Confer. ence on Parallel Processing, pages 772-781, 1985.
|
| |
15
|
G. Pfister, W. Brantley, D. George, S. Harvey, W. Kleinfelder, K. McAuliffe, E. Melton, A. Norton, and J. Weiss. The IBM Research Parallel Processor Prototype (RP3): Introduction and Architecture. In Proceedings of the 1985 International Conference on Parallel Processing, pages 764--771, 1985.
|
| |
16
|
R. Simoni. Implementing a Directory-Based Cache Consistency Protocol. Unpublished report, July 1988.
|
| |
17
|
J. Torrellas, T. Weil, and J. Hennessy. A Methodology for Modeling Interprocessor Traffic in Shared Memory Multiprocessors. Stanford University Technical Report No. CSL-TR-89-385, July 1989.
|
| |
18
|
|
| |
19
|
W. Whitt. The Queueing Network Analyzer. In The BELL System Technical Journal, volume 62, N.9, November 1983.
|
 |
20
|
|
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE Design Automation Conference on
Gwo-Dong Chen
, Daniel D. Gajski
|