LoGPC: modeling network contention in message-passing programs
Source
Joint International Conference on Measurement and Modeling of Computer Systems
Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Madison, Wisconsin, United States
Pages: 254-263
Year of Publication: 1998
ISBN: 0-89791-982-3
Authors
Csaba Andras Moritz, Laboratory for Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts
Matthew I. Frank, Laboratory for Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts
ABSTRACT
In many real applications, for example those with frequent and irregular communication patterns or those using large messages, network contention and contention for message processing resources can be a significant part of the total execution time. This paper presents a new cost model, called LoGPC, that extends the LogP [9] and LogGP [4] models to account for the impact of network contention and network interface DMA behavior on the performance of message-passing programs. We validate LoGPC by analyzing three applications implemented with Active Messages [11, 18] on the MIT Alewife multiprocessor. Our analysis shows that network contention accounts for up to 50% of the total execution time. In addition, we show that the impact of communication locality on the communication costs is at most a factor of two on Alewife. Finally, we use the model to identify tradeoffs between synchronous and asynchronous message-passing styles.
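For orientation, the two baseline models that LoGPC extends can be sketched as follows. This is a minimal illustration and not the LoGPC model itself: it uses the standard LogP parameters (latency L, overhead o, gap g, processor count P) plus the LogGP per-byte gap G, and computes only the contention-free end-to-end time of a single message. The network-contention and DMA terms that LoGPC adds are not given in the abstract and are deliberately left out. The class name, function names, and numeric values are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LogGPParams:
    """Standard LogP/LogGP parameters (time units are arbitrary but consistent)."""
    L: float  # network latency for a short message
    o: float  # processor overhead to send or receive one message
    g: float  # minimum gap between consecutive short messages
    G: float  # gap per byte for long messages (LogGP extension)
    P: int    # number of processors


def logp_short_message_time(p: LogGPParams) -> float:
    """Contention-free end-to-end time of one short message under LogP.

    Send overhead + network latency + receive overhead.
    """
    return p.o + p.L + p.o


def loggp_long_message_time(p: LogGPParams, k: int) -> float:
    """Contention-free end-to-end time of one k-byte message under LogGP.

    Send overhead, then (k - 1) per-byte gaps, then latency for the
    last byte, then receive overhead. No contention term is modeled.
    """
    if k < 1:
        raise ValueError("message length k must be at least 1 byte")
    return p.o + (k - 1) * p.G + p.L + p.o


# Hypothetical parameter values, chosen only to make the arithmetic easy to follow.
params = LogGPParams(L=10.0, o=2.0, g=4.0, G=0.5, P=32)
short_time = logp_short_message_time(params)
long_time = loggp_long_message_time(params, k=101)
```

With these illustrative values, a one-byte LogGP message costs the same as a LogP short message, and the cost grows linearly in the message length through G. A contention-aware model such as LoGPC would add further delay on top of these baseline figures.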
REFERENCES
[2] Anant Agarwal, Ricardo Bianchini, David Chaiken, Kirk L. Johnson, David Kranz, John Kubiatowicz, Beng-Hong Lim, Kenneth Mackenzie, Donald Yeung, The MIT Alewife machine: architecture and performance, Proceedings of the 22nd annual international symposium on Computer architecture, p.2-13, June 22-24, 1995, S. Margherita Ligure, Italy
[4] Albert Alexandrov, Mihai F. Ionescu, Klaus E. Schauser, Chris Scheiman, LogGP: incorporating long messages into the LogP model - one step closer towards a realistic model for parallel computation, Proceedings of the seventh annual ACM symposium on Parallel algorithms and architectures, p.95-105, June 24-26, 1995, Santa Barbara, California, United States. doi:10.1145/215399.215427
[5] C. Andras Moritz, K. Al-Tawil, B. Fraguela Rodriguez, "MPI performance comparison on MPP and workstation clusters," appears in Proc. of the 10th Int. Conf. on Parallel and Distributed Computing, New Orleans, October 1997.
[6] G. Chochia, C. Boeres, P. Thanisch, "Analysis of Multicomputer Schedules in Cost and Latency Model of Communication," Abstract Machine Workshop, 1996.
[8] A. Krishnamurthy, D. E. Culler, A. Dusseau, S. C. Goldstein, S. Lumetta, T. von Eicken, K. Yelick, Parallel programming in Split-C, Proceedings of the 1993 ACM/IEEE conference on Supercomputing, p.262-273, December 1993, Portland, Oregon, United States. doi:10.1145/169627.169724
[9] David Culler, Richard Karp, David Patterson, Abhijit Sahay, Klaus Erik Schauser, Eunice Santos, Ramesh Subramonian, Thorsten von Eicken, LogP: towards a realistic model of parallel computation, Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming, p.1-12, May 19-22, 1993, San Diego, California, United States
[10] D. Culler, L. Liu, R. Martin, and C. Yoshikawa, "LogP Performance Assessment of Fast Network Interfaces," Technical Report, Computer Science Division, University of California, Berkeley, November 1995 (available on the Web).
[11] Thorsten von Eicken, David E. Culler, Seth Copen Goldstein, Klaus Erik Schauser, Active messages: a mechanism for integrated communication and computation, Proceedings of the 19th annual international symposium on Computer architecture, p.256-266, May 19-21, 1992, Queensland, Australia
[13] Chris Holt, Mark Heinrich, Jaswinder P. Singh, Edward Rothberg, John Hennessy, The Effects of Latency, Occupancy, and Bandwidth in Distributed Shared Memory Multiprocessors, Stanford University, Stanford, CA, 1995
[14] Matthew I. Frank, Anant Agarwal, Mary K. Vernon, LoPC: modeling contention in parallel algorithms, Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming, p.276-287, June 18-21, 1997, Las Vegas, Nevada, United States
[15] K. Keeton, T. Anderson, and D. Patterson, "LogP Quantified: The Case for Low-Overhead Local Area Networks," Hot Interconnects III: A Symp. on High Performance Interconnects, Stanford University, Stanford, CA, Aug. 10-12, 1995.
[17] C. P. Kruskal and M. Snir, "The performance of multistage interconnection networks for multiprocessors," IEEE Trans. Comput., Vol. 37, pp. 1091-1098, Dec. 1983.
[18] K. Mackenzie, J. Kubiatowicz, M. Frank, W. Lee, V. Lee, A. Agarwal, M. Kaashoek, Exploiting Two-Case Delivery for Fast Protected Messaging, Proceedings of the 4th International Symposium on High-Performance Computer Architecture, p.231, January 31-February 04, 1998
[19] N. K. Madsen, "Divergence preserving discrete surface integral methods for Maxwell's curl equations using nonorthogonal unstructured grids," Technical Report 92.04, RIACS, February 1992.
[20] Richard P. Martin, Amin M. Vahdat, David E. Culler, Thomas E. Anderson, Effects of communication latency, overhead, and bandwidth in a cluster architecture, Proceedings of the 24th annual international symposium on Computer architecture, p.85-97, June 01-04, 1997, Denver, Colorado, United States
CITED BY 4
Jens Mache, Virginia Lo, Marilynn Livingston, Sharad Garg, The impact of spatial layout of jobs on parallel I/O performance, Proceedings of the sixth workshop on I/O in parallel and distributed systems, p.45-56, May 05, 1999, Atlanta, Georgia, United States