|
ABSTRACT
We present MRNet, a software-based multicast/reduction network for building scalable performance and system administration tools. MRNet supports multiple simultaneous, asynchronous collective communication operations. MRNet is flexible, allowing tool builders to tailor its process network topology to suit their tool's requirements and the underlying system's capabilities. MRNet is extensible, allowing tool builders to incorporate custom data reductions to augment its collection of built-in reductions. We evaluated MRNet in a simple test tool and also integrated into an existing, real-world performance tool with up to 512 tool back-ends. In the real-world tool, we used MRNet not only for multicast and simple data reductions but also with custom histogram and clock skew detection reductions. In our experiments, the MRNet-based tools showed significantly better performance than the tools without MRNet for average message latency and throughput, overall tool start-up latency, and performance data processing throughput.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
[1] Advanced Simulation and Computing program, National Nuclear Security Administration, United States of America Department of Energy. ¿http://www.nnsa.doe.gov/asc/home.htm¿, February 6, 2003.
|
| |
2
|
|
| |
3
|
[3] Susanne M. Balle. Personal communication, November 2002.
|
| |
4
|
|
| |
5
|
[5] M. Bernaschi and G. Iannello. Collective Communication Operations: Experimental Results vs. Theory. Concurrency: Practice and Experience 10, 5, April 1998, pp. 359-386.
|
| |
6
|
|
| |
7
|
[7] Center for Computational Research, University at Buffalo, The State University of New York. ¿http://www.ccr.buffalo.edu¿, February 6, 2003.
|
 |
8
|
David E. Culler , Richard M. Karp , David Patterson , Abhijit Sahay , Eunice E. Santos , Klaus Erik Schauser , Ramesh Subramonian , Thorsten von Eicken, LogP: a practical model of parallel computation, Communications of the ACM, v.39 n.11, p.78-85, Nov. 1996
[doi> 10.1145/240455.240477]
|
| |
9
|
[9] Earth Simulator Center. ¿http://www.es.jamstec.go.jp¿, February 6, 2003.
|
| |
10
|
[10] Etnus LLC, "TotalView User's Guide", Document version 6.0.0-1, January 2003. ¿http://www.etnus.com¿
|
| |
11
|
|
| |
12
|
[12] D.A. Evensky. Personal communication, November 2001.
|
| |
13
|
[13] Forecast Systems Laboratory, National Oceanic and Atmospheric Administration. ¿http://hpcs.fsl.noaa.gov¿, Feb 6, 2003.
|
| |
14
|
Jim Gray , Surajit Chaudhuri , Adam Bosworth , Andrew Layman , Don Reichart , Murali Venkatrao , Frank Pellow , Hamid Pirahesh, Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals, Data Mining and Knowledge Discovery, v.1 n.1, p.29-53, 1997
[doi> 10.1023/A:1009726021843]
|
| |
15
|
J. K. Hollingsworth , O. Niam , B. P. Miller , Zhichen Xu , M. J. R. Goncalves , Ling Zheng, MDL: A Language And Compiler For Dynamic Program Instrumentation, Proceedings of the 1997 International Conference on Parallel Architectures and Compilation Techniques, p.201, November 11-15, 1997
|
 |
16
|
Richard M. Karp , Abhijit Sahay , Eunice E. Santos , Klaus Erik Schauser, Optimal broadcast and summation in the LogP model, Proceedings of the fifth annual ACM symposium on Parallel algorithms and architectures, p.142-153, June 30-July 02, 1993, Velen, Germany
[doi> 10.1145/165231.165250]
|
 |
17
|
|
| |
18
|
[18] Lawrence Livermore National Laboratory. Multiprogrammatic Capability Cluster. ¿http://www.llnl.gov/linux/mcr¿, February 6, 2003.
|
| |
19
|
[19] Lawrence Livermore National Laboratory. Using ASCI Blue Pacific. ¿http://www.llnl.gov/asci/platforms/bluepac¿, February 13, 2003.
|
 |
20
|
|
| |
21
|
[21] M.L. Massie, B.N. Chun, and D.E. Culler. The Ganglia Distributed Monitoring System: Design, Implementation, and Experience. University of California, Berkeley Technical Report, ¿http://ganglia.sourceforge.net/talks/parallel_computi ng/ganglia-twocol.pdf¿, February 2003.
|
| |
22
|
[22] Message Passing Interface Forum. MPI: A Message Passing Interface Standard. International Journal of Supercomputing Applications 8, 3/4, Fall/Winter 1994.
|
| |
23
|
Barton P. Miller , Mark D. Callaghan , Jonathan M. Cargille , Jeffrey K. Hollingsworth , R. Bruce Irvin , Karen L. Karavanic , Krishna Kunchithapadam , Tia Newhall, The Paradyn Parallel Performance Measurement Tool, Computer, v.28 n.11, p.37-46, November 1995
[doi> 10.1109/2.471178]
|
 |
24
|
|
| |
25
|
|
| |
26
|
[26] UoE HPCX Ltd. ¿http://www.hpcx.ac.uk¿, February 6, 2003.
|
| |
27
|
Sathish S. Vadhiyar , Graham E. Fagg , Jack Dongarra, Automatically tuned collective communications, Proceedings of the 2000 ACM/IEEE conference on Supercomputing (CDROM), p.3-es, November 04-10, 2000, Dallas, Texas, United States
|
| |
28
|
|
CITED BY 14
|
|
|
|
|
|
|
|
Jyothish Varma , Chao Wang , Frank Mueller , Christian Engelmann , Stephen L. Scott, Scalable, fault tolerant membership for MPI tasks on HPC systems, Proceedings of the 20th annual international conference on Supercomputing, June 28-July 01, 2006, Cairns, Queensland, Australia
|
|
|
Martin Schulz , Dong Ahn , Andrew Bernat , Bronis R. de Supinski , Steven Y. Ko , Gregory Lee , Barry Rountree, Scalable dynamic binary instrumentation for Blue Gene/L, ACM SIGARCH Computer Architecture News, v.33 n.5, December 2005
|
|
|
|
|
|
|
|
|
Gregory L. Lee , Martin Schulz , Dong H. Ahn , Andrew Bernat , Bronis R. de Supinskil , Steven Y. Ko , Barry Rountree, Dynamic binary instrumentation and data aggregation on large scale systems, International Journal of Parallel Programming, v.35 n.3, p.207-232, June 2007
|
|
|
|
|
|
|
|
|
Martin Schulz , Jim Galarowicz , Don Maghrak , William Hachfeld , David Montoya , Scott Cranford, Open | SpeedShop: An open source infrastructure for parallel performance analysis, Scientific Programming, v.16 n.2-3, p.105-121, April 2008
|
|
|
Todd Gamblin , Bronis R. de Supinski , Martin Schulz , Rob Fowler , Daniel A. Reed, Scalable load-balance measurement for SPMD codes, Proceedings of the 2008 ACM/IEEE conference on Supercomputing, November 15-21, 2008, Austin, Texas
|
|
|
Gregory L. Lee , Dong H. Ahn , Dorian C. Arnold , Bronis R. de Supinski , Matthew Legendre , Barton P. Miller , Martin Schulz , Ben Liblit, Lessons learned at 208K: towards debugging millions of cores, Proceedings of the 2008 ACM/IEEE conference on Supercomputing, November 15-21, 2008, Austin, Texas
|
|
|
|
|
|
Michael Noeth , Prasun Ratn , Frank Mueller , Martin Schulz , Bronis R. de Supinski, ScalaTrace: Scalable compression and replay of communication traces for high-performance computing, Journal of Parallel and Distributed Computing, v.69 n.8, p.696-710, August, 2009
|
|