|
ABSTRACT
ARMI is a communication library that provides a framework for expressing fine-grain parallelism and mapping it to a particular machine using shared-memory and message passing library calls. The library is an advanced implementation of the RMI protocol and handles low-level details such as scheduling incoming communication and aggregating outgoing communication to coarsen parallelism when necessary. These details can be tuned for different platforms to allow user codes to achieve the highest performance possible without manual modification. ARMI is used by STAPL, our generic parallel library, to provide a portable, user transparent communication layer. We present the basic design as well as the mechanisms used in the current Pthreads/OpenMP, MPI implementations and/or a combination thereof. Performance comparisons between ARMI and explicit use of Pthreads or MPI are given on a variety of machines, including an HP V2200, SGI Origin 3800, IBM Regatta-HPC and IBM RS6000 SP cluster.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
Guy E. Blelloch , Charles E. Leiserson , Bruce M. Maggs , C. Greg Plaxton , Stephen J. Smith , Marco Zagha, A comparison of sorting algorithms for the connection machine CM-2, Proceedings of the third annual ACM symposium on Parallel algorithms and architectures, p.3-16, July 21-24, 1991, Hilton Head, South Carolina, United States
[doi> 10.1145/113379.113380]
|
| |
3
|
S. Bova, R. Eigenmann, H. Gabb, G. Gaertner, B. Kuhn, B. Magro, S. Salvini, and V. Vatsa. Combining message-passing and directives in parallel applications. SIAM News, 32(9):10--14, 1999.
|
| |
4
|
|
| |
5
|
|
| |
6
|
|
| |
7
|
|
| |
8
|
Madhusudhan Govindaraju , Aleksander Slominski , Venkatesh Choppella , Randall Bramley , Dennis Gannon, Requirements for and evaluation of RMI protocols for scientific computing, Proceedings of the 2000 ACM/IEEE conference on Supercomputing (CDROM), p.61-es, November 04-10, 2000, Dallas, Texas, United States
|
| |
9
|
|
| |
10
|
|
 |
11
|
William Mclendon, III , Bruce Hendrickson , Steve Plimpton , Lawrence Rauchwerger, Finding strongly connected components in parallel in particle transport sweeps, Proceedings of the thirteenth annual ACM symposium on Parallel algorithms and architectures, p.328-329, July 2001, Crete Island, Greece
[doi> 10.1145/378580.378751]
|
| |
12
|
B. Joy, G. Steele, J. Gosling, and G. Bracha. Java(TM) Language Specification (2nd Edition). Reading, MA: Addison-Wesley Pub Co, 2000.
|
 |
13
|
Laxmikant V. Kale , Sanjeev Krishnan, CHARM++: a portable concurrent object oriented system based on C++, Proceedings of the eighth annual conference on Object-oriented programming systems, languages, and applications, p.91-108, September 26-October 01, 1993, Washington, D.C., United States
|
| |
14
|
|
 |
15
|
|
| |
16
|
Message Passing Interface Forum. MPI: A Message-Passing Interface Standard, June 1995. www.mpi-forum.org.
|
| |
17
|
Message Passing Interface Forum. MPI-2: Extensions to the Message-Passing Interface, May 1998. www.mpi-forum.org.
|
| |
18
|
S. Microsystems. Java remote method invocation (RMI). http://java.sun.com/products/jdk/rmi/, 1995--2002.
|
| |
19
|
|
| |
20
|
|
 |
21
|
Dimitrios S. Nikolopoulos , Eduard Ayguadé , Theodore S. Papatheodorou , Constantine D. Polychronopoulos , Jesús Labarta, The trade-off between implicit and explicit data distribution in shared-memory programming paradigms, Proceedings of the 15th international conference on Supercomputing, p.23-37, June 2001, Sorrento, Italy
[doi> 10.1145/377792.377801]
|
| |
22
|
OpenMP Architecture Review Board. OpenMP - C and C++ Application Program Interface, October 1998. Document DN 004-2229-001, www.openmp.org.
|
| |
23
|
A. Ping, A. Jula, S. Rus, S. Saunders, T. Smith, G. Tanase, N. Thomas, N. Amato, and L. Rauchwerger. STAPL: An adaptive, generic parallel C++ library. In Int. Workshop on Languages and Compilers for Parallel Computing, 2001.
|
| |
24
|
|
 |
25
|
|
| |
26
|
L. Smith. Mixed mode MPI/OpenMP programming. UK High-End Computing Technology Report, http://www.ukhec.ac.uk/publications/, 2000.
|
| |
27
|
|
 |
28
|
Thorsten von Eicken , David E. Culler , Seth Copen Goldstein , Klaus Erik Schauser, Active messages: a mechanism for integrated communication and computation, Proceedings of the 19th annual international symposium on Computer architecture, p.256-266, May 19-21, 1992, Queensland, Australia
|
CITED BY 3
|
|
Nathan Thomas , Gabriel Tanase , Olga Tkachyshyn , Jack Perdue , Nancy M. Amato , Lawrence Rauchwerger, A framework for adaptive algorithm selection in STAPL, Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of parallel programming, June 15-17, 2005, Chicago, IL, USA
|
|
|
|
|
|
Gabriel Tanase , Mauro Bianco , Nancy M. Amato , Lawrence Rauchwerger, The STAPL pArray, Proceedings of the 2007 workshop on MEmory performance: DEaling with Applications, systems and architecture, p.73-80, September 16-16, 2007, Brasov, Romania
|
|