ABSTRACT
This paper considers modular programming with hierarchically structured multi-processor tasks on top of SPMD tasks for distributed memory machines. Parallel execution requires a corresponding decomposition of the set of processors into a hierarchical group structure onto which the tasks are mapped. This results in a multi-level group SPMD computation model with varying processor group structures. The advantage of this kind of mixed task and data parallelism is its potential to reduce communication overhead and to increase scalability. We present a runtime library that supports the coordination of hierarchically structured multi-processor tasks. The library exploits an extended parallel group SPMD programming model and manages the entire task execution, including the dynamic hierarchy of processor groups. The library is built on top of MPI, has an easy-to-use interface, and incurs only marginal overhead while allowing both static planning and dynamic restructuring.
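The hierarchical decomposition of processors described in the abstract can be illustrated with a small sketch. This is not the library's interface, which the abstract does not specify. The names `Group`, `split`, and `path_sizes` are hypothetical, and the proportional-weight splitting rule is an assumption chosen only for illustration. The sketch models processor ranks as plain integers rather than MPI communicators.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Group:
    """Hypothetical processor group: a list of ranks plus its subgroups."""
    ranks: List[int]
    children: List["Group"] = field(default_factory=list)


def split(group: Group, weights: List[int]) -> List[Group]:
    """Partition group.ranks into disjoint contiguous subgroups.

    Subgroup sizes are roughly proportional to `weights` (an assumed
    policy). A subgroup may be empty if its weight is small relative
    to the number of ranks available.
    """
    total = sum(weights)
    n = len(group.ranks)
    children, start, acc = [], 0, 0
    for w in weights:
        acc += w
        end = (acc * n) // total  # cumulative boundary; last one equals n
        children.append(Group(group.ranks[start:end]))
        start = end
    group.children = children
    return children


def path_sizes(group: Group, rank: int) -> List[int]:
    """Sizes of the nested groups containing `rank`, from root to leaf."""
    sizes = [len(group.ranks)]
    for child in group.children:
        if rank in child.ranks:
            return sizes + path_sizes(child, rank)
    return sizes


if __name__ == "__main__":
    world = Group(list(range(8)))      # all processors
    left, right = split(world, [1, 1])  # two concurrent multi-processor tasks
    split(left, [1, 3])                 # second level inside the left group
    print(path_sizes(world, 2))
```

In an MPI setting, each `Group` would plausibly correspond to a communicator obtained by splitting its parent, so that an SPMD task mapped to a group communicates only within that group.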