Scaling irregular parallel codes with minimal programming effort
Conference on High Performance Networking and Computing
Proceedings of the 2001 ACM/IEEE conference on Supercomputing (CDROM)
Denver, Colorado
Pages: 16 - 16
Year of Publication: 2001
ISBN:1-58113-293-X
ABSTRACT
The long-foreseen goal of parallel programming models is to scale parallel code without significant programming effort. Irregular parallel applications are a particularly challenging domain for parallel programming models, since they require domain-specific data distribution and load balancing algorithms. From a performance perspective, shared-memory models still fall short of scaling as well as message-passing models in irregular applications, although they require less coding effort. We present a simple runtime methodology for scaling irregular applications parallelized with the standard OpenMP interface. We claim that our parallelization methodology requires the minimum amount of effort from the programmer, and we show experimentally that it scales two highly irregular codes as well as MPI does, with an order of magnitude less programming effort. This is probably the first time such a result has been obtained with OpenMP, and it is obtained while keeping the OpenMP API intact.
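For readers unfamiliar with the term, the following is a minimal sketch of what an "irregular" loop parallelized with standard OpenMP can look like. It is an illustration only, not the runtime methodology presented in the paper. The function name `gather_rows`, the CSR-like layout (`row_start`, `idx`), and the chunk size of 64 are all assumptions made for this example. Rows have different lengths, so the work per iteration is uneven, and the reads of `x` go through an index array, so the memory access pattern is not known at compile time.

```c
#include <assert.h>
#include <stddef.h>

/*
 * Hypothetical example (not taken from the paper): for each row i, sum the
 * entries of x selected by the index array idx over the half-open range
 * [row_start[i], row_start[i + 1]).
 *
 * The loop is irregular in two ways: the amount of work per row varies,
 * and x is read through idx, so accesses are data dependent.
 */
void gather_rows(const double *x, const int *idx, const int *row_start,
                 int nrows, double *y)
{
    assert(nrows >= 0);

    /* A dynamic schedule hands out chunks of rows on demand, which can
     * help when row lengths differ. The chunk size 64 is an arbitrary
     * choice for this sketch. Without OpenMP support the pragma is
     * ignored and the loop runs serially with the same result. */
    #pragma omp parallel for schedule(dynamic, 64)
    for (int i = 0; i < nrows; i++) {
        double sum = 0.0;
        for (int j = row_start[i]; j < row_start[i + 1]; j++) {
            sum += x[idx[j]];   /* indirect, data-dependent read */
        }
        y[i] = sum;             /* each iteration writes a distinct y[i] */
    }
}
```

Each iteration writes only its own `y[i]`, so the loop needs no synchronization. The only source-level tuning knob is the `schedule` clause; where the pages of `x` reside in memory is left to the system, and that placement is the kind of concern the abstract refers to as data distribution.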