| Advanced collective communication in aspen |
| Full text |
Pdf
(496 KB)
|
Source
|
International Conference on Supercomputing
archive
Proceedings of the 22nd annual international conference on Supercomputing
table of contents
Island of Kos, Greece
SESSION: Communication & synchronization 1
table of contents
Pages 83-93
Year of Publication: 2008
ISBN:978-1-60558-158-3
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 5, Downloads (12 Months): 59, Citation Count: 0
|
|
|
ABSTRACT
Aspen is a programming language that relies on high-level messaging to support communication among different program tasks executing in parallel. Unlike MPI, the computational logic of Aspen tasks is specified and developed independently of the global communication structure of the program. A root module specifies the communication structure of the program. The semantics and generality of these specifications enable novel forms of collective communication, including asynchronous and concurrent collective operations and reduction type operations with subsets of the participants being receivers of the reduced data, and with receivers that do not provide data to the reduction. This paper describes efficient implementations of these and other collective communication operations in Aspen. We demonstrate the ease-of-use of these features using several code examples and quantify their performance impact through both microbenchmarks and a quantum chemistry code used in rubber chemistry. Aspen's performance is competitive with, or slightly better than, the performance of MPI implementations for both the chemistry application and the microbenchmarks.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
29th TOP500 List, June 2007.
|
| |
2
|
|
 |
3
|
Robert D. Blumofe , Christopher F. Joerg , Bradley C. Kuszmaul , Charles E. Leiserson , Keith H. Randall , Yuli Zhou, Cilk: an efficient multithreaded runtime system, Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming, p.207-216, July 19-21, 1995, Santa Barbara, California, United States
|
| |
4
|
|
 |
5
|
Ian Buck , Tim Foley , Daniel Horn , Jeremy Sugerman , Kayvon Fatahalian , Mike Houston , Pat Hanrahan, Brook for GPUs: stream computing on graphics hardware, ACM SIGGRAPH 2004 Papers, August 08-12, 2004, Los Angeles, California
|
| |
6
|
Brendan Burns , Kevin Grimaldi , Alexander Kostadinov , Emery D. Berger , Mark D. Corner, Flux: a language for programming high-performance servers, Proceedings of the annual conference on USENIX '06 Annual Technical Conference, p.13-13, May 30-June 03, 2006, Boston, MA
|
 |
7
|
|
| |
8
|
|
 |
9
|
Ernie Chan , Robert van de Geijn , William Gropp , Rajeev Thakur, Collective communication on architectures that support simultaneous communication over multiple links, Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming, March 29-31, 2006, New York, New York, USA
[doi> 10.1145/1122971.1122975]
|
| |
10
|
M. S. DeBergalis. A parallel file I/O API for Cilk. Master's thesis, Massachusetts Institute of Technology Department of Electrical Engineering and Computer Science, May 2000.
|
| |
11
|
A. Goyal, J. Cao, P. Patkar, G. Medvedev, S. P. Midkiff, V. Venkatasubramanian, and J. M. Caruthers. Population balance kinetic model for interaction of 2-bisbenzothiazole-2-2 disulfide (mbts) with sulfur. Rubber Chemistry and Technology , 2007. In press.
|
| |
12
|
J. Gurd and W. Bohm. Implicit parallel processing: SISAL on the Manchester dataflow computer. In Proceedings of the IBM-Europe Institute on Parallel Programming, Aug. 1987.
|
| |
13
|
|
 |
14
|
|
| |
15
|
J. Hoeflinger. Extending OpenMP to clusters, 2006.
|
| |
16
|
|
 |
17
|
|
 |
18
|
|
| |
19
|
|
 |
20
|
|
| |
21
|
Jelena Pješivac-Grbović , Thara Angskun , George Bosilca , Graham E. Fagg , Edgar Gabriel , Jack J. Dongarra, Performance analysis of MPI collective operations, Cluster Computing, v.10 n.2, p.127-143, June 2007
[doi> 10.1007/s10586-007-0012-0]
|
| |
22
|
R. Rabenseifner. Optimization of Collective Reduction Operations. In Proceedings of the International Conference on Computational Science , June 2004.
|
| |
23
|
R. Thakur and W. Gropp. Improving the performance of collective operations in mpich. In 10th European PVM/MPI Users Group Conference (Euro PVN/MPI 2003) , September 2003.
|
 |
24
|
|
| |
25
|
UPC Consortium. UPC Language Specification (Version 1.2), June 2005.
|
| |
26
|
N. Harvey and J. Morris. NL: A general purpose visual dataflow language. Australian Computer Journal, 12(1):2--12, 199622, 2007.
|
 |
27
|
|
 |
28
|
|
| |
29
|
|
 |
30
|
William Thies , Michal Karczmarek , Janis Sermulins , Rodric Rabbah , Saman Amarasinghe, Teleport messaging for distributed stream programs, Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of parallel programming, June 15-17, 2005, Chicago, IL, USA
[doi> 10.1145/1065944.1065975]
|
 |
31
|
P. H. Welch, An OCCAM approach to transputer engineering, Proceedings of the third conference on Hypercube concurrent computers and applications: Architecture, software, computer systems, and general issues, p.138-147, January 19-20, 1988, Pasadena, California, United States
[doi> 10.1145/62297.62315]
|
 |
32
|
Matt Welsh , David Culler , Eric Brewer, SEDA: an architecture for well-conditioned, scalable internet services, Proceedings of the eighteenth ACM symposium on Operating systems principles, October 21-24, 2001, Banff, Alberta, Canada
|
| |
33
|
|
| |
34
|
Jun Cao and Ayush Goyal and Samuel P. Midkiff and James M. Caruthers An Optimizing Compiler for Parallel Chemistry Simulations In 21th International Parallel and Distributed Processing Symposium (IPDPS 2007).
|
|