| An HPF compiler for the IBM SP2 |
| Full text |
Html
(6 KB)
|
| Source
|
Conference on High Performance Networking and Computing
archive
Proceedings of the 1995 ACM/IEEE conference on Supercomputing (CDROM)
table of contents
San Diego, California, United States
Article No. 71
Year of Publication: 1995
ISBN:0-89791-816-9
|
|
Authors
|
|
Manish Gupta
|
IBM T.J. Watson Research, P.O. Box 704, Yorktown Heights, NY
|
|
Sam Midkiff
|
IBM T.J. Watson Research, P.O. Box 704, Yorktown Heights, NY
|
|
Edith Schonberg
|
IBM T.J. Watson Research, P.O. Box 704, Yorktown Heights, NY
|
|
Ven Seshadri
|
IBM Software Solutions Division, 1150 Eglinton Ave. East, North York, Ontario, Canada, M3C 1V7
|
|
David Shields
|
IBM T.J. Watson Research, P.O. Box 704, Yorktown Heights, NY
|
|
Ko-Yang Wang
|
IBM T.J. Watson Research, P.O. Box 704, Yorktown Heights, NY
|
|
Wai-Mee Ching
|
IBM T.J. Watson Research, P.O. Box 704, Yorktown Heights, NY
|
|
Ton Ngo
|
IBM T.J. Watson Research, P.O. Box 704, Yorktown Heights, NY
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 0, Downloads (12 Months): 6, Citation Count: 29
|
|
|
ABSTRACT
We describe pHPF, an research prototype HPF compiler for the IBM SP series parallel machines. The compiler accepts as input Fortran 90 and Fortran 77 programs, augmented with HPF directives; sequential loops are automatically parallelized. The compiler supports symbolic analysis of expressions. This allows parameters such as the number of processors to be unknown at compile-time without significantly affecting performance. Communication schedules and computation guards are generated in a parameterized form at compile-time. Several novel optimizations and improved versions of well-known optimizations have been implemented in pHPF to exploit parallelism and reduce communication costs. These optimizations include elimination of redundant communication using data-availability analysis; using collective communication; new techniques for mapping scalar variables; coarse-grain wavefronting; and communication reduction in multi-dimensional shift communications. We present experimental results for some well-known benchmark routines. The results show the effectiveness of the compiler in generating efficient code for HPF programs.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Prithviraj Banerjee , John A. Chandy , Manish Gupta , Eugene W. Hodges IV , John G. Holm , Antonio Lain , Daniel J. Palermo , Shankar Ramaswamy , Ernesto Su, The Paradigm Compiler for Distributed-Memory Multicomputers, Computer, v.28 n.10, p.37-47, October 1995
[doi> 10.1109/2.467577]
|
| |
3
|
|
| |
4
|
T. Brandes. ADAPTOR: A compilation system for data-parallel Fortran programs. In C. W. Kessler, editor, Automatic parallelization -- new approaches to code generation, data distribution, and performance prediction. Vieweg Advanced Studies in Computer Science, Vieweg, Wiesbaden, January 1994.
|
 |
5
|
|
 |
6
|
|
| |
7
|
ANSI Fortran 90 Standard Committee. Fortran 90, 1990. ANSI standard X3.198-199x, which is identical to ISO standard ISO/IEC 1539:1991.
|
| |
8
|
High Performance Fortran Forum. High Performance Fortran language specification, version 1.0. Technical Report CRPC-TR92225, Rice University, May 1993.
|
 |
9
|
|
| |
10
|
|
 |
11
|
|
| |
12
|
Doug Kimelman , Pradeep Mittal , Edith Schonberg , Peter F. Sweeney , Ko-Yang Wang , Dror Zernik, Visualizing the execution of High Performance Fortran (HPF) programs, Proceedings of the 9th International Symposium on Parallel Processing, p.750-759, April 25-28, 1995
|
| |
13
|
Charles H. Koelbel , David B. Loveman , Robert S. Schreiber , Guy L. Steele, Jr. , Mary E. Zosel, The high performance Fortran handbook, MIT Press, Cambridge, MA, 1994
|
| |
14
|
|
| |
15
|
J.M. Levesque. Applied Parallel Research's xHPF system. IEEE Parallel & Distributed Technologies, page 71, Fall 1994.
|
| |
16
|
|
 |
17
|
|
| |
18
|
V.J. Schuster. PGHPF from The Portland Group. IEEE Parallel & Distributed Technologies, page 72, Fall 1994.
|
| |
19
|
E. Su, D. J. Palermo, and P. Banerjee. Automating parallelization of regular computations for distributed memory multicomputers in the PARADIGM compiler. In Proc. 1993 International Conference on Parallel Processing, St. Charles, IL, August 1993.
|
| |
20
|
R. Thakur, R. Bordawekar, and A. Choudhary. Compiler and runtime support for out-of-core HPF programs. Technical Report SCCS-597, NPAC, Syracuse University, 1994.
|
 |
21
|
|
| |
22
|
|
| |
23
|
H. Zima and B. Chapman. Compiling for distributed-memory systems. Proceedings of the IEEE, 81-13(2):264--287, Feb 1993.
|
CITED BY 29
|
|
|
|
|
Gerald Roth , John Mellor-Crummey , Ken Kennedy , R. Gregg Brickner, Compiling stencils in high performance Fortran, Proceedings of the 1997 ACM/IEEE conference on Supercomputing (CDROM), p.1-20, November 15-21, 1997, San Jose, CA
|
|
|
Bradford L. Chamberlain , Steven J. Deitz , Lawrence Snyder, A comparative study of the NAS MG benchmark across parallel languages and architectures, Proceedings of the 2000 ACM/IEEE conference on Supercomputing (CDROM), p.46-es, November 04-10, 2000, Dallas, Texas, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
B. Di Martino , S. Briguglio , M. Celino , G. Fogaccia , G. Vlad , V. Rosato , M. Briscolini, Development of large scale high performance applications with a parallelizing compiler, Practical parallel computing, Nova Science Publishers, Inc., Commack, NY, 2001
|
|
|
Jack Dongarra , Ian Foster , Geoffrey Fox , William Gropp , Ken Kennedy , Linda Torczon , Andy White, References, Sourcebook of parallel computing, Morgan Kaufmann Publishers Inc., San Francisco, CA, 2003
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Shuo Yang , Ali R. Butt , Y. Charlie Hu , Samuel P. Midkiff, Trust but verify: monitoring remotely executing programs for progress and correctness, Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of parallel programming, June 15-17, 2005, Chicago, IL, USA
|
|
|
|
|
|
Seung-Jai Min , Rudolf Eigenmann, Combined compile-time and runtime-driven, pro-active data movement in software DSM systems, Proceedings of the 7th workshop on Workshop on languages, compilers, and run-time support for scalable systems, p.1-6, October 22-23, 2004, Houston, Texas
|
|
|
|
|
|
|
|
|
Christopher Barton , CĆlin Casçaval , George Almási , Yili Zheng , Montse Farreras , Siddhartha Chatterje , José Nelson Amaral, Shared memory programming for large scale machines, ACM SIGPLAN Notices, v.41 n.6, June 2006
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Anthony Danalis , Lori Pollock , Martin Swany , John Cavazos, MPI-aware compiler optimizations for improving communication-computation overlap, Proceedings of the 23rd international conference on Supercomputing, June 08-12, 2009, Yorktown Heights, NY, USA
|
|