ABSTRACT
Distributed memory parallel processors (DMPPs) have no hardware support for a global address space. However, conventional programs written in a sequential imperative language such as Fortran typically manipulate a few large arrays. The Oxygen compiler, developed as part of the K2 project, translates conventional Fortran code, augmented with code and data distribution directives, into C programs that include SEND/RECEIVE communication primitives. The compiler directives, which are either supplied by the user or, for simple programs, generated automatically, support a global name space through a run-time mechanism called data consistency analysis. In this paper we report the performance of seven parallel programs generated by Oxygen for three DMPPs: a Parsytec Supercluster, an iWARP, and the Fujitsu AP1000. All machines were configured as 8 × 8 tori.