| An experimental study of methods for parallel preconditioned Krylov methods |
| Full text |
Pdf
(1.13 MB)
|
| Source
|
Hypercube Concurrent Computers and Applications
archive
Proceedings of the third conference on Hypercube concurrent computers and applications - Volume 2
table of contents
Pasadena, California, United States
Pages: 1698 - 1711
Year of Publication: 1989
ISBN:0-89791-278-0
|
|
Authors
|
|
D. Baxter
|
Department of Computer Science, Yale University, New Haven, CT
|
|
J. Saltz
|
Department of Computer Science, Yale University, New Haven, CT
|
|
M. Schultz
|
Department of Computer Science, Yale University, New Haven, CT
|
|
S. Eisenstat
|
Department of Computer Science, Yale University, New Haven, CT
|
|
K. Crowley
|
Department of Computer Science, Yale University, New Haven, CT
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 3, Downloads (12 Months): 11, Citation Count: 6
|
|
|
ABSTRACT
High performance multiprocessor architectures differ both in the number of processors, and in the delay costs for synchronization and communication. In order to obtain good performance on a given architecture for a given problem, adequate parallelization, good balance of load and an appropriate choice of granularity are essential.
We discuss the implementation of parallel version of PCGPAK for both shared memory architectures and hypercubes. Our parallel implementation is sufficiently efficient to allow us to complete the solution of our test problems on 16 processors of the Encore Multimax/320 in an amount of time that is a small multiple of that required by a single head of a Cray X/MP, despite the fact that the peak performance of the Multimax processors is not even close to the supercomputer range. We illustrate the effectiveness of our approach on a number of model problems from reservoir engineering and mathematics.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
A. Greenbaum, Solving Sparse Triangular Linear Systems Using Fortran with Pamlllel Eztensions on the NYU Ultmcomputer Prototype, Report 99, NYU Ultracomputer Note, April 1986.
|
| |
2
|
|
| |
3
|
D. M. Nicol and J. H. Saltz, Principles for Problem Aggregation and Assignment in Medium Scale Multiprocessors, Technical Report 87-39, ICASE, September 1987.
|
| |
4
|
Todd Dupont, Richard P. Kendall and H. H. Rachford Jr., An approximate factorization procedure for solving self-adjoint elliptic difference equations, SIAM Journal on NumericM Analysis, 5(1968), pp. 559-573.
|
| |
5
|
|
| |
6
|
|
| |
7
|
J. $altz, Automated Problem Scheduling and Reduction of Synchronization Delay Effects, Report 87-22, }CASE report, July 1987.
|
| |
8
|
J. A. Meijerink and H. A. van der Vorst, Guidelines for the usage of incomplete decompositions in solving sets of linear equations as occur in practical problems, Journal of Computational Physics, 44 (1981), pp. 134-155.
|
| |
9
|
Scientific Computing Associates, PC'GPAK: Benchmarks for the FPS and CRA~/XMP, Technical Report 111, Scientific Computing Associates, New Haven Connecticut, 1987.
|
| |
10
|
--------, PCGPAK: User's Guide, Technical Report 106, Scientific Computing Associates, New Haven Connecticut, 1984.
|
| |
11
|
|
| |
12
|
3oel Saltz, Ravi Mirchandaney, Roger M. Smith, David M. Nico{ and Kay Crowley, The Automated Crystal Runtime System: A Framework, Technical Report 588, Yale University Department of Computer Science, January 1988.
|
| |
13
|
Y. Saad, M. Schultz, Parallel Implementations of Preconditioned Conjugate Gradient Methods, Department of Computer Science YALEU/DCS/TR-425, Yale University, October 1985.
|
CITED BY 6
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
D. Baxter , R. Mirchandaney , J. H. Saltz, Run-time parallelization and scheduling of loops, Proceedings of the first annual ACM symposium on Parallel algorithms and architectures, p.303-312, June 18-21, 1989, Santa Fe, New Mexico, United States
|
|