Parallel Iterative Solvers of GeoFEM with Selective Blocking Preconditioning for Nonlinear Contact Problems on the Earth Simulator
Source Conference on High Performance Networking and Computing archive
Proceedings of the 2003 ACM/IEEE conference on Supercomputing table of contents
Page: 13  
Year of Publication: 2003
ISBN:1-58113-695-1
Author
Kengo Nakajima  Research Organization for Information Science and Technology (RIST), Tokyo, Japan
Sponsor
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
IEEE Computer Society  Washington, DC, USA

ABSTRACT

An efficient parallel iterative method with selective blocking preconditioning has been developed for symmetric multiprocessor (SMP) cluster architectures with vector processors, such as the Earth Simulator. The method is based on a three-level hybrid parallel programming model: message passing for communication between SMP nodes, OpenMP loop directives for parallelization within each SMP node, and vectorization on each processing element (PE). It provides robust and smooth convergence and excellent vector and parallel performance in 3D geophysical simulations with contact conditions performed on the Earth Simulator. Selective blocking preconditioning is much more efficient than ILU(1) and ILU(2). For the complicated Southwest Japan model, with more than 23 million degrees of freedom (DOF), on 10 SMP nodes (80 PEs) of the Earth Simulator, performance was 161.7 GFLOPS (25.3% of peak) for the hybrid programming model and 190.4 GFLOPS (29.8% of peak) for flat MPI.
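
The percent-of-peak figures quoted in the abstract can be checked with a short calculation. The sketch below assumes a peak of 8 GFLOPS per Earth Simulator PE, a figure the abstract itself does not state; the function name `percent_of_peak` is hypothetical and used only for illustration. Under that assumption, 80 PEs give an aggregate peak of 640 GFLOPS, and the two measured rates come out close to the quoted 25.3% and 29.8%.

```python
# Assumption (not stated in the abstract): each Earth Simulator PE
# has a peak of 8 GFLOPS, so 80 PEs give 640 GFLOPS aggregate peak.
PEAK_GFLOPS_PER_PE = 8.0


def percent_of_peak(measured_gflops: float, num_pes: int,
                    peak_per_pe: float = PEAK_GFLOPS_PER_PE) -> float:
    """Return measured performance as a percentage of aggregate peak."""
    aggregate_peak = num_pes * peak_per_pe
    return 100.0 * measured_gflops / aggregate_peak


# Measured rates and PE count are taken from the abstract.
hybrid = percent_of_peak(161.7, 80)    # OpenMP/MPI hybrid model
flat_mpi = percent_of_peak(190.4, 80)  # flat MPI

print(f"hybrid:   {hybrid:.2f}% of peak")
print(f"flat MPI: {flat_mpi:.2f}% of peak")
```

On the same 80 PEs, the ratio 190.4 / 161.7 shows that flat MPI ran roughly 18% faster than the hybrid model for this problem.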

