|
ABSTRACT
We present a new adaptive fast multipole algorithm and its parallel implementation. The algorithm is kernel-independent in the sense that the evaluation of pairwise interactions does not rely on any analytic expansions, but only utilizes kernel evaluations. The new method provides the enabling technology for many important problems in computational science and engineering. Examples include viscous flows, fracture mechanics and screened Coulombic interactions. Our MPI-based parallel implementation logically separates the computation and communication phases to avoid synchronization in the upward and downward computation passes, and thus allows us to fully exploit computation and communication overlapping. We measure isogranular and fixed-size scalability for a variety of kernels on the Pittsburgh Supercomputing Center's TCS-1 Alphaserver on up to 3000 processors. We have solved viscous flow problems with up to 2.1 billion unknowns and we have achieved 1.6 Tflops/s peak performance and 1.13 Tflops/s sustained performance.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
[2] Satish Balay, Kris Buschelman, William D. Gropp, Dinesh Kaushik, Lois Curfman McInnes, and Barry F. Smith. PETSc home page. http://www.mcs.anl.gov/petsc, 2001.
|
| |
3
|
[3] Guy Blelloch and Girija Narlikar. A practical comparison of n-body algorithms. In Parallel Algorithms, Series in Discrete Mathematics and Theoretical Computer Science. American Mathematical Society, 1997.
|
| |
4
|
|
| |
5
|
[5] Matteo Frigo and Steven G. Johnson. FFTW home page. http://www.fftw.org, 2000.
|
| |
6
|
[6] Yuhong Fu et al. A fast solution for three-dimensional many-particle problems of linear elasticity. International Journal for Numerical Methods in Engineering, 42:1215-1229, 1998.
|
| |
7
|
[7] Leslie Greengard. The Rapid Evaluation of Potential Fields in Particle Systems. MIT Press, Cambridge, MA, 1988.
|
| |
8
|
|
| |
9
|
[9] Leslie Greengard and Vladimir Rokhlin. A new version of the fast multipole method for the Laplace equation in three dimensions. Acta Numerica, pages 229-269, 1997.
|
| |
10
|
|
| |
11
|
|
| |
12
|
[12] Rainer Kress. Linear Integral Equations. Applied Mathematical Sciences. Springer, 1999.
|
| |
13
|
K. Nabors , F. T. Korsmeyer , F. T. Leighton , J. White, Preconditioned, adaptive, multipole-accelerated iterative methods for three-dimensional first-kind integral equations of potential theory, SIAM Journal on Scientific Computing, v.15 n.3, p.713-735, May 1994
[doi> 10.1137/0915046]
|
| |
14
|
[14] Aiichiro Nakano. Parallel multilevel preconditioned conjugate-gradient approach to variable-charge molecular dynamics. Computer Physics Communications, 104:59-69, 1997.
|
 |
15
|
Aiichiro Nakano , Rajiv K. Kalia , Priya Vashishta , Timothy J. Campbell , Shuji Ogata , Fuyuki Shimojo , Subhash Saini, Scalable atomistic simulation algorithms for materials research, Proceedings of the 2001 ACM/IEEE conference on Supercomputing (CDROM), p.1-1, November 10-16, 2001, Denver, Colorado
[doi> 10.1145/582034.582035]
|
 |
16
|
John J. Ottusch , Mark A. Stalzer , John L. Visher , Stephen M. Wandzura, Scalable electromagnetic scattering calculations on the SGI Origin 2000, Proceedings of the 1999 ACM/IEEE conference on Supercomputing (CDROM), p.54-es, November 14-19, 1999, Portland, Oregon, United States
[doi> 10.1145/331532.331586]
|
| |
17
|
[17] Nhan Phan-Thien, Ka Yan Lee, and David Tullock. Large scale simulation of suspensions with PVM. In Proceedings of SC97, The SCxy Conference series, San Jose, CA, November 1997. ACM/IEEE.
|
| |
18
|
James C. Phillips , Gengbin Zheng , Sameer Kumar , Laxmikant V. Kalé, NAMD: biomolecular simulation on thousands of processors, Proceedings of the 2002 ACM/IEEE conference on Supercomputing, p.1-18, November 16, 2002, Baltimore, Maryland
|
| |
19
|
[19] V. Popov and Henry Power. An O(N) taylor sereis multipole boundary element method for three-dimensional elasticity problems. Engineering Analysis with Boundary Elements, 25:7-18, 2001.
|
 |
20
|
|
 |
21
|
J. P. Singh , C. Holt , J. L. Hennessy , A. Gupta, A parallel adaptive fast multipole method, Proceedings of the 1993 ACM/IEEE conference on Supercomputing, p.54-65, December 1993, Portland, Oregon, United States
[doi> 10.1145/169627.169651]
|
| |
22
|
|
| |
23
|
|
 |
24
|
|
| |
25
|
[25] Lexing Ying, George Biros, and Denis Zorin. A kernel-independent fast multipole algorithm. Technical Report TR2003-839, Courant Institute, New York University, 2003. http://www.cs.nyu.edu/csweb/Research/TechReports/TR2003-839/TR2003- 839.pdf.
|
| |
26
|
[26] Kenichi Yoshida, Naoshi Nishimura, and Shoichi Kobayashi. Application of fast multipole Galerkin boundary integral equation method to elastostatic crack problems in 3D. International Journal for Numerical Methods in Engineering, 50(3):525.
|
CITED BY 5
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Hari Sundar , Rahul S. Sampath , Santi S. Adavani , Christos Davatzikos , George Biros, Low-constant parallel algorithms for finite element simulations using linear octrees, Proceedings of the 2007 ACM/IEEE conference on Supercomputing, November 10-16, 2007, Reno, Nevada
|
|