|
ABSTRACT
In this article, we present Dendro, a suite of parallel algorithms for the discretization and solution of partial differential equations (PDEs) involving second-order elliptic operators. Dendro uses trilinear finite element discretizations constructed using octrees. Dendro, comprises four main modules: a bottom-up octree generation and 2:1 balancing module, a meshing module, a geometric multiplicative multigrid module, and a module for adaptive mesh refinement (AMR). Here, we focus on the multigrid and AMR modules. The key features of Dendro are coarsening/refinement, inter-octree transfers of scalar and vector fields, and parallel partition of multilevel octree forests. We describe a bottom-up algorithm for constructing the coarser multigrid levels. The input is an arbitrary 2:1 balanced octree-based mesh, representing the fine level mesh. The output is a set of octrees and meshes that are used in the multigrid sweeps. Also, we describe matrix-free implementations for the discretized PDE operators and the intergrid transfer operations. We present results on up to 4096 CPUs on the Cray XT3 ("BigBen"), the Intel 64 system ("Abe"), and the Sun Constellation Linux cluster ("Ranger").
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Vokan Akcelik , Jacobo Bielak , George Biros , Ioannis Epanomeritakis , Antonio Fernandez , Omar Ghattas , Eui Joong Kim , Julio Lopez , David O'Hallaron , Tiankai Tu , John Urbanic, High Resolution Forward And Inverse Earthquake Modeling on Terascale Computers, Proceedings of the 2003 ACM/IEEE conference on Supercomputing, p.52, November 15-21, 2003
|
 |
3
|
W. K. Anderson , W. D. Gropp , D. K. Kaushik , D. E. Keyes , B. F. Smith, Achieving high sustained performance in an unstructured mesh CFD application, Proceedings of the 1999 ACM/IEEE conference on Supercomputing (CDROM), p.69-es, November 14-19, 1999, Portland, Oregon, United States
[doi> 10.1145/331532.331600]
|
| |
4
|
Satish Balay, Kris Buschelman, William D. Gropp, Dinesh Kaushik, Matt Knepley, Lois Curfman McInnes, Barry F. Smith, and Hong Zhang. PETSc home page, 2001. www.mcs.anl.gov/petsc.
|
| |
5
|
|
| |
6
|
R. Becker and M. Braack. Multigrid techniques for finite elements on locally refined meshes. Numerical Linear Algebra with applications, 7:363--379, 2000.
|
| |
7
|
|
| |
8
|
Marshall W. Bern, David Eppstein, and Shang-Hua Teng. Parallel construction of quadtrees and quality triangulations. International Journal of Computational Geometry and Applications, 9(6):517--532, 1999.
|
| |
9
|
M. Bittencourt and R. Feij'oo. Non-nested multigrid methods in finite element linear structural analysis. In Virtual Proceedings of the 8th Copper Mountain Conference on Multigrid Methods (MGNET), 1997.
|
| |
10
|
Hans-Joachim Bungartz, Miriam Mehl, and Tobias Weinzierl. A parallel adaptive cartesian pde solver using space-filling curves. In E. Wolfgang Nagel, V. Wolfgang Walter, and Wolfgang Lehner, editors, Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, volume 4128 of LNCS, pages 1064--1074, Berlin Heidelberg, 2006. Springer-Verlag.
|
| |
11
|
Paul M. Campbell, Karen D. Devine, Joseph E. Flaherty, Luis G. Gervasio, and James D. Teresco. Dynamic octree load balancing using space-filling curves. Technical Report CS-03-01, Williams College Department of Computer Science, 2003.
|
| |
12
|
Edmond Chow, Robert D. Falgout, Jonathan J. Hu, Raymond S. Tuminaro, and Ulrike M. Yang. A survey of parallelization techniques for multigrid solvers. In Michael A. Heroux, Padma Raghavan, and Horst D. Simon, editors, Parallel Processing for Scientific Computing, pages 179--195. Cambridge University Press, 2006.
|
| |
13
|
William M. Deen. Analysis of transport phenomena. Topics in Chemical Engineering. Oxford University Press, New York, 1998.
|
| |
14
|
L. F. Demkowicz, J. T. Oden, and W. Rachowicz. Toward a universal hp adaptive finite element strategy part 1: Constrained approximation and data structure. Computer Methods in Applied Mechanics and Engineering, 77:79--112, 1989.
|
| |
15
|
R. D. Falgout, J. E. Jones, and U. M. Yang. The design and implementation of Hypre, a library of parallel high performance preconditioners. In A. M. Bruaset and A. Tveito, editors, Numerical Solution of Partial Differential Equations on Parallel Computers, volume 51, pages 267--294. Springer-Verlag, 2006.
|
| |
16
|
|
| |
17
|
D. M. Greaves and A. G. L. Borthwick. Hierarchical tree-based finite element mesh generation. International Journal for Numerical Methods in Engineering, 45(4):447--471, 1999.
|
| |
18
|
M. Griebel and G. Zumbusch. Parallel multigrid in an adaptive PDE solver based on hashing. In E. H. D'Hollander, G. R. Joubert, F. J. Peters, and U. Trottenberg, editors. Parallel Computing: Fundamentals, Applications and New Directions, Proceedings of the Conference ParCo '97, 19--22 September 1997, Bonn, Germany, volume 12, pages 589--600, Amsterdam, 1998. Elsevier, North-Holland.
|
| |
19
|
William D. Gropp , Dinesh K. Kaushik , David E. Keyes , Barry Smith, Performance modeling and tuning of an unstructured mesh CFD application, Proceedings of the 2000 ACM/IEEE conference on Supercomputing (CDROM), p.34-es, November 04-10, 2000, Dallas, Texas, United States
|
| |
20
|
|
| |
21
|
Morton E. Gurtin. An introduction to continuum mechanics, volume 158 of Mathematics in Science and Engineering. Academic Press, San Diego, 2003.
|
| |
22
|
|
| |
23
|
Frank Hulsemann, Markus Kowarschik, Marcus Mohr, and Ulrich Rude. Parallel geometric multigrid. In Are M. Bruaset and Aslka Tveito, editors, Numerical Solution of Partial Differential Equations on Parallel Computers, pages 165--208. Birkauser, 2006.
|
| |
24
|
A. C. Jones and P. K. Jimack. An adaptive multigrid tool for elliptic and parabolic systems. International Journal for Numerical Methods in Fluids, 47:1123--1128, 2005.
|
 |
25
|
|
| |
26
|
|
| |
27
|
Miriam Mehl. Cache-optimal data-structures for hierarchical methods on adaptively refined space-partitioning grids, September 2006.
|
| |
28
|
Miriam Mehl, Tobias Weinzierl, and Christoph Zenger. A cache-oblivious self-adaptive full multigrid method. Numerical Linear Algebra with Applications, 13(2--3):275--291, 2006.
|
| |
29
|
NCSA. Abe's system architecture. ncsa.uiuc.edu/UserInfo/Resources/Hardware/Intel64Cluster.
|
| |
30
|
|
| |
31
|
PSC. Bigben's system architecture, www.psc.edu/machines/cray/xt3.
|
| |
32
|
Rahul Sampath and George Biros. A parallel geometric multigrid method for finite elements on octree meshes. Technical report, Georgia Institute of Technology, 2008. Submitted for publication.
|
| |
33
|
Rahul Sampath, Hari Sundar, Santi S. Adavani, Ilya Lashuk, and George Biros. Dendro home page, 2008. www.cc.gatech.edu/csela/dendro.
|
 |
34
|
Hari Sundar , Rahul S. Sampath , Santi S. Adavani , Christos Davatzikos , George Biros, Low-constant parallel algorithms for finite element simulations using linear octrees, Proceedings of the 2007 ACM/IEEE conference on Supercomputing, November 10-16, 2007, Reno, Nevada
[doi> 10.1145/1362622.1362656]
|
| |
35
|
|
| |
36
|
TACC. Ranger's system architecture. www.tacc.utexas.edu/resources/hpcsystems.
|
| |
37
|
|
| |
38
|
|
| |
39
|
|
 |
40
|
Brian S. White , Sally A. McKee , Bronis R. de Supinski , Brian Miller , Daniel Quinlan , Martin Schulz, Improving the computational intensity of unstructured mesh applications, Proceedings of the 19th annual international conference on Supercomputing, June 20-22, 2005, Cambridge, Massachusetts
[doi> 10.1145/1088149.1088195]
|
|