ACM Home Page
Please provide us with feedback. Feedback
Low-constant parallel algorithms for finite element simulations using linear octrees
Full text PdfPdf (1.03 MB)
Source
Conference on High Performance Networking and Computing archive
Proceedings of the 2007 ACM/IEEE conference on Supercomputing - Volume 00 table of contents
Reno, Nevada
SESSION: PDE applications table of contents
Article No. 25  
Year of Publication: 2007
ISBN:978-1-59593-764-3
Authors
Hari Sundar  University of Pennsylvania, Philadelphia, PA
Rahul S. Sampath  University of Pennsylvania, Philadelphia, PA
Santi S. Adavani  University of Pennsylvania, Philadelphia, PA
Christos Davatzikos  University of Pennsylvania, Philadelphia, PA
George Biros  University of Pennsylvania, Philadelphia, PA
Sponsors
IEEE-CS\DATC : IEEE Computer Society
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 42,   Citation Count: 1
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1362622.1362656
What is a DOI?

ABSTRACT

In this article we propose parallel algorithms for the construction of conforming finite-element discretization on linear octrees. Existing octree-based discretizations scale to billions of elements, but the complexity constants can be high. In our approach we use several techniques to minimize overhead: a novel bottom-up tree-construction and 2:1 balance constraint enforcement; a Golomb-Rice encoding for compression by representing the octree and element connectivity as an Uniquely Decodable Code (UDC); overlapping communication and computation; and byte alignment for cache efficiency. The cost of applying the Laplacian is comparable to that of applying it using a direct indexing regular grid discretization with the same number of elements. Our algorithm has scaled up to four billion octants on 4096 processors on a Cray XT3 at the Pittsburgh Supercomputing Center. The overall tree construction time is under a minute in contrast to previous implementations that required several minutes; the evaluation of the discretization of a variable-coefficient Laplacian takes only a few seconds.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
3
 
4
Satish Balay, Kris Buschelman, William D. Gropp, Dinesh Kaushik, Matthew G. Knepley, Lois Curfman McInnes, Barry F. Smith, and Hong Zhang. PETSc Web page, 2001. http://www.mcs.anl.gov/petsc.
 
5
R. Becker and M. Braack. Multigrid techniques for finite elements on locally refined meshes. Numerical Linear Algebra with applications, 7:363--379, 2000.
 
6
 
7
Marshall W. Bern, David Eppstein, and Shang-Hua Teng. Parallel construction of quadtrees and quality triangulations. International Journal of Computational Geometry and Applications, 9(6):517--532, 1999.
 
8
Paul M. Campbell, Karen D. Devine, Joseph E. Flaherty, Luis G. Gervasio, and James D. Teresco. Dynamic octree load balancing using space-filling curves. Technical Report CS-03-01, Williams College Department of Computer Science, 2003.
 
9
S. W. Golomb. Run-length encodings. IEEE Transactions on Information Theory, 12(3):399--401, 1966.
 
10
 
11
M. Griebel and G. Zumbusch. Parallel multigrid in an adaptive PDE solver based on hashing. In E. H. D'Hollander, G. R. Joubert, F. J. Peters, and U. Trottenberg, editors, Parallel Computing: Fundamentals, Applications and New Directions, Proceedings of the Conference ParCo '97, 19--22 September 1997, Bonn, Germany, volume 12, pages 589--600, Amsterdam, 1998. Elsevier, North-Holland.
 
12
 
13
 
14
E. Kim, J. Bielak, O. Ghattas, and J. Wang. Octree-based finite element method for large-scale earthquake ground motion modeling in heterogeneous basins. AGU Fall Meeting Abstracts, pages B1221+, December 2002.
15
 
16
 
17
 
18
R. F. Rice. Some practical universal noiseless coding techniques. Technical Report JPL Publication 79--22, Jet Propulsion Laboratory, Pasadena, California, 1979.
 
19
Hari Sundar, Rahul S. Sampath, and George Biros. Bottom-up construction and 2:1 balance re nement of linear octrees in parallel. University of Pennsylvania Technical Report, MS-CIS-07-05, 2007.
 
20
Tiankai Tu and David R. O'Hallaron. Balance refinement of massive linear octree datasets. CMU Technical Report, CMU-CS-04(129), 2004.
 
21
Tiankai Tu and David R. O'Hallaron. Extracting hexahedral mesh structures from balanced linear octrees. In 13th International Meshing Roundtable, pages 191--200, Williamsburg, VA, Sandia National Laboratories, September 19--22 2004.
 
22
23
 
24
25
 
26
 
27

Collaborative Colleagues:
Hari Sundar: colleagues
Rahul S. Sampath: colleagues
Santi S. Adavani: colleagues
Christos Davatzikos: colleagues
George Biros: colleagues