ACM Home Page
Please provide us with feedback. Feedback
A data-parallel implementation of O(N) hierarchical N-body methods
Full text PdfPdf (251 KB)
Source Conference on High Performance Networking and Computing archive
Proceedings of the 1996 ACM/IEEE conference on Supercomputing (CDROM) table of contents
Pittsburgh, Pennsylvania, United States
Article No. 2  
Year of Publication: 1996
ISBN:0-89791-854-1
Authors
Yu Hu  Division of Applied Sciences, Harvard University, Cambridge, Massachusetts
S. Lennart Johnsson  Department of Computer Sciences, University of Houston, Houston, Texas
Sponsor
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
IEEE Computer Society  Washington, DC, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 38,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/369028.369033
What is a DOI?

ABSTRACT

The O(N) hierarchical N-body algorithms and Massively Parallel Processors allow particle systems of 100 million particles or more to be simulated in acceptable time. We present a data-parallel implementation of Anderson's method and demonstrate both efficiency and scalability of the implementation on the Connection Machine CM-5/5E systems. The communication time for large particle systems amounts to about 10-25%, and the overall efficiency is about 35%. The evaluation of the potential field of a system of 100 million particles takes 3 minutes and 15 minutes on a 256 node CM-5E, giving expected four and seven digits of accuracy, respectively. The speed of the code scales linearly with the number of processors and number of particles.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
A. Appel. An efficient program for many-body simulation. SIAM J. Sci. Stat. Comput., 6(1):85-103, 1985.
 
3
 
4
J. Barnes and P. Hut. A hierarchical O(N log N) force calculation algorithm. Nature, 324:446-449, 1986.
 
5
J. A. Board Jr., Z. S. Hakura, W. D. Elliott, D. C. Gray, W. J. Blanke, and J. Leathrum Jr. Scalable implementations of multipole-accelerated algorithms for molecular dynamics. In Proc. Scalable High Performance Computing Conference SHPCC94. IEEE Computer Society Press, May 1994.
 
6
J. Carrier, L. Greengard, and V. Rokhlin. A fast adaptive multipole algorithm for particle simulations. SIAM Journal of Scientific and Statistical Computing, July 1988.
 
7
J. J. Dongarra, J. D. Croz, I. Duff, and S. Hammarling. A Set of Level 3 Basic Linear Algebra Subprograms: Model implementation and test programs. Technical Report Reprint No. 2, Argonne National Laboratories, Mathematics and Computer Science Division, August 1988.
 
8
J. J. Dongarra, J. D. Croz, S. Hammarling, and R. J. Hanson. An Extended Set of Fortran Basic Linear Algebra Subprograms. Technical Report Technical Memorandum 41, Argonne National Laboratories, Mathematics and Computer Science Division, November 1986.
 
9
W. D. Elliott and J. A. Board. Fast fourier transform accelerated fast multipole algorithm. Technical Report 94-001, Dept. of Electrical Engineering, Duke Univ., 1994.
 
10
 
11
K. Esselink. A comparison of algorithms for long range interactions. Technical Report 500.90.200GRA204-IN73916 AMER.94.006, Koninklijke/Shell-Laboratorium, Amsterdam, August 1994.
 
12
 
13
 
14
L. F. Greengard. The rapid evaluation of potential fields in particle systems. MIT Press, 1988.
 
15
L. F. Greengard and V. Rokhlin. On the efficient implementation of the fast multipole algorithm. Technical Report YALEU/DCS/RR-602, Dept. of Computer Science, Yale Univ., February 1988.
 
16
High Performance Fortran Forum. High performance fortran; language specification, version 1.0. Scientific Programming, 2(1 - 2):1-170, 1993.
 
17
Y. Hu and S. L. Johnsson. A data parallel implementation of hierarchical N-body methods. Technical Report TR-26-94, Harvard University, Division of Applied Sciences, September 1994.
 
18
Y. Hu and S. L. Johnsson. On the accuracy of Anderson's fast N-body algorithm. Technical report TR-06-96, Harvard University, Division of Applied Sciences, January 1996.
 
19
Y. Hu and S. L. Johnsson. A data-parallel adaptive O(N) hierarchical N-body method. Technical report, Harvard University, Division of Applied Sciences, 1996.
 
20
 
21
22
 
23
J. F. Leathrum. The parallel fast multipole method in three dimensions. PhD thesis, Duke University, 1992.
24
25
 
26
L. S. Nyland, J. F. Prins, and J. H. Reif. A data-parallel implementation of the adaptive fast multipole algorithm. In Proceedings of the DAGS '93 Symposium. Dartmouth Institute for Advanced Graduated Studies in Parallel Computation, Hanover, NH, 1993.
27
 
28
K. E. Schmidt and M. A. Lee. Implementing the fast multipole method in three dimensions. J. Stat. Phy., 63(5/6), 1991.
 
29
J. Singh, J. Hennessey, and A. Gupta. Implications of hierarchical N-body methods for multiprocessor architectures. Technical Report CSL-TR-92-506, Stanford University, 1992.
30
 
31
Thinking Machines Corp. CMSSL for CM Fortran, Version 3.1, 1993.
 
32
Thinking Machines Corp. CM Fortran Libraries Reference Manual, Version 2.2, 1994.
 
33
34
 
35
M. S. Warren and J. K. Salmon. A portable parallel particle program. Computer Physics Communications, 87, 1995.
 
36
 
37


Collaborative Colleagues:
Yu Hu: colleagues
S. Lennart Johnsson: colleagues