ACM Home Page
Please provide us with feedback. Feedback
OhHelp: a scalable domain-decomposing dynamic load balancing for particle-in-cell simulations
Full text PdfPdf (805 KB)
Source
International Conference on Supercomputing archive
Proceedings of the 23rd international conference on Supercomputing table of contents
Yorktown Heights, NY, USA
SESSION: Optimizing parallel applications table of contents
Pages 90-99  
Year of Publication: 2009
ISBN:978-1-60558-498-0
Authors
Hiroshi Nakashima  Kyoto University, Kyoto, Japan
Yohei Miyake  Kyoto University, Uji, Japan
Hideyuki Usui  Kyoto University, Uji, Japan
Yoshiharu Omura  Kyoto University, Uji, Japan
Sponsors
ACM: Association for Computing Machinery
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 16,   Downloads (12 Months): 46,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1542275.1542293
What is a DOI?

ABSTRACT

This paper proposes a new method for Particle-in-Cell (PIC) simulations which aims at achieving both good load balancing and scalability so as to be efficiently executed on distributed memory systems. This method, named OhHelp, simply and equally partitions the space domain where charged particles reside and assigns each partitioned subdomain to each computation node for scalable simulation with respect to the size of the domain. Load balancing and thus the scalability in terms of the number of particles are accomplished by making each node help another heavily loaded node which deputes a part of particles in its subdomain and replicated subdomain field data to its helpers. The OhHelp load balancer monitors particle movements through subdomain boundaries to check if the helpand-helpers configuration keeps good load balancing and, when unacceptable imbalance is found, dynamically reconfigures it to regain perfect balancing. The efficiency and scalability of OhHelp are confirmed through our experiment with a production-level full-3D plasma simulator and with uniform and non-uniform particle distributions. As a result, we found 256-core parallel simulations, including an extremely imbalanced setting to pack all the particles in a small region, exert 159-190 speedup compared to sequential execution.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
J. Barnes and P. Hutt. A hierarchical O (N log N ) force calculation algorithm. Nature, 324:446--449, Dec. 1986.
 
3
S. Briguglio, B. D. Martino, G. Fogaccia, and G. Vlad. Hierarchical MPI+OpenMP implementation of parallel PIC applications on clusters of symmetric multiprocessors. In Euro PVM/MPI 2003, volume 2840 of Lecture Notes in Computer Science, pages 180--187. Springer-Verlag, 2003.
 
4
S. Briguglio, B. D. Martino, and G. Vlad. Workload decomposition strategies for hierarchical distributed-shared memory parallel systems and their implementation with integration of high-level parallel languages. Concurrency and Computation:Practice and Experience, 14:933--956, 2002.
 
5
P. M. Campbell and E. A. Carmona. Hierarchical domain decomposition with unitary load balancing for electromagnetic particle-in-cell codes. In Proc. 5th Distributed Memory Computing Conf., pages 943--950, 1990.
 
6
E. A. CarmonaandL. J. Chandler. OnparallelPIC versatility and the structure of parallel PIC approaches. Concurrency: Practice and Experience, 9(12):1377--1405, 1997.
 
7
J. M. Dawson. Particle simulation of plasmas. Review of Modern Physics, 55(2):403--447, 1983.
 
8
S. Ethier, W. M. Tang, and Z. Lin. Gyrokinetic particle-in-cell simulations of plasma microturbulence on advanced computing platforms. J. Physics: Conference Series, 16:1--15, 2005.
 
9
 
10
G. C. Fox. A review of automatic load balancing and decomposition methods for the Hypercube. In Numerical Algorithms for Modern Parallel Computer Architectures, pages 63--76. Springer-Verlag, 1988.
 
11
D. F. Gordon, W. B. Mori, and T. M. Antonsen. A ponderomotive guiding center particle-in-cell code for efficient modeling of laser-plasma interactions. IEEE Trans. Plasma Science, 28(4):1135--1143, 2000.
 
12
J. Joseph D. Blahovec, L. A. Bowers, J. W. Luginsland, G. E. Sasser, and J. J. Watrous. 3-D ICEPIC simulations of the relativistic klystron oscillator. IEEE Trans. Plasma Science, 28(3):821--829, 2000.
 
13
E. Kawamura, C. K. Birdsall, and V. Vahedi. Physical and numerical methods of speeding up particle codes and paralleling as applied to RF discharges. Plasma Sources Science & Technology, 9:413--428, 2000.
 
14
15
 
16
C. S. Lin, A. L. Thring, and J. Koga. A parallel particle-in-cell model for the massively parallel processor. In Proc. Symp. Frontiers of Massively Parallel Computation, pages 339--342, 1988.
 
17
O. M. Lubeck and V. Faber. Modeling the performance of Hypercubes: A case study using the particle-in-cell application. Parallel Computing, 9:37--52, 1988.
 
18
 
19
B. Moon and J. Saltz. Adaptive runtime support for direct simulation Monte Carlo methods on distributed memory architectures. In Proc. Scalable High-Performance Computing Conference, pages 176--183, 1994.
 
20
 
21
S. J. Plimpton, D. B. Seidel, M. F. Pasik, R. S. Coats, and G. R. Montry. A load-balancing algorithm for a parallel electromagnetic particle-in-cell code. Computer Physics Communications, 152:227--241, 2003.
 
22
 
23
 
24
 
25
D. B. Seidel, S. J. Plimpton, M. F. Pasik, R. S. Coats, and G. R. Montry. Dynamic load balancing for a parallel electromagnetic particle-in-cell code. In Proc. IEEE Intl. Pulsed Power Conf., 2001.
 
26
 
27
R. Thackera, G. Pringleb, H. M. P. Couchmana, and S. Boothb. HYDRA-MPI: An adaptive particle-particle, particle-mesh code for conducting cosmological simulations on MPP architectures. In Proc. Intl. Symp. High Performance Computing Systems and Applications, 2003.
 
28
T. Umeda, Y. Omura, T. Tominaga, and H. Matsumoto. A new charge conservation method in electro-magnetic particle-in-cell simulations. Computer Physics Communications, 156:73--85, 2003.
 
29
H. Usui et al. Development and application of geo-space environment simulator for the analysis of spacecraft-plasma interactions. IEEE Trans. Plasma Science, 34(5):2094--2102, 2006.
 
30
D. W. Walker. The implementation of a three dimensional PIC code on a Hypercube concurrent processor. In Proc. 4th Conf. Hypercube Concurrent Computers and Applications, 1989.
 
31
D. W. Walker. Particle-in-cell plasma simulation codes on the Connection Machine. Computer Systems in Engineering, 2(2--3):307--319, 1991.
 
32
F. Wolfheimer, E. Gjonaj, and T. Weiland. A parallel 3D particle-in-cell code with dynamic load balancing. Nuclear Instruments and Methods in Physics Research, A-558:202--204, 2006.

Collaborative Colleagues:
Hiroshi Nakashima: colleagues
Yohei Miyake: colleagues
Hideyuki Usui: colleagues
Yoshiharu Omura: colleagues