| Linearly scaling 3D fragment method for large-scale electronic structure calculations |
| Full text |
Pdf
(418 KB)
|
| Source
|
Conference on High Performance Networking and Computing
archive
Proceedings of the 2008 ACM/IEEE conference on Supercomputing - Volume 00
table of contents
Austin, Texas
SECTION: ACM Gordon Bell finalists
table of contents
Article No. 65
Year of Publication: 2008
ISBN:978-1-4244-2835-9
|
|
Authors
|
|
Lin-Wang Wang
|
Lawrence Berkeley National Laboratory, Berkeley, CA
|
|
Byounghak Lee
|
Lawrence Berkeley National Laboratory, Berkeley, CA
|
|
Hongzhang Shan
|
Lawrence Berkeley National Laboratory, Berkeley, CA
|
|
Zhengji Zhao
|
Lawrence Berkeley National Laboratory, Berkeley, CA
|
|
Juan Meza
|
Lawrence Berkeley National Laboratory, Berkeley, CA
|
|
Erich Strohmaier
|
Lawrence Berkeley National Laboratory, Berkeley, CA
|
|
David H. Bailey
|
Lawrence Berkeley National Laboratory, Berkeley, CA
|
|
| Publisher |
IEEE Press
Piscataway, NJ, USA
|
| Bibliometrics |
Downloads (6 Weeks): 6, Downloads (12 Months): 106, Citation Count: 0
|
|
|
ABSTRACT
We present a new linearly scaling three-dimensional fragment (LS3DF) method for large scale ab initio electronic structure calculations. LS3DF is based on a divide-and-conquer approach, which incorporates a novel patching scheme that effectively cancels out the artificial boundary effects due to the subdivision of the system. As a consequence, the LS3DF program yields essentially the same results as direct density functional theory (DFT) calculations. The fragments of the LS3DF algorithm can be calculated separately with different groups of processors. This leads to almost perfect parallelization on over one hundred thousand processors. After code optimization, we were able to achieve 60.3 Tflop/s, which is 23.4% of the theoretical peak speed on 30,720 Cray XT4 processor cores. In a separate run on a BlueGene/P system, we achieved 107.5 Tflop/s on 131,072 cores, or 24.2% of peak. Our 13,824-atom ZnTeO alloy calculation runs 400 times faster than a direct DFT calculation, even presuming that the direct DFT calculation can scale well up to 17,280 processor cores. These results demonstrate the applicability of the LS3DF method to material simulations, the advantage of using linearly scaling algorithms over conventional O(N3) methods, and the potential for petascale computation using the LS3DF method.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
A. Luque and A. Marti. Increasing the efficiency of ideal solar cells by photon induced tansitions at intermediate lavels. Phys. Rev. Lett., 78:5014, 1997.
|
| |
2
|
K. M. Yu, W. Walukiewicz, J. Wu, W. Shan, J. W. Beeman, M. A. Scarpulla, O. D. Dubon, and P. Becta. Diluted ii-vi oxide semiconductors with multiple band gaps. Phys. Rev. Lett., 91:246403, 2003.
|
| |
3
|
P. Hohenberg and W. Kohn. Inhomogeneous electron gas. Phys. Rev., 136:B864, 1964.
|
| |
4
|
W. Kohn and L. J. Sham. Self-consistent equations including exchange and correlation effects. Phys. Rev., 140:A1133, 1965.
|
| |
5
|
F. Gygi, E. Draeger, B. R. de Supinski, R. K. Yates, F. Franchetti, S. Kral, J. Lorenz, C. W. Ueberhuber, J. Gunnels, and J. Sexton. Large-scale first-principles molecular dynamics simulations on the bluegene/1 platform using the qbox code. ACM, 2005.
|
| |
6
|
|
| |
7
|
L.-W. Wang. Parallel planewave pseudopotential ab initio package, http://hpcrd.lbl.gov/linwang/petot/petot.html, 2004.
|
| |
8
|
|
| |
9
|
G. Goedecker. Linear scaling electronic structure methods. Rev. Mod. Phys, 71:1085, 1999.
|
| |
10
|
|
| |
11
|
José M Soler, Emilio Artacho, Julian D Gale, Alberto García, Javier Junquera, Pablo Ordejón, and Daniel Sánchez-Portal. The siesta method for ab initio order-n materials simulation. Journal of Physics: Condensed Matter, 14(11):2745--2779, 2002.
|
| |
12
|
|
| |
13
|
W. Yang. Direct calculation of electron density in density-functional theory. Phys. Rev. Lett., 66:1438, 1991.
|
| |
14
|
F. Shimojo, R. K. Kalia, A. Nakano, and P. Vashishta. Embedded divide-and-conquer algorithm on hierarchical real-space grids: parallel molecular dynamics simulation based on lineàrscaling density functional theory. Comput. Phys. Commun., 167:151, 2005.
|
| |
15
|
W. Kohn. Density functional and density matrix method scaling linearly with the number of atoms. Phys. Rev. Lett., 76(17):3168--3171, 1996.
|
| |
16
|
L.-W. Wang, Z. Zhao, and J. Meza. Linear scaling three-dimensional fragment method for large-scale electronic structure calculations. Phys. Rev. B, 77:165113, 2008.
|
| |
17
|
Z. Zhao, J. Meza, and L.-W. Wang. A divide and conquer linear scaling three dimensional fragment method for large scale electronic structure calculations. J. Phys. Cond. Matter, 20(294203), 2008.
|
| |
18
|
L.-W. Wang and J. Li. First-principles thousand-atoms quantum dot calculations. Phys. Rev. B, 69:153302, 2004.
|
| |
19
|
J. Schrier, D. O. Demchenko, L. W. Wang, and A. P. Alivisatos. Optical properties of zno/zns and zno/znte heterostructures for photovoltaic applications. NanoLett., 7:2377, 2007.
|
| |
20
|
M. C. Payne, M. P. Teter, D. C. Allan, T. A. Arias, and J. D. Joannopoulos. Iterative minimization techniques for ab initio total-energy calculations: molecular dynamics and conjugate gradients. Rev. Mod. Phys., 64:1045, 1992.
|
| |
21
|
|
| |
22
|
L. W. Wang and A. Zunger. Solving schrodinger's equation around a desired energy: Application to silicon quantum dots. J. Chem. Phys., 100:2394, 1994.
|
|