ACM Home Page
Please provide us with feedback. Feedback
Adapting a message-driven parallel application to GPU-accelerated clusters
Full text PdfPdf (143 KB)
Source Conference on High Performance Networking and Computing archive
Proceedings of the 2008 ACM/IEEE conference on Supercomputing - Volume 00 table of contents
Austin, Texas
SECTION: Papers table of contents
Article No. 8  
Year of Publication: 2008
ISBN:978-1-4244-2835-9
Authors
James C. Phillips  University of Illinois at Urbana-Champaign, Urbana, IL
John E. Stone  University of Illinois at Urbana-Champaign, Urbana, IL
Klaus Schulten  University of Illinois at Urbana-Champaign, Urbana, IL
Publisher
IEEE Press  Piscataway, NJ, USA
Bibliometrics
Downloads (6 Weeks): 52,   Downloads (12 Months): 498,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

Graphics processing units (GPUs) have become an attractive option for accelerating scientific computations as a result of advances in the performance and flexibility of GPU hardware, and due to the availability of GPU software development tools targeting general purpose and scientific computation. However, effective use of GPUs in clusters presents a number of application development and system integration challenges. We describe strategies for the decomposition and scheduling of computation among CPU cores and GPUs, and techniques for overlapping communication and CPU computation with GPU kernel execution. We report the adaptation of these techniques to NAMD, a widely-used parallel molecular dynamics simulation package, and present performance results for a 64-core 64-GPU cluster.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
J. E. Stone, J. C. Phillips, P. L. Freddolino, D. J. Hardy, L. G. Trabuco, and K. Schulten, "Accelerating molecular modeling applications with graphics processors," J. Comp. Chem., vol. 28, pp. 2618--2640, 2007.
 
2
J. D. Owens, M. Houston, D. Luebke, S. Green, J. E. Stone, and J. C. Phillips, "GPU computing," Proceedings of the IEEE, vol. 96, pp. 879--899, 2008.
 
3
 
4
I. Ufimtsev and T. Martinez, "Quantum chemistry on graphical processing units. 1. strategies for two-electron integral evaluation," Journal of Chemical Theory and Computation, vol. 4, no. 2, pp. 222--231, 2008.
5
6
7
 
8
M. Charalambous, P. Trancoso, and A. Stamatakis, "Initial experiences porting a bioinformatics application to a graphics processor." in Panhellenic Conference on Informatics, 2005, pp. 415--425.
 
9
 
10
E. Elsen, V. Vishal, M. Houston, V. Pande, P. Hanrahan, and E. Darve, "N-body simulations on GPUs," Stanford University, Stanford, CA, Tech. Rep., Jun. 2007, http://arxiv.org/abs/0706.3060.
 
11
"NVIDIA CUDA Compute Unified Device Architecture Programming Guide," NVIDIA, NVIDIA, Santa Clara, CA, USA, 2007.
 
12
M. McCool, "Data-parallel programming on the Cell BE and the GPU using the RapidMind development platform," in GSPx Multicore Applications Conference, Oct./Nov. 2006.
 
13
Advanced Micro Devices Inc., "Brook+ SC07 BOF session," in Supercomputing 2007 Conference, Nov. 2007.
 
14
J. Stratton, S. Stone, and W. mei Hwu, "MCUDA: An efficient implementation of CUDA kemels on multi-cores," University of Illinois at Urbana-Champaign, Tech. Rep. IMPACT-08-01, March 2008. {Online}. Available: http://www.gigascale.org/pubs/1278.html
15
 
16
 
17
18
 
19
 
20
 
21
 
22
J. C. Phillips, R. Braun, W. Wang, J. Gumbart, E. Tajkhorshid, E. Villa, C. Chipot, R. D. Skeel, L. Kale, and K. Schulten, "Scalable molecular dynamics with NAMD," J. Comp. Chem., vol. 26, pp. 1781--1802, 2005.
 
23
M. Nelson, W. Humphrey, A. Gursoy, A. Dalke, L. Kalé, R. Skeel, K. Schulten, and R. Kufrin, "MDScope - A visual computing environment for structural biology," Comput. Phys. Commun., vol. 91, no. 1--3, pp. 111--134, 1995.
 
24
M. Nelson, W. Humphrey, A. Gursoy, A. Dalke, L. Kalé, R. D. Skeel, and K. Schulten, "NAMD - A parallel, object-oriented molecular dynamics program," Int. J. Supercomp. Appl. High Perform. Comp., vol. 10, pp. 251--268, 1996.
 
25
 
26
P. L. Freddolino, F. Liu, M. Gruebele, and K. Schulten, "Tenmicrosecond MD simulation of a fast-folding WW domain," Biophys. J., vol. 94, pp. L75-L77, 2008.
 
27
P. L. Freddolino, A. S. Arkhipov, S. B. Larson, A. McPherson, and K. Schulten, "Molecular dynamics simulations of the complete satellite tobacco mosaic virus," Structure, vol. 14, pp. 437--449, 2006.
 
28
L. V. Kale and S. Krishnan, "Charm++: Parallel Programming with Message-Driven Objects," in Parallel Programming using C++, G. V. Wilson and P. Lu, Eds. MIT Press, 1996, pp. 175--213.
 
29
L. V. Kale, E. Bohm, C. L. Mendes, T. Wilmarth, and G. Zheng, "Programming Petascale Applications with Charm++ and AMPI," in Petascale Computing: Algorithms and Applications, D. Bader, Ed. Chapman & Hall / CRC Press, 2008, pp. 421--441.
 
30
 
31
32
 
33
L. V. Kalé, "The virtualization model of parallel programming: Runtime optimizations and the state of art," in LACSI 2002, Albuquerque, October 2002.
 
34
 
35

Collaborative Colleagues:
James C. Phillips: colleagues
John E. Stone: colleagues
Klaus Schulten: colleagues