|
ABSTRACT
In this work, the emphasis is on the development of strategies to realize techniques of numerical computing on the graphics chip. In particular, the focus is on the acceleration of techniques for solving sets of algebraic equations as they occur in numerical simulation. We introduce a framework for the implementation of linear algebra operators on programmable graphics processors (GPUs), thus providing the building blocks for the design of more complex numerical algorithms. In particular, we propose a stream model for arithmetic operations on vectors and matrices that exploits the intrinsic parallelism and efficient communication on modern GPUs. Besides performance gains due to improved numerical computations, graphics algorithms benefit from this model in that the transfer of computation results to the graphics processor for display is avoided. We demonstrate the effectiveness of our approach by implementing direct solvers for sparse matrices, and by applying these solvers to multi-dimensional finite difference equations, i.e. the 2D wave equation and the incompressible Navier-Stokes equations.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
E. Anderson , Z. Bai , C. Bischof , L. S. Blackford , J. Demmel , Jack J. Dongarra , J. Du Croz , S. Hammarling , A. Greenbaum , A. McKenney , D. Sorensen, LAPACK Users' guide (third ed.), Society for Industrial and Applied Mathematics, Philadelphia, PA, 1999
|
| |
2
|
ATI, 2003. Sample effects on the ATI graphics cards. http://www.ati.com/developer/techpapers.html.
|
 |
3
|
|
 |
4
|
|
| |
5
|
|
| |
6
|
|
 |
7
|
|
| |
8
|
|
 |
9
|
|
 |
10
|
|
| |
11
|
Elder, G. 2002. Radeon 9700. In Proceedings Eurographics/SIGGRAPH Workshop on Graphics Hardware 2002.
|
 |
12
|
|
 |
13
|
|
| |
14
|
|
| |
15
|
|
 |
16
|
|
 |
17
|
Wolfgang Heidrich , Rüdiger Westermann , Hans-Peter Seidel , Thomas Ertl, Applications of pixel textures in visualization and realistic image synthesis, Proceedings of the 1999 symposium on Interactive 3D graphics, p.127-134, April 26-29, 1999, Atlanta, Georgia, United States
[doi> 10.1145/300523.300538]
|
 |
18
|
|
| |
19
|
|
| |
20
|
Hopf, M., and Ertl, T. 2000. Hardware accelerated wavelet transformations. In Proceedings EG/IEEE TCVG Symposium on Visualization VisSym '00, 93--103.
|
| |
21
|
|
 |
22
|
|
 |
23
|
|
 |
24
|
|
| |
25
|
Microsoft, 2002. DirectX9 SDK. http://www.microsoft.com/DirectX.
|
| |
26
|
Montrym, J., and Moreton, H. 2002. GeForce4. In Proceedings Eurographics/SIGGRAPH Workshop on Graphics Hardware 2002.
|
| |
27
|
NVidia, 2002. nvidia OpenGL game of life. http://www.nvidia.com/view.asp?IO=ogl-gameoflife.
|
| |
28
|
NVidia, 2003. Sample effects on the nVIDIA graphics cards. http://developer.nvidia.com/view.asp?PAGE=papers.
|
 |
29
|
|
| |
30
|
|
 |
31
|
|
| |
32
|
|
| |
33
|
Strzodka, R., and Rumpf, M. 2001. Nonlinear diffusion in graphics hardware. In Proceedings EG/IEEE TCVG Symposium on Visualization 2001, 75--84.
|
| |
34
|
Strzodka, R., and Rumpf, M. 2001. Using graphics cards for quantized FEM computations. In Proceedings VIIP 2001, 98--107.
|
| |
35
|
|
| |
36
|
|
| |
37
|
Weiskopf, D., Hopf, M., and Ertl, T. 2002. Hardware-accelerated Lagrangian-Eulerian texture advection for 2D flow visualization. In Proceedings Workshop on Vision, Modeling, and Visualization VMV '02.
|
CITED BY 3
|
|
Samer Al-Kiswany , Abdullah Gharaibeh , Elizeu Santos-Neto , George Yuan , Matei Ripeanu, StoreGPU: exploiting graphics processing units to accelerate distributed storage systems, Proceedings of the 17th international symposium on High performance distributed computing, June 23-27, 2008, Boston, MA, USA
|
|
|
|
|
|
|
|