|
ABSTRACT
Decision Support System (DSS) workloads are known to be one of the most time-consuming database workloads that processes large data sets. Traditionally, DSS queries have been accelerated using large-scale multiprocessor. The topic addressed in this work is to analyze the benefits of using high-performance/low-cost processors such as the GPUs and the Cell/BE to accelerate DSS query execution. In order to overcome the programming effort of developing code for different architectures, in this work we explore the use of a platform, Rapidmind, which offers the possibility of executing the same program on both Cell/BE and GPUs. To achieve this goal we propose data-parallel versions of the original database scan and join algorithms. In our experimental results we compare the execution of three queries from the standard DSS benchmark TPC-H on two systems with two different GPU models, a system with the Cell/BE processor, and a system with dual quad-core Xeon processors. The results show that parallelism can be well exploited by the GPUs. The speedup values observed were up to 21x compared to a single processor system.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
N. Bandi, C. Sun, D. Agrawal, and A. E. Abbadi. Hardware acceleration in commercial databases: A case study of spatial operations. Technical report, Computer Science Department, University of California, Santa Barbara, 2004.
|
| |
2
|
|
 |
3
|
|
| |
4
|
M. Charalambous, P. Trancoso, and A. Stamatakis. Initial Experiences Porting a Bioinformatics Application to a Graphics Processor. In Proceedings of the 10th Panhellenic Conference on Informatics(PCI 2005), pages 415--425, November 2005.
|
| |
5
|
J. Cieslewicz and K. A. Ross. Database optimizations for modern hardware. Proceedings of the IEEE, 96(5):863--878, 2008.
|
| |
6
|
|
 |
7
|
Naga Govindaraju , Jim Gray , Ritesh Kumar , Dinesh Manocha, GPUTeraSort: high performance graphics co-processor sorting for large database management, Proceedings of the 2006 ACM SIGMOD international conference on Management of data, June 27-29, 2006, Chicago, IL, USA
[doi> 10.1145/1142473.1142511]
|
 |
8
|
Naga K. Govindaraju , Brandon Lloyd , Wei Wang , Ming Lin , Dinesh Manocha, Fast computation of database operations using graphics processors, Proceedings of the 2004 ACM SIGMOD international conference on Management of data, June 13-18, 2004, Paris, France
[doi> 10.1145/1007568.1007594]
|
| |
9
|
N. K. Govindaraju, N. Raghuvanshi, M. Henson, and D. Manocha. A Cache-Efficient Sorting Algorithm for Database and Data Mining Computations using Graphics Processors. Technical report, UNC, 2005.
|
 |
10
|
|
| |
11
|
A. Greb and G. Zachmann. GPU-ABiSort: Optimal parallel sorting on stream architectures. In Proceedings of the 20th IEEE IPDPS, 2006.
|
| |
12
|
Michael Gschwind , H. Peter Hofstee , Brian Flachs , Martin Hopkins , Yukio Watanabe , Takeshi Yamazaki, Synergistic Processing in Cell's Multicore Architecture, IEEE Micro, v.26 n.2, p.10-24, March 2006
[doi> 10.1109/MM.2006.41]
|
| |
13
|
J. D. Hall and J. C. Hart. GPU Acceleration of Iterative Clustering. In Proceedings of the ACM Workshop on General Purpose Computing on Graphics Processors, August 2004.
|
| |
14
|
T. Jansen, B. v. Rymon-Lipinski, N. Hanssen, and E. Keeve. Fourier volume rendering on the gpu using a split-stream fft. In Proceedings of Vision, Modelling and Visualization, 2004.
|
| |
15
|
|
 |
16
|
|
 |
17
|
|
| |
18
|
|
| |
19
|
NVIDIA. NVIDIA CUDA Technology. http://developer.nvidia.com/object/cuda.html.
|
| |
20
|
Ordinal. Nsort: Fast parallel sorting. http://www.ordinal.com/.
|
 |
21
|
|
| |
22
|
Rapidmind. Writing Applications for the GPU Using the RapidMind Development Platform. http://www.rapidmind.net.
|
| |
23
|
Rapidmind. Rapidmind Platform. http://www.rapidmind.net, 2007.
|
| |
24
|
|
| |
25
|
|
 |
26
|
|
| |
27
|
|
| |
28
|
Transaction Processing Council. TPC Benchmark H (Decision Support) Standard Specification Revision 2.6.1. June 2006.
|
| |
29
|
J. Wang, T.-T. Wong, P.-A. Heng, and C.-S. Leung. Discrete wavelet transform on gpu. http://www.cse.cuhk.edu.hk/ttwong/demo/dwtgpu/dwtgpu.htm
|
 |
30
|
Samuel Williams , John Shalf , Leonid Oliker , Shoaib Kamil , Parry Husbands , Katherine Yelick, The potential of the cell processor for scientific computing, Proceedings of the 3rd conference on Computing frontiers, May 03-05, 2006, Ischia, Italy
[doi> 10.1145/1128022.1128027]
|
|