| Mars: a MapReduce framework on graphics processors |
| Full text |
Pdf
(261 KB)
|
Source
|
PACT
archive
Proceedings of the 17th international conference on Parallel architectures and compilation techniques
table of contents
Toronto, Ontario, Canada
SESSION: Middleware and runtime systems
table of contents
Pages 260-269
Year of Publication: 2008
ISBN:978-1-60558-282-5
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 55, Downloads (12 Months): 435, Citation Count: 5
|
|
|
ABSTRACT
We design and implement Mars, a MapReduce framework, on graphics processors (GPUs). MapReduce is a distributed programming framework originally proposed by Google for the ease of development of web search applications on a large number of commodity CPUs. Compared with CPUs, GPUs have an order of magnitude higher computation power and memory bandwidth, but are harder to program since their architectures are designed as a special-purpose co-processor and their programming interfaces are typically for graphics applications. As the first attempt to harness GPU's power for MapReduce, we developed Mars on an NVIDIA G80 GPU, which contains over one hundred processors, and evaluated it in comparison with Phoenix, the state-of-the-art MapReduce framework on multi-core CPUs. Mars hides the programming complexity of the GPU behind the simple and familiar MapReduce interface. It is up to 16 times faster than its CPU-based counterpart for six common web applications on a quad-core machine.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
AMD CTM. http://ati.amd.com/products/streamprocessor/, 2007.
|
| |
3
|
Apache Hadoop. http://lucene.apache.org/hadoop/, 2006.
|
 |
4
|
|
 |
5
|
Ian Buck , Tim Foley , Daniel Horn , Jeremy Sugerman , Kayvon Fatahalian , Mike Houston , Pat Hanrahan, Brook for GPUs: stream computing on graphics hardware, ACM Transactions on Graphics (TOG), v.23 n.3, August 2004
|
| |
6
|
B. Catanzaro, N. Sundaram, and K. Keutzer. A map reduce framework for programming graphics processors. In Workshop on Software Tools for MultiCore Systems, 2008.
|
 |
7
|
|
| |
8
|
C. Chu, S. Kim, Y. Lin, Y. Yu, G. Bradski, A. Y. Ng, and K. Olukotun. Map-reduce for machine learning on multicore. In NIPS '07: Proceedings of Twenty-First Annual Conference on Neural Information Processing Systems. Neural Information Processing Systems Foundation, 2007.
|
 |
9
|
|
| |
10
|
|
| |
11
|
Folding@home. http://www.scei.co.jp/folding, 2007.
|
 |
12
|
Naga Govindaraju , Jim Gray , Ritesh Kumar , Dinesh Manocha, GPUTeraSort: high performance graphics co-processor sorting for large database management, Proceedings of the 2006 ACM SIGMOD international conference on Management of data, June 27-29, 2006, Chicago, IL, USA
[doi> 10.1145/1142473.1142511]
|
 |
13
|
Naga K. Govindaraju , Brandon Lloyd , Wei Wang , Ming Lin , Dinesh Manocha, Fast computation of database operations using graphics processors, Proceedings of the 2004 ACM SIGMOD international conference on Management of data, June 13-18, 2004, Paris, France
[doi> 10.1145/1007568.1007594]
|
| |
14
|
M. Harris, J. Owens, S. Sengupta, Y. Zhang, and A. Davidson. Cudpp: Cuda data parallel primitives library. 2007.
|
 |
15
|
|
 |
16
|
Bingsheng He , Ke Yang , Rui Fang , Mian Lu , Naga Govindaraju , Qiong Luo , Pedro Sander, Relational joins on graphics processors, Proceedings of the 2008 ACM SIGMOD international conference on Management of data, June 09-12, 2008, Vancouver, Canada
[doi> 10.1145/1376616.1376670]
|
| |
17
|
D. Horn. Lib GPU FFT, 2006.
|
| |
18
|
|
 |
19
|
|
 |
20
|
Michael D. Linderman , Jamison D. Collins , Hong Wang , Teresa H. Meng, Merge: a programming model for heterogeneous multi-core systems, Proceedings of the 13th international conference on Architectural support for programming languages and operating systems, March 01-05, 2008, Seattle, WA, USA
|
| |
21
|
|
| |
22
|
|
| |
23
|
NVIDIA Corp. . CUDA Occupancy Calculator, 2007.
|
| |
24
|
NVIDIA CUDA. http://developer.nvidia.com/object/cuda.html, 2007.
|
| |
25
|
OpenGL. http://www.opengl.org/, 2007.
|
| |
26
|
J. D. Owens, D. Luebke, N. Govindaraju, M. Harris, J. Kruger, A. E. Lefohn, and T. J. Purcell. A survey of general-purpose computation on graphics hardware. Computer Graphics Forum, 26, 2007.
|
| |
27
|
|
| |
28
|
|
| |
29
|
SETI@home. http://setiathome.berkeley.edu/, 2007.
|
 |
30
|
|
| |
31
|
|
CITED BY 5
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Wenbin Fang , Mian Lu , Xiangye Xiao , Bingsheng He , Qiong Luo, Frequent itemset mining on graphics processors, Proceedings of the Fifth International Workshop on Data Management on New Hardware, June 28-28, 2009, Providence, Rhode Island
|
|