ACM Home Page
Please provide us with feedback. Feedback
Mars: a MapReduce framework on graphics processors
Full text PdfPdf (261 KB)
Source
PACT archive
Proceedings of the 17th international conference on Parallel architectures and compilation techniques table of contents
Toronto, Ontario, Canada
SESSION: Middleware and runtime systems table of contents
Pages 260-269  
Year of Publication: 2008
ISBN:978-1-60558-282-5
Authors
Bingsheng He  Hong Kong University of Science and Technology, Hong Kong
Wenbin Fang  Hong Kong University of Science and Technology, Hong Kong
Qiong Luo  Hong Kong University of Science and Technology, Hong Kong
Naga K. Govindaraju  Microsoft Corp., Seattle, WA, USA
Tuyong Wang  Sina Corp., Beijing, China
Sponsors
ACM: Association for Computing Machinery
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 55,   Downloads (12 Months): 435,   Citation Count: 5
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1454115.1454152
What is a DOI?

ABSTRACT

We design and implement Mars, a MapReduce framework, on graphics processors (GPUs). MapReduce is a distributed programming framework originally proposed by Google for the ease of development of web search applications on a large number of commodity CPUs. Compared with CPUs, GPUs have an order of magnitude higher computation power and memory bandwidth, but are harder to program since their architectures are designed as a special-purpose co-processor and their programming interfaces are typically for graphics applications. As the first attempt to harness GPU's power for MapReduce, we developed Mars on an NVIDIA G80 GPU, which contains over one hundred processors, and evaluated it in comparison with Phoenix, the state-of-the-art MapReduce framework on multi-core CPUs. Mars hides the programming complexity of the GPU behind the simple and familiar MapReduce interface. It is up to 16 times faster than its CPU-based counterpart for six common web applications on a quad-core machine.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
AMD CTM. http://ati.amd.com/products/streamprocessor/, 2007.
 
3
Apache Hadoop. http://lucene.apache.org/hadoop/, 2006.
4
5
 
6
B. Catanzaro, N. Sundaram, and K. Keutzer. A map reduce framework for programming graphics processors. In Workshop on Software Tools for MultiCore Systems, 2008.
7
 
8
C. Chu, S. Kim, Y. Lin, Y. Yu, G. Bradski, A. Y. Ng, and K. Olukotun. Map-reduce for machine learning on multicore. In NIPS '07: Proceedings of Twenty-First Annual Conference on Neural Information Processing Systems. Neural Information Processing Systems Foundation, 2007.
9
 
10
 
11
Folding@home. http://www.scei.co.jp/folding, 2007.
12
13
 
14
M. Harris, J. Owens, S. Sengupta, Y. Zhang, and A. Davidson. Cudpp: Cuda data parallel primitives library. 2007.
15
16
 
17
D. Horn. Lib GPU FFT, 2006.
 
18
19
20
 
21
 
22
 
23
NVIDIA Corp. . CUDA Occupancy Calculator, 2007.
 
24
NVIDIA CUDA. http://developer.nvidia.com/object/cuda.html, 2007.
 
25
OpenGL. http://www.opengl.org/, 2007.
 
26
J. D. Owens, D. Luebke, N. Govindaraju, M. Harris, J. Kruger, A. E. Lefohn, and T. J. Purcell. A survey of general-purpose computation on graphics hardware. Computer Graphics Forum, 26, 2007.
 
27
 
28
 
29
SETI@home. http://setiathome.berkeley.edu/, 2007.
30
 
31


Collaborative Colleagues:
Bingsheng He: colleagues
Wenbin Fang: colleagues
Qiong Luo: colleagues
Naga K. Govindaraju: colleagues
Tuyong Wang: colleagues