Cluster computing for web-scale data processing
Source
Technical Symposium on Computer Science Education
Proceedings of the 39th SIGCSE technical symposium on Computer science education
Portland, OR, USA
SESSION: Cluster and grid computing
Pages 116-120
Year of Publication: 2008
ISBN: 978-1-59593-799-5
Authors
Aaron Kimball  University of Washington, Seattle, WA, USA
Sierra Michels-Slettvet  Department of Computer Science and Engineering, University of Washington, WA, USA
Christophe Bisciglia  Google, Inc., Mountain View, CA, USA
Sponsors
ACM: Association for Computing Machinery
SIGACCESS: ACM Special Interest Group on Accessible Computing
SIGCSE: ACM Special Interest Group on Computer Science Education
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 56,   Downloads (12 Months): 483,   Citation Count: 5
DOI: http://doi.acm.org/10.1145/1352135.1352177

ABSTRACT

In this paper we present the design of a modern course in cluster computing and large-scale data processing. The defining differences between this and previously published designs are its focus on processing very large data sets and its use of Hadoop, an open source Java-based implementation of MapReduce and the Google File System, as the platform for programming exercises. Hadoop proved to be a key element for successfully implementing structured lab activities and independent design projects. Through this course, offered at the University of Washington in 2007, we imparted new skills to our students, improving their ability to design systems capable of solving web-scale problems.


