ACM Home Page
Please provide us with feedback. Feedback
Teaching large scale data processing: the five-week course and two years' experiences
Full text PdfPdf (313 KB)
Source ACM International Conference Proceeding Series; Vol. 368 archive
Proceedings of the 1st ACM Summit on Computing Education in China on First ACM Summit on Computing Education in China table of contents
Beijing, China
SESSION: Papers table of contents
Article No. 2  
Year of Publication: 2008
ISBN:978-1-60558-441-6
Authors
Kang Chen  Tsinghua University, Beijing, China
Yubing Yin  Tsinghua University, Beijing, China
Weimin Zheng  Tsinghua University, Beijing, China
Sponsors
: Intellectual Ventures
Tsinghua University : Tsinghua University
: ACM
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 27,   Downloads (12 Months): 95,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1517632.1517635
What is a DOI?

ABSTRACT

We have setup a new course on the large scale data processing using clusters. It introduces the concepts and design of distributed systems. Many newly developed ideas such as Google file system and MapReduce programming framework for processing large scale data sets are introduced. Students will gain practical experience with distributed programming technologies via several small labs and one large multi-week final project. Labs and projects will be completed using Hadoop, an open-source implementation of Google's distributed file system and MapReduce programming model. We have taught this class named "Mass Data Processing Technology on Large Scale Clusters" for two years. This paper will describe the design, perform of the course as well as the experiences and lessons learned.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
 
4
 
5
 
6
S. G. Jeffrey Dean. Distributed programming with mapreduce. page Chapter 23, 2007.
7
 
8
9
 
10

Collaborative Colleagues:
Kang Chen: colleagues
Yubing Yin: colleagues
Weimin Zheng: colleagues