ACM Home Page
Please provide us with feedback. Feedback
A simulation-based study of scheduling mechanisms for a dynamic cluster environment
Full text PdfPdf (1.08 MB)
Source International Conference on Supercomputing archive
Proceedings of the 14th international conference on Supercomputing table of contents
Santa Fe, New Mexico, United States
Pages: 100 - 109  
Year of Publication: 2000
ISBN:1-58113-270-0
Authors
Yanyong Zhang  Department of Computer Science & Engineering, The Pennsylvania State University, University Park, PA
Anand Sivasubramaniam  Department of Computer Science & Engineering, The Pennsylvania State University, University Park, PA
Jose Moreira  IBM T. J. Watson Research Center, P. O. Box 218, Yorktown Heights, NY
Hubertus Franke  IBM T. J. Watson Research Center, P. O. Box 218, Yorktown Heights, NY
Sponsor
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 24,   Citation Count: 5
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/335231.335241
What is a DOI?

ABSTRACT

Scheduling of processes onto processors of a parallel machine has always been an important and challenging area of research. The issue becomes even more crucial and difficult as we gradually progress to the use of off-the-shelf workstations, operating systems, and high bandwidth networks to build cost-effective clusters for demanding applications. Clusters are gaining acceptance not just in scientific applications that need supercomputing power, but also in domains such as databases, web service and multimedia, which place diverse Quality-of-Service (QoS) demands on the underlying system. Further, these applications have diverse characteristics in terms of their computation, communication and I/O requirements, making conventional parallel scheduling solutions, such as space sharing or coscheduling, an unattractive option. At the same time, leaving it to the native operating system of each node to make decisions independently can lead to ineffective use of system resources whenever there is communication. Instead, an emerging class of dynamic coscheduling mechanisms, that attempt to take remedial actions to guide the system towards coscheduled execution without requiring explicit synchronization, offer a lot of promise for cluster scheduling. Using a detailed simulator, this paper evaluates the pros and cons of different dynamic coscheduling alternatives, while comparing their advantages over traditional coscheduling (and not performing any coordinated scheduling at all). The impact of dynamic job arrivals, job characteristics and different system parameters on these alternatives are evaluated in terms of several performance criteria.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
M. Buchanan and A. Chien. Coordinated Thread Scheduling for Workstation Clusters under Windows NT. In Proceedings of the USENIX Windows NT Workshop, August 1997.
3
 
4
D. G. Feitelson and L. Rudolph. Coscheduling based on Run-Time Identification of Activity Working Sets. Technical Report Research Report RC 18416(80519), IBM T. J. Watson Research Center, October 1992.
 
5
D. G. Peitelson and L. Rudolph. Gang Scheduling Performance Benefits for Fine-Grained Synchronization. Journal of Parallel and Distributed Computing, 16(4):306-318, December 1992.
6
 
7
 
8
 
9
10
 
11
 
12
J. K. Ousterhout. Scheduling Techniques for Concurrent Systems. In Proceedings of the 3rd International Conference on Distributed Computing Systems, pages 22-30, May 1982.
13
 
14
F. Petrini and W. Feng. Buffered Coscheduling: A New Method for Multitasking Parallel Jobs on Distributed Systems. Technical report, Los Alamos National Laboratory, September 1999.
 
15
 
16
R. Subrahmaniam. Implementing Coscheduling Heuristics for Windows NT Clusters. Master's thesis, Dept. of Computer Science and Engineering, Penn State University, University Park, PA 16802, October 1999.
 
17
Thinking Machines Corporation, Cambridge, Massachusetts. The Connection Machine CM-5 Technical Summary, October 1991.
 
18
Specification for the Virtual Interface Architecture. http://maw, viarch, org.
19
 
20
 
21
Y. Zhang, A. Sivasubramaniam, J. Moreira, and H. Franke. A Simulation-based Study of Scheduling Mechanisms for a Dynamic Cluster Environment. Technical Report CSE-99-022, Dept. of Computer Science and Engineering, The Pennsylvania State University, November 1999.


Collaborative Colleagues:
Yanyong Zhang: colleagues
Anand Sivasubramaniam: colleagues
Jose Moreira: colleagues
Hubertus Franke: colleagues