ACM Home Page
Please provide us with feedback. Feedback
Load balancing and fault tolerance in workstation clusters migrating groups of communicating processes
Full text PdfPdf (894 KB)
Source ACM SIGOPS Operating Systems Review archive
Volume 29 ,  Issue 4  (October 1995) table of contents
Pages: 25 - 36  
Year of Publication: 1995
ISSN:0163-5980
Authors
S. Petri  Braunschweig, Biiltenweg 74/75, D-38106 Braunschweig, Germany
H. Langendörfer  Braunschweig, Biiltenweg 74/75, D-38106 Braunschweig, German
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 5
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/219282.219288
What is a DOI?

ABSTRACT

In the past, several process migration facilities for distributed systems have been developed. Due to the complex nature of the subject, all those facilities have limitations that make them usable for only limited classes of applications and environments. We discuss some of the usual limitations and possible solutions. Specifically, we focus on migration of groups of collaborating processes between Unix systems without kernel modifications, and from this we derive the design for a migration system. First experiences with our implementation show that we reach performance figures for the migration that are close to those of real distributed operating system.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
[1] Rafael Alonso and Kriton Kyrimis. A Process Migration Implementation for a Unix System. In Usenix Conference Proceedings, pages 365-372, Dallas, TX, February 1988.
 
2
 
3
 
4
[4] Geert Deconinck, Johan Vounckx, Rudi Cuyvers, and Rudy Lauwereins. Survey of Checkpointing and Rollback Techniques. Technical report, Katholieke Universiteit Leuven, Belgium, June 1993.
 
5
6
 
7
[7] Dan Freedman. Experience Building a Process Migration Subsystem for UNIX. In Usenix Conference Proceedings, pages 349- 356, Dallas, TX, January 1991.
 
8
[8] Chad Hunter. Process Cloning: A System for Duplicating UNIX Processes. In Usenix Conference Proceedings, pages 373- 379, Dallas, TX, February 1988.
 
9
[9] Michael Blair Jones. Transparently Interposing User Code at the System Interface. PhD thesis, CMU, September 1992.
 
10
[10] H. Langendörfer and B. Schnor. Verteilte Systeme. Hanser, München, 1994.
 
11
 
12
 
13
[13] Michael Litzkow and Marvin Solomon. Supporting Checkpointing and Process Migration Outside the UNIX Kernel. In Usenix Conference Proceedings, pages 283- 290, San Francisco, CA, January 1992.
 
14
[14] Michael J. Litzkow, Miron Livny, and Matt W. Mutka. Condor - A Hunter of Idle Workstations. In Proceedings of the 8th International Conference on Distributed Computer Systems, pages 104- 111. IEEE, June 1988.
 
15
[15] Thomas Ludwig. Automatische Lastverteilung für Parallelrechner. Reihe Informatik. BI- Wissenschaftsverlag, 1993.
 
16
[16] K. I. Mandelberg and V. S. Sunderam. Process Migration in UNIX Networks. In Usenix Conference Proceedings, pages 357- 363, Dallas, TX, February 1988.
 
17
 
18
 
19
20
 
21
[21] R. Sansom, D. Julin, and R. Rashid. Extending a Capability Based System into a Network Environment. CMU-CS-86-115, April 1986.
22
 
23
[23] Georg Stellner. Consistent checkpoints of pvm applications. In Proceedings of the First European PVM User Group Meeting, 1994.
 
24
[24] Sun Microsystems. SunOS Network Programming Guide, March 1990. Revision A.
 
25
[25] Sun Microsystems. SunOS Reference Manual , 1990. Revision A.
 
26
 
27
[27] Roman Zajcew, Paul Roy, David Black, Chris Peak, Paulo Guedes, Bradford Kemp, John LoVerso, Michael Lei bensper ger, Michael Barnett, Fa ramarz Rabii, and Dur riya Netterwa la. An OSF/1 UNIX for Massively Parallel Multicomputers. In Usenix Conference Proceedings, pages 449- 468, San Diego, CA, January 1993.
 
28
[28] Songnian Zhou, Jingwen Wang, Xiaohu Zheng, and Pierre Delisle. UTOPIA: A Load Sharing Facility for Large, Heterogeneous Distributed Computer Systems. TechnicM Report CSRI-257, CSRI, University of Toronto, April 1992.


Collaborative Colleagues:
S. Petri: colleagues
H. Langendörfer: colleagues