ACM Home Page
Please provide us with feedback. Feedback
On designing direct dependency: based fast recovery algorithms for distributed systems
Full text PdfPdf (1.10 MB)
Source ACM SIGOPS Operating Systems Review archive
Volume 38 ,  Issue 1  (January 2004) table of contents
Pages: 58 - 73  
Year of Publication: 2004
ISSN:0163-5980
Authors
B. Gupta  Southern Illinois University, Carbondale, IL
Z. Liu  Southeast Missouri State University, Cape Girardeau, MO
Z. Liang  Southern Illinois University, Carbondale, IL
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 0,   Downloads (12 Months): 3,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/974104.974110
What is a DOI?

ABSTRACT

In this paper, we have proposed two recovery algorithms for distributed systems. Both algorithms follow a revolving centralized scheme. The direct dependency tracking of an integer representing the number of messages sent by each process has been shown to be sufficient to determine the maximum consistent state. The main feature of the recovery algorithms is that they are executed simultaneously by all the participating processes while determining the maximum consistent state. It thus ensures fast execution. The time overheads of the recovery algorithms are reduced further because both algorithms avoid some unnecessary comparisons while determining a consistent global checkpoint. The second algorithm has been shown to be faster than the first one, because it avoids, in general, much larger number of unnecessary comparisons compared to the first one; however the trade off is the increased amount of control information to be stored at each checkpoint in the second algorithm.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
E. N. Elnozahy, D. B. Johnson, and Y. M. Wang, "A Survey of Rollback-Recovery Protocols in Message-Passing System", Technical Report. No CMU-CS-96-181, School of Computer Science, Carnegie Mellon Univ., 1996.
2
 
3
 
4
5
 
6
 
7
 
8
T. Juang, and S. Venkatesan, "Crash Recovery with Little Overhead", Proc. 11th Intl Conf. Distributed Computing Systems, pp. 454--461, May 1991.
 
9
 
10
 
11
 
12
 
13