ACM Home Page
Please provide us with feedback. Feedback
Open commit protocols tolerating commission failures
Full text PdfPdf (3.21 MB)
Source ACM Transactions on Database Systems (TODS) archive
Volume 18 ,  Issue 2  (June 1993) table of contents
Pages: 289 - 332  
Year of Publication: 1993
ISSN:0362-5915
Authors
Kurt Rothermel  Univ. of Stuttgart, Germany
Stefan Pappe  IBM European Networking Center
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 33,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/151634.151638
What is a DOI?

ABSTRACT

To ensure atomicity of transactions in distributed systems so-called 2-phase commit (2PC) protocols have been proposed. The basic assumption of these protocols is that the processing nodes involved in transactions are “sane,” i.e., they only fail with omission failures, and nodes eventually recover from failures. Unfortunately, this assumption is not realistic for so-called Open Distributed Systems (ODSs), in which nodes may have totally different reliability characteristics. In ODSs, nodes can be classified into trusted nodes (e.g., a banking server) and nontrusted nodes (e.g., a home PC requesting a remote banking service). While trusted nodes are assumed to be sane, nontrusted nodes may fail permanently and even cause commission failures to occur. In this paper, we propose a family of 2PC protocols that tolerate any number of omission failures at trusted nodes and any number of commission and omission failures at nontrusted nodes. The proposed protocols ensure that (at least) the trusted nodes participating in a transaction eventually terminate the transaction in a consistent manner. Unlike Byzantine commit protocols, our protocols do not incorporate mechanisms for achieving Byzantine agreement, which has advantages in terms of complexity: Our protocols have the same or only a slightly higher message complexity than traditional 2PC protocols.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
DECNET General Descrtptton. Doc. AA-K179A-TK. Digital Equipment Corporation.
 
3
DOLEV, D., AND STRONG, H.R. Distributed commit with bounded waiting. In Proceedings of the Second IEEE Syrup. on Reliability in Distributed Software and Database Systems (1982); see also IBM Research Report RJ3417, 1982.
 
4
 
5
GARCIA-MOLINA, H., AND ABBOTT, R.K. Reliable distributed database management. Proceedings of the IEEE 75, 5 (1987), 601 620.
 
6
7
8
 
9
Information Processing Systems Open Systems Interconnection Basic Reference Model. International Standard ISO 7498, 1984.
 
10
Information Processing Systems Open Systems Interconnection--Definition of Common Application Service Elements--Commitment, Concurrency and Recovery. Draft Internationa} Standard ISO 9804/2, 1989.
 
11
Information Processing Systems--Open Systems Interconnection Distributed Transaction Processing-Part 1: Model, Part 2: Service Definition, Part 3: Protocol Specification Draft Proposal ISO 10026/1-3, 1989.
 
12
13
 
14
LELANN, G. Error Recovem,. Lecture Notes in Computer Science 105, Springer, 1981, 371-376.
15
16
17
18
19
20
 
21
ROTHERMEL, K., AND PAPPE, S. Open commit protocols for the tree of processes model. In Proceedings of the lOth {nternatmnal Conference on Dzstrzbuted Computer Systems (Paris, 1990), 236 244.
 
22
23
 
24
SKEEN, D., AND STONEBRAKER, M. A formal model of crash recovery in a distributed system In Procee&ngs of the 5th Internatzonal Workshop on Dlstrzbuted Data Management and Computer Netu,orks (1981), 129 142
 
25
Systems Network Architecture Concepts and Products. IBM Corporation, GC30-3072, 1985
 
26
Systems Network Architecture, Format and Protocol Reference Manual: Architecture Logic For LU Type 6.2 IBM Corporation, GC30-3269, 1985.
 
27
SPECTOR, h., THOMPSON, n., PAUSCH, R,, EPPINGER, J. L., DAVES, R., DUCHAMP, D., DANIELS, D. S., AND BLOCH, J.J. Camelot: A distributed transaction facility for Mach and the Internet--an interim report. Tech. Rep, CMU-CS-87-129, Dept. of Computer Science, Carnegie Mellon Univ., Pittsburgh, Pa, 1987.
 
28
 
29
Introductmn to the Transactmn Momtorzng Fac~l~(~ (TMF) Tandem Computers Incorporated.
 
30
WALTER, B. A robust and efficmnt protocol for checking the availability of remote sites Comput. Netw. 6, 3 (1982), 173 188.
 
31
X/OPEN Portabihty GuLde. Elsevier, New York, 1987.


Collaborative Colleagues:
Kurt Rothermel: colleagues
Stefan Pappe: colleagues