ACM Home Page
Please provide us with feedback. Feedback
The failure and recovery problem for replicated databases
Full text PdfPdf (708 KB)
Source Annual ACM Symposium on Principles of Distributed Computing archive
Proceedings of the second annual ACM symposium on Principles of distributed computing table of contents
Montreal, Quebec, Canada
Pages: 114 - 122  
Year of Publication: 1983
ISBN:0-89791-110-5
Authors
Sponsors
SIGOPS: ACM Special Interest Group on Operating Systems
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 53,   Citation Count: 39
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/800221.806714
What is a DOI?

ABSTRACT

A replicated database is a distributed database in which some data items are stored redundantly at multiple sites. The main goal is to improve system reliability. By storing critical data at multiple sites, the system can operate even though some sites have failed. However, few distributed database systems support replicated data, because it is difficult to manage as sites fail and recover. A replicated data algorithm has two parts. One is a discipline for reading and writing data item copies. The other is a concurrency control algorithm for synchronizing those operations. The read-write discipline ensures that if one transaction writes logical data item ×, and another transaction reads or writes x, there is some physical manifestation of that logical conflict. The concurrency control algorithm synchronizes physical conflicts; it knows nothing about logical conflicts. In a correct replicated data algorithm, the physical manifestation of conflicts must be strong enough so that synchronizing physical conflicts is sufficient for correctness. This paper presents a theory for proving the correctness of algorithms that manage replicated data. The theory is an extension of serializability theory. We apply it to three replicated data algorithms: Gifford's “quorum consensus” algorithm, Eager and Sevcik's “missing writes” algorithm, and Computer Corporation of America's “available copies” algorithm.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Alsberg, P.A., G.G. Belford, J.D. Day, and E. Grapa, "Multi-copy Resiliency Techniques," Distributed Data Management (J.B. Rothnie, P.A. Bernstein, D.W. Shipman, eds.), IEEE, 1978, pp. 128-176.
 
2
Attar, R., P.A. Bernstein, and N. Goodman, "Site Initialization, Recovery, and Back-up in a Distributed Database System," Proc. 6th Berkeley Workshop, Feb. 1982, pp. 185-202.
3
 
4
Bernstein, P.A., N. Goodman, and V. Hadzillacos, "Recovery Algorithms for Database Systems," Proc. 9th IFIPS Congress, Sept. 1983.
5
 
6
Bernstein, P.A., D.W. Shipman, and W.S. Wong, "Formal Aspects of Serializability in Database Concurrency Control," IEEE Trans. on Software Engineering, SE-5, 3 (may 1979), pp. 203-215.
 
7
Dolev, D., "The Byzantine Generals Strike Again," J. of Algorithms, 3, 1 (1982).
8
9
 
10
 
11
12
13
14
15
 
16
Stearns, R.E., Lewis, P.M., II, and Rosen-krantz, D.J., "Concurrency Controls for Database Systems," Proc. of the 17th Annual Symp. on Foundations of Computer Science, IEEE, 1976, pp. 19-32.
17
18
19

CITED BY  39

Collaborative Colleagues:
Philip A. Bernstein: colleagues
Nathan Goodman: colleagues