ACM Home Page
Please provide us with feedback. Feedback
Understanding fault-tolerant distributed systems
Full text PdfPdf (6.17 MB)
Source
Communications of the ACM archive
Volume 34 ,  Issue 2  (February 1991) table of contents
Pages: 56 - 78  
Year of Publication: 1991
ISSN:0001-0782
Author
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 135,   Downloads (12 Months): 654,   Citation Count: 74
Additional Information:

references   cited by   index terms   review  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/102792.102801
What is a DOI?

REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Abbadi, A.E., Skeen, D., Crisfian, Fo An efficient fault-tolerant protocol Fourth, ACM Conference on P?inciples of Database S~steras (1985).
 
2
 
3
Avizienis, A. Software fault tolerance. IFIP Comt~aer Com~'ress (San Francisco, Aug. i989),
 
4
avizienis. A,, Gunningberg, p.. Kelly J. strigini, L., Traverse, P., Tso, K., Voges, U. The UCLA Dedix system : A distributed tested for multi-version software. 15th lnternationd Conference on Fauh-tolerand Computing (Ann Afar, Michi 1985).
 
5
Babaoglu, O., Drumoind , R.streets of Byzantium: Network architetures for fast realible brodcast . IEEE Trans. Softw. Eng. SE-11, 6, (1985).
6
7
 
8
 
9
10
11
 
12
Carr,R. The Tandem Global update Protocol.Tandem Sys. Rev. i,2 (June 1985).
13
14
15
 
16
Comer, D., Perterson, L.Understanding. Distributred Comput. 3 (1989), 51-60.
 
17
Copper. E. Rep{licated distrubtation, programs. ph.D dissertration, UC Berkely, 1985.
 
18
Cristain, F.A. rigorous approach to faule-toleant programming. IEEE Trans. Softw. Eng. SE 11,1 (1985).
 
19
Cristain, F. Agreeing on who is present and who is absent in a synchronous distributed system. 18th International Conference on Fault- Tolerant Computing (Tokyo, June 1988).
 
20
Cristain, F. Exception handling. In Dependability of Resident Computers. T. Anderson, De., Blackwell Svientrific Publication, Oxford, 1989.
 
21
Cristain, F. Probailistic clock synchronization. Distributed Computing3 (1989), 146-158.
 
22
Cristain, F. Synchronous atomic broadcast for redundant broadcast channels. IBM Res. Rep. RJ 7203, Dec.1989.
 
23
Cristain, F., Aghilim H., Strong, R., Dolev, D. Atomic broadcast: From simple diffusion to Byzantine agreement. 15th International Conference on Fault-tolerant Computimng (Ann Arbor, Mich., 1985).
 
24
Cristain, F. Dancey, R. Dehn, J Fault-tolerant in the adacned automation system. 20th International Conference on Fault-tolerant Computing (Newcastle upon Tyne, England, June 1990).
 
25
Dijkstra. E. Hierarchuical ordering of sequential process. Acta Informatica 1 (1971), 115-138.
 
26
Ezhilchelvan, P., Shrivastave, S., A characterization of faultsmin systems. Fifth Symposium on Reliabil;ity in Distributed Software and Database systems (Los Angeles. Jan. 1989).
 
27
Gracia-Moilina, H., Spauster, A. Message ordring in a multicast environment. Nonth Intrernational Conference on Distirbuted Systems (Newport Beach, Calif., June 1989).
 
28
 
29
Gray, J. Why do Computers Sytop and what cna bne dpone about it? Fifth Symposium on Reliability in Distributed Softwarre and Database systems )Los Angles, Jan. 1986).
 
30
Harper, R., Lala, J., Deyst, J. Fault tolerant paralled processor architectuere overivew. 18th International Conference Fault-Tolerant Computing (Tokyo, June 1989).
 
31
Hopkins, A. Smith, B., Lala, J. FTMP-A highly reliable fault-toler ant multi-processor for aircraft. In Proceesings IEEE, Vol, 66. Ocy. 1978.
 
32
IBM International TEchnical Support Centeres. IMS/VS extended recovery faculity (XRF). Tech. Ref. 1978
 
33
Johson, D., Zwaenepoei, W. Sender based meeage logging. 17th Inernational Conference on Fault- Tolerant Computign (Tokyo, June 1987).
34
 
35
Knight, J., Amann, P. Issues infuencing the us of N-version programming in Processing of the IFIP congress(San Francisco, Aug. 1989).
 
36
 
37
Kopetz, H., Curnstedi, G., Resiinger, J. Fault-tolerant membership in a sunchronous real-times systems. IFIP Working Conference on Dependable Computing for Critical Aplications (Santa Barbara, Aug. 1989).
38
 
39
Ladin, R., Liskov, B., Shria, L., Lazy replication: A method for managing replicated data. Ninth Annual ACM Symposium on Prinicples of Distributed Computing (Aug. 1990).
40
 
41
Lamport, L., The part time Parli ment. Ces SRC Rep. 49, Sept. 1989.
 
42
Lamport, L., Sturgis, H., Atomic Transactions in Distributed Systems: An Advanced Course. Lecture Notes in Computing Science Vol. 105. Springer Verlag, 1981.
 
43
Laprie, J.C. Dpendability: A unifying Concept For Reliable Computing and Fault-tolerant, T. Anderson, Ed., Blackwell Scientific Publications, Oxford, 1989.
 
44
 
45
Le Lann, G. Critical issues in distributed realtimes computing. In preceedings of ESTEC Workship on communication Networks and Distribuuted Operating Systems within the Space Environment, European Space Agency REp. WPP-10, Noordwijk, Oct. 24-26, 1989.
 
46
Luan, S., Gligor, V. A fault-tolerant protocal for atomic broadcast. 10th International Conference on Distributed Computing Systems (Paris, May 1990).
 
47
McCluskey, E. Fault-tolerant systerms. Tech. Rep. CSL-199 Standfor Univ., 1982.
 
48
49
 
50
Palumbe, D., Butler, R. Measurement of SIFT operating system overhead. NASA Tech. Mem. 86322,1985
 
51
Parnas, D. Desigining software for ease of extension and contraction IEEE Trans Softw. Eng. Se-5, 2 (Mar. 1979).
 
52
Peterson, W., Weldon, E. Error Correcting Codes. MIT Press, Cambridge, Mass., 1972.
 
53
Powell, D. La tolerant aux fautes dasns les systems repats: Les hupothese d'erreur er Leur importance. LAAS REs. Rep. 89-258, Sept. 1989.
 
54
Randell, B. System structure for software fault-tolerant. IEEE Trans. Soft. Eng. SE-1,2 (1975).
55
 
56
 
57
 
58
59
 
60
Strong, R. Skeen, D., Cristian, F., Aghili, H. Handshake protocols. Seventh International Conference on Distributed Computing Systems (Berlin, Sept. 1978).
 
61
 
62
Taylor, D. and Wilison, G. The Strtus system architecture. In Dependability of Resilient Computer. T. An derson, Ed., Blackwell Scientific Publication, Oxford, 1989.
 
63
64
 
65
Wakerly. J. Error deteching codes, selfchecking circuits and applications. El servier Noth Holland, Inc., N.Y., 1978.
 
66
Wensely, J., Lamport, L., Goldberg, J., Green M., Levitt, K., Melliar- Smith, M., Shostak, R. Weinstock, C. SIFT : Design and analysis of a fault tolerant computer for aircraft contorl. Proceedings IEEE, Vol. 66, Oct. 1978.
67

CITED BY  74


REVIEW

"Robert Joel Hofkin : Reviewer"

Nomenclature is always a problem in rapidly developing areas such as fault-tolerant computing or distributed systems. What at first appears to be a serious disagreement may be nothing more than an unfortunate choice of words. We of  more...