| Transparent fault tolerance for distributed Ada applications |
| Full text |
Pdf
(1.62 MB)
|
| Source
|
Annual International Conference on Ada
archive
Proceedings of the conference on TRI-Ada '94
table of contents
Baltimore, Maryland, United States
Pages: 446 - 457
Year of Publication: 1994
ISBN:0-89791-666-2
|
|
Authors
|
|
Mark A. Breland
|
Microelectronics and Computer Technology Corporation (MCC), 3500 West Balcones Center Drive, Austin, Texas
|
|
Steven A. Rogers
|
Microelectronics and Computer Technology Corporation (MCC), 3500 West Balcones Center Drive, Austin, Texas
|
|
Guillaume P. Brat
|
Department of Electrical and Computer, Engineering, The University of Texas at Austin, Austin, Texas
|
|
Kenneth L. Nelson
|
Computing Devices International, 8800 Queen Avenue South, Bloomington, Minnesota
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 1, Downloads (12 Months): 12, Citation Count: 2
|
|
|
ABSTRACT
The advent of open architectures and initiatives in massively parallel supercomputing, combined with the maturation of distributed processing methods and algorithms, has enabled the implementation of responsive software-based fault tolerance. Expanding capabilities of distributed Ada runtime environments further stimulate the incorporation of hardware fault tolerance into critical, realtime embedded systems. Through the integration of proven Ada program component distribution and virtually synchronous communication protocols, we have established a benchmark fault tolerant system, which layers transparently between an Ada application and the runtime environment. Such transparence allows rapid reconfiguration of distribution and fault tolerance characteristics without change to the source code, thus enhancing portability, scalability, and reuse.The Ada Fault Tolerance project has implemented software technologies which penetrate the envelope of an Ada program to detect, diagnose, and recover from hardware faults. These realtime facilities interact with the Rational distributed application development and runtime environment systems to service replicated Ada software tasks (i.e., threads of control). The deployed system proves that all replicated threads, including those of independently distributed components, can achieve timely consensus during periodic fault detection cycles through transparently embedded voting protocols. Our implementation uses a hybrid redundancy computation strategy and relies on a communication layer which provides virtual synchrony via a causal multicast protocol.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
Kenneth Binnan and Robert Cooper, "The ISIS Project: Real Experience with a Fault Tolerant Programming System", Department of Computer Science, Comell University. Ithaca. New York. TR 90-1138. July 1990.
|
| |
3
|
|
| |
4
|
|
| |
5
|
Marc Chereque, David Powell, Philippe Reynier, Jean-Luc Richier, and Jacques Voiron, "Active Replication in Delta-4", in Proc. 22nd. Intl. Syrup. on Fault-Tolerant Computing (FTCS.22), pages 28-37, iEEE, Boston, MA, July 1992.
|
 |
6
|
|
| |
7
|
Flaviu Cristian, Richard de Beijer, and Shivakant Mishra, "Comparing How Well Asynchronous Atomic Broadcast Protocols Perform", in Proc. of the 3rd Workshop on Responsive Systems, pages 192-204, 1993.
|
| |
8
|
Greg Eisenhauer and Rakesh Jha, "Honeywell Distributed Aria - Approach", Distributed Ada: developments and experiences - Proceedings of the Distributed Ada '89 Symposium, University of Southampton, 11-12 December 1989, edited by Judy Bishop, Cambridge University Press, Cambridge, MA, 1990, pages 141-16 I.
|
| |
9
|
Steven G. Frison and John H. Wensley, "Interactive Consistency and Its Impact on the Design of TMR Systems", in Proc. 12th. Intl. Syrup. on Fault-Tolerant Computing (FTCS-12), pages 228-233, 1EEE, Santa Monica, CA, June 1982.
|
| |
10
|
|
| |
11
|
|
| |
12
|
P. M~iller and G. Hommel, "GranAda: A Programming Environment for Implementing Distributed Real-Time Applications", in Proc. Intl. Syrnp. on Artt~cial Intelligence in Real-Time Control, IFAC, Delft, The Netherlands, June 1992.
|
| |
13
|
BehroozParhami, "Optimal Algorithms for Exact, Inexact, and Approval Voting", in Proc. 22nd. Intl. Syrup. on Fault-Tolerant Computing (FTCS-22), pages 404-411, IEEE, Boston, MA, July 1992.
|
 |
14
|
Pat Rogers , Steve O'Neill , Marc Pitarys , Ken Littlejohn, Demonstrable fault tolerance for distributed Ada, Proceedings of the conference on TRI-Ada '93, p.180-188, September 18-23, 1993, Seattle, Washington, United States
[doi> 10.1145/170657.170713]
|
| |
15
|
|
CITED BY 2
|
|
|
|
|
Pedro de las Heras-Quirós , Jesús M. González-Barahona , José Centeno-González, Programming distributed fault tolerant systems: the replicAda approach, Proceedings of the conference on TRI-Ada '97, p.21-29, November 09-13, 1997, St. Louis, Missouri, United States
|
|