ABSTRACT
Diagnosis and reconfiguration is concerned with the detection and location of faulty units and the minimization of the effect of faults within a digital system. This topic is motivated by the need for highly available systems that can continue essential operation, perhaps with degraded performance, when hardware failures occur. In recent years there has been great progress in the development of large-scale integrated (LSI) and microcomputer technology. This has led to the consideration of systems of complex elements, such as networks of microprocessors. Today, multiprocessor structures are not only feasible but also highly attractive as cost-effective computer structures. Having a multiplicity of independent processors raises the opportunity for making the system self-diagnosable through interprocessor communication. In addition, the homogeneity of the processors makes it possible for the system to reconfigure after faults occur so that their effect is minimized. In this paper we consider techniques for the design of systems that incorporate diagnostic and computational reconfiguration.