|
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
ALSB76
|
|
| |
ANDE76
|
|
| |
ANDE78
|
ANDERSON, T., LEE, P. A., AND SHRIVAS- TAVA, S.K. "A model of recoverability in multilevel systems," IEEE Trans. Softw. Eng. SE-4, 6 (Nov. 1978), 486-494. The issues involved in supporting backward error recovery for abstract objects m layered multilevel interpretive systems are examined. Two recovery schemes (disjoint and inclusive) are presented for implementing recoverable objects
|
| |
BART77
|
BARTLETT, J.F., "A 'non stop' operating system," Tandem Computers Inc., Cupertino, Calif., 1977. The design of Guardian, the operating system for a fault-tolerant, general-purpose, multiple-computer commercial system, is discussed. Guardian ts constructed from processes which communicate via messages. Fault tolerance is achieved by duplicating hardware and software components and checkpointing state changes so that a backup process may take over in the case of a failure.
|
| |
BERN78
|
BERNSTEIN, P. A., ROTHNIE, J. B. JR., GOODMAN, N., AND PAPADIMITRIOU, C. A. "The concurrency control mechanism of SDD-I: A system for distributed databases. (the fully redundant case)," IEEE Trans. Soflw. Eng. SE-4, 3 (May 1978), 154-168. This paper describes a method for updating data stored redundantly in a network of database management systems. The overhead of always locking all copies of data being updated is avoided by predefining transaction classes that require different levels of synchronization
|
| |
BERN79
|
BERNSTEIN, P A., SHIPMAN, D. W., AND WONG, W.S. "Formal aspects of senallzability in database concurrency control," IEEE Trans. Soflw. Eng. SE-5, 3 (May 1979), 203-216. A formal model based on the use of a log of transaction read and write steps is developed for analyzing and comparing the behavior of different database concurrency control mechanisms. A simple centralized system that permits one read operation and one write operation per transaction is assumed for the analysis. Several proposed mechanisms that do not explicitly lock entities are shown to be equivalent to locking but to allow less concurrency than the conflict analysis method used in SDD-1
|
 |
BERN80a
|
|
 |
BERN80b
|
|
 |
COFF71
|
|
 |
DENN76
|
|
| |
ECKH78
|
ECKHOUSE, R. H. JR., AND STANKOVIC, J.A. "Issues in distributed processing-- An overview of two workshops," Computer 11, 1 (Jan. 1978), 22-26. A summary Gf two workshops on distributed computing held at Brown University m 1976 and 1977. The workshops had three broadly defined goals: (1) to define the field of distributed processing more precisely, (2) to examine the state of the art, and (3) to identify problem areas for future research.
|
| |
ELLI77
|
ELLIS, C. "A robust algorithm for updating duplicate databases," in Proc. 2nd Berkeley Workshop on Distributed Management of Data and Computer Networks," Berkeley, Calif., May 1977, pp. 146-158. Two algorithms, centralized and decentralized, for updating a distributed multiple copy database are formally specified using evaluation nets, a modified form of Petri net. The decentralized solution is robust with respect to node failure.
|
| |
ENSL78
|
ENSLOW, P. H. JR. "What is a 'distributed' data processing system?" Computer 11, 1 (Jan. 1978), 13-21. Since the term "distributed processing" has become overused and misused, this paper is an attempt to specify the characteristics of a new class of systems to be called "distributed data processing systems." These systems are characterized along three dimensions: hardware decentralization, control decentralization, and database decentralization. The systems of interest exhibit a high degree of distribution in all dimensions, as well as a high degree of cooperative autonomy in their overall operation.
|
 |
ESWA76
|
|
| |
GARC78
|
GARCIA-MOLINA, H. "Performance comparison of two update algorithms for distributed databases," m Proc. 3rd Berkeley Workshop on D~str~buted Data Manage. ment and Computer Networks, Berkeley, Calif., 1978, pp. 108-119. A detailed set of simulations is used to compare the performance of two update algorithms: centralized locking and distributed voting. The database is assumed to be fully replicated at each site, and updates are the only access operations. The results show that centralized locking performs considerably better than distributed voting on a wide variety of measures except in highload cases where both perform poorly.
|
| |
GARC79
|
GARCIA-MOLINA, H. "A concurrency control mechanism for distributed databases which uses centralized locking controllers," m Proc. 4th Berkeley Conf. D~stributed Data Management and Computer Networks, Berkeley, Calif., Aug. 1979, pp. 113-124. A concurrency control mechamsm based on independent centralized controllers which operate without explicit backup is proposed. A modified two-phase commit protocol that requires only a majority of acknowledgments from copies is used to perform updates.
|
| |
GLIG80
|
GLIGOR, V. D., AND SHATTUCK, S. H. "On deadlock detection in distributed systems," IEEE Trans. Soflw. Eng. SE-6, 5 (Sept. 1980), 435-440. A counterexample ts used to show that one of the two deadlock detection protocols proposed by Menasce and Muntz {MESA79}, the distributed protocol, is incorrect. Furthermore, it is argued that even a corrected version of this or any similar distributed protocol will be impractical due to performance considerations.
|
| |
GRAY75
|
GRAY, J. N., LORIE, R. A., AND PUTZOLU, G.R. "Granularity of locks in a shared data base," in Proc Int. Conference Very Large Data Bases, Sept. 1975, vol. 1, no. 1, pp. 428-451. A method for locking sets of resources described by tree and directed acychc graph hierarchies is presented. The protocol allows simultaneous locking at various granularities and is based on the introduction of additional lock modes that are assigned to the ancestor nodes.
|
| |
GRAY79
|
|
| |
HAMM78
|
HAMMER, M., AND SHIPMAN, D. "An overview of reliability mechanisms for a distributed data base system," in COMP. CON Spring "78 Proc, Feb. 1978, pp. 63- 65. Mechanisms to create a rehable network that guarantees the eventual delivery of messages and the broadcast of messages to a number of sites are discussed. This approach is being used to support the redundant update mechanisms of SDD-1, a distributed database system being designed and implemented by the Computer Corporation of America.
|
| |
JONE79
|
|
| |
KATZ77
|
KATZMAN, J.A. "A fault-tolerant computing system," Tandem Computers, inc., Cupertino, Calif., 1977. The hardware architecture of a fault-tolerant commercially available computer system is described. The design goals were (I) to provide continuous operation through any single hardware failure, and (2) to be able to repair failures while the system remains on-hne.
|
| |
LAGA79
|
|
 |
LAMP78
|
|
| |
LAMP76
|
LAMPSON, B. W., AND STURGIS, H. E "Crash recovery in a distributed storage system," unpublished paper, Computer Science Lab., Xerox Palo Alto Research Center, Palo Alto, Calif., 1976. An algorithm for maintaining the consistency of a distributed file system in the presence of individual crashes is presented. The algorithm is based on the use of an "intentions list" to implement atomic transactions.
|
| |
LAMP79
|
LAMPSON, B. W., AND STURGIS, H. E. "Crash recovery in a distributed storage system," unpublished paper, Computer Scmnce Lab., Xerox Palo Alto Research Center, Palo Alto, Calif., 1979. A revised and expanded algorithm is presented for maintaining the consistency and reliable storage of data in a distributed system when different portions of the database are updated as a single transaction. The approach is based on implementing a hierarchy of abstract machines.
|
 |
LEDG77
|
|
| |
LELA78
|
LE LANN, G. "Alg~;rithms for distributed data-sharing systems which use tickets," in Proc. 3rd Berkeley Workshop on Dis. trtbuted Data Management and Computer Networks, Berkeley, Calif., Aug. 1978, pp. 259-272. A virtual ring with ctrculating tickets is proposed as a solution to the problem of synchronizing access to shared data m a distributed system. Four distributed algorithms are described for acquinng tickets. The scheme is shown to survive failures of the ring controllers.
|
| |
LEUN79
|
LEUNG, J. Y. T., AND LAI, E. K. "On minimum cost recovery from system deadlock," IEEE Trans. Comput. C-28, 9 (Sept 1979), 671-677. Under the assumptlon that the cost of aborting each member of a set of deadlocked jobs is known, the problem of selectmg a minimum total cost deadlock breaking set is shown to be NP-complete. Three fast heuristics are proposed, and their worst case performance is analyzed.
|
| |
LIN79
|
LIN, W. K. "Concurrency control in a multiple copy distributed database system," m Proc. 4th Berkeley Conf. Distributed Data Management and Computer Networks, Berkeley, Calif., Aug. 1979, pp. 207-220. Modifications to the concurrency control mechanism of SDD-1 are proposed which elnninate the need for timestamps on data items and weaken the constraints of some of the read-write protocols.
|
 |
LOME77
|
|
 |
LORI77
|
|
 |
MENA78
|
|
| |
MENA79
|
MENASCE, D. A., AND MUNTZ, R. R. "Locking and deadlock detection in distributed data bases," IEEE Trans. Softw. Eng. SE-5, 3 (May 1979), 195-201. Two protocols for deadlock detectmn based on the detection of cycles in the transactionwait-for graph are presented and analyzed. The first is based on a hierarchy of lock controllers and the second on a partitioning of the resources by decentralized local controllers.
|
| |
MILE79
|
|
 |
PAPA79
|
|
| |
PEEB78
|
PEEBLES, R., AND MANNING, E. "System architecture for distributed data management," Computer 11, 1 (Jan. 1978), 40-47. The architectures for dtstmbuteri data management systems are classifted as either (1) integrated or loosely coupled multlprocessors, (2) homogeneous federation, or (3) heterogeneous federation. Five fundamental problems that must be addressed by any data management system are then discussed: (1) how to provide an integrated database, (2) where to store data m the system, (3) how to locate data, (4) how to control concurrent access, and (5) how to provide security and integrity.
|
| |
PEEB80
|
PEEBLES, R., AND DOPIRAK, T. "ADAPT: A guest system," in COMP. CON Spring "80 Proc., Feb. 1980. This paper describes the kernel of a chstributed operating system that runs on top of existing single host operating systems that are networked together. The purpose of the kernel is to transform the network of individual systems into a single unified system that will be easier for application programmers to use.
|
| |
RAND75
|
RANDELL, B. "System structure for software fault tolerance." IEEE Trans Softw. Eng. SE-1, 2 (June 1975), 220-232. A method of achieving software fault tolerance by means of standby sparing ~s proposed. The goal is to provide dependable error detection and recovery facihties by using a system structuring techmque called the recovery block, which is used to specify an acceptance test and alternate spare program components. Reliable systems are implemented by building multilevel interpretive virtual machines.
|
 |
RAND78
|
|
| |
RAND79
|
|
| |
REED78
|
|
 |
REED79a
|
|
 |
REED79b
|
|
 |
RIES79
|
|
 |
ROSE78
|
|
 |
ROTH80
|
J. B. Rothnie, Jr. , P. A. Bernstein , S. Fox , N. Goodman , M. Hammer , T. A. Landers , C. Reeve , D. W. Shipman , E. Wong, Introduction to a system for distributed databases (SDD-1), ACM Transactions on Database Systems (TODS), v.5 n.1, p.1-17, March 1980
[doi> 10.1145/320128.320129]
|
 |
RUSS77
|
|
| |
RYPK79
|
RYPKA, D. J., AND LUCIDO, A. P. "Deadlock detection and avoidance for shared logical resources," IEEE Trans. Softw. Eng. SE-5, 5 (Sept. 1979), 465-471. Modes are used to specify the manner m which processes may concurrently access logical resources. Mode compatibility is defined and used to specify deadlock detection and avoidance strategies.
|
 |
SALT78
|
|
| |
SCHL76
|
SCHLAGETER, G. "The problem of lock by value in large data bases," Comput J. 19, 1 (Feb. 1976), 17-20. The concept of lock by value in multiple-access database systems is developed, and a lock system to support this approach is proposed. The problems of implementing lock by value are discussed.
|
 |
SEVE76
|
|
| |
SHAP78
|
SHAPIRO, R. M., AND MILLSTEIN, R. E. "Failure recovery in a distributed data base system," in COMPCON Spring '78 Proc., Feb. 1978, pp. 66-70. A mechanism for maintaining the consistency of redundant copies of a database in the event of a crash of one or more of the sites is presented. Each site maintains a recent update history which is used to restore the crashed site.
|
| |
SHRI78
|
SHRIVASTAVA, S. K., AND BANATRE, J.P. "Reliable resource allocation between unreliable processes," IEEE Trans. Soflw. Eng. SE-4, 3 (May 1978), 230-241. Error recovery problems associated with interacting processes are discussed, and an argument for using separate recovery mechanisms for cooperative and competitive processes is put forward. A recovery mechanism based on new programming language features (recoverable monitors and ports) is developed for processes which compete for the use of shared resources. Tins work extends the recovery block approach.
|
 |
SILB80
|
|
 |
SMIT77a
|
|
 |
SMIT77b
|
|
| |
STON79
|
STONEBRAKER, M. "Concurrency control and consistency of multiple copies of data in distributed INGRES," IEEE Trans. Softw. Eng. SE-5, 3 (May 1979), 180-194. Dmtributed INGRES is a proposed dmtributed version of a relational database system designed and implemented by researchers at the University of California, Berkeley. This paper describes the design philosophy and presents rough outlines of the algorithms proposed for concurrency control, consistency of multiple copies of data, and system crash recovery. Concurrency control is based on the use of local locking and deadlock controllers which cooperate with a centralized controller (the SNOOP) for global deadlock detection.
|
 |
THOM79
|
|
 |
VERH78
|
|
| |
WATS79
|
WATSON, R. W., AND FLETCHER, J. G. "An architecture for support of network operating system services," in Proc. 4th Berkeley Conf. on Distributed Data Management and Networks, Aug. 1979, pp. 18-50. It is argued that network architectures should be designed explicitly to support a coherent network operating system. The suggested model has three levels of protocol: an interprocess communication layer, a service support layer, and a layer of standard services.
|
| |
WULF75
|
WULF, W.A. "Reliable hardware/software architecture," IEEE Trans. Soflw. Eng. SE-1, 2 (June 1975), 233-240. A strategy used to achieve reliability in a hardware/software system m discussed. The proposed approach is based on an extension of modular programming methodology to include dynamic error detection and recovery. Each module is responsible for maintaining the integrity of the abstraction it implements.
|
CITED BY 48
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Paul J. Leach , Paul H. Levine , James A. Hamilton , Bernard L. Stumpf, The file system of an integrated local network, Proceedings of the 1985 ACM thirteenth annual conference on Computer Science, p.309-324, March 1985, New Orleans, Louisiana, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
T. P. Daniell , R. C. Harding, Jr. , S. H. Nauckhoff, Dynamic replication, an overview, Proceedings of the May 16-19, 1983, national computer conference, May 16-19, 1983, Anaheim, California
|
|
|
|
|
|
|
|
|
|
|
|
|
|