| Rewind, repair, replay: three R's to dependability |
| Full text |
Pdf
(146 KB)
|
| Source
|
ACM SIGOPS European Workshop
archive
Proceedings of the 10th workshop on ACM SIGOPS European workshop
table of contents
Saint-Emilion, France
SESSION: Robust service
table of contents
Pages: 70 - 77
Year of Publication: 2002
|
|
Authors
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 0, Downloads (12 Months): 12, Citation Count: 4
|
|
|
ABSTRACT
Motivated by the growth of web and infrastructure services and their susceptibility to human operator-related failures, we introduce system-level undo as a recovery mechanism designed to improve service dependability. Undo enables system operators to recover from their inevitable mistakes and furthermore enables retroactive repair of problems that were not fixed quickly enough to prevent detrimental effects. We present the "three R's", a model of undo that matches the needs of human error recovery and retroactive repair; discuss several of the issues raised by this undo model; and introduce an initial architectural framework for undoable systems using the example of an undoable e-mail service system.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
A. Brown and D. A. Patterson. To Err is Human. Proc. 2001 Workshop on Evaluating and Architecting System dependabilitY, Göteborg, Sweden, July 2001.
|
 |
3
|
|
 |
4
|
|
 |
5
|
W. Keith Edwards , Takeo Igarashi , Anthony LaMarca , Elizabeth D. Mynatt, A temporal model for multi-level undo and redo, Proceedings of the 13th annual ACM symposium on User interface software and technology, p.31-40, November 06-08, 2000, San Diego, California, United States
[doi> 10.1145/354401.354409]
|
| |
6
|
E. N. Elnozahy, D. B. Johnson, and Y. M. Wang. A Survey of Rollback-Recovery Protocols in Message-Passing Systems. CMU TR 96--181, Carnegie Mellon, 1996.
|
| |
7
|
P. Enriquez, A. Brown, and D. A. Patterson. Lessons from the PSTN for Dependable Computing. Proc. 2002 Workshop on Self-Healing, Adaptive and self-MANaged Systems (SHAMAN), New York, June 2001.
|
 |
8
|
|
| |
9
|
|
| |
10
|
|
| |
11
|
D. Kurlander and S. Feiner. Editable Graphical Histories. Proc 1988 IEEE Workshop on Visual Languages, Pittsburgh, PA, October 1988.
|
| |
12
|
D. E. Lowell, S. Chandra, and P. Chen. Exploring Failure Transparency and the Limits of Generic Recovery. Proc. 4th OSDI. San Diego, CA, October 2000.
|
 |
13
|
|
| |
14
|
D. Oppenheimer and D. A. Patterson. Why do Internet services fail, and what can be done about it? Proc. 10th ACM SIGOPS European Workshop. Saint-Emilion, France, September 2002.
|
| |
15
|
J. Reason. Human Error. Cambridge University Press, 1990.
|
 |
16
|
|
| |
17
|
Roxio, Inc. GoBack3. http://www.roxio.com/en/products/goback/index.jhtml.
|
CITED BY 4
|
|
|
|
|
|
|
|
Michael E. Locasto , Angelos Stavrou , Gabriela F. Cretu , Angelos D. Keromytis, From STEM to SEAD: speculative execution for automated defense, 2007 USENIX Annual Technical Conference on Proceedings of the USENIX Annual Technical Conference, p.1-14, June 17-22, 2007, Santa Clara, CA
|
|
|
|