|
ABSTRACT
Vector timestamps can be used to characterize causality in a distributed computation. This is essential in an observation context where we wish to reason about the partial order of execution. Unfortunately, all current dynamic vector-timestamp algorithms require a vector of size equal to the number of processes in the computation. This fundamentally limits the scalability of such observation systems. In this paper we present a framework algorithm for dynamic vector timestamps whose size can be as small as the dimension of the partial order of execution. While the dimension can be as large as the number of processes, in general it is much smaller.The algorithm consists of three interleaved phases: computing the critical pairs, creating extensions that reverse those critical pairs, and assigning vectors to each event based on the extensions created. We present complete solutions for the first two phases and a partial solution for the third phase.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
{3} Colin Fidge. Fundamentals of distributed systems observation. Technical Report 93-15, Software Verification Research Centre, Department of Computer Science, The University of Queensland, St. Lucia, QLD 4072, Australia, November 1993.
|
| |
4
|
{4} Jerry Fowler and Willy Zwaenepoel. Causal distributed breakpoints. In Proceedings of the 10th IEEE International Conference on Distributed Computing Systems, pages 134-141. IEEE Computer Society Press, 1990.
|
| |
5
|
{5} Claude Jard and Guy-Vincent Jourdan. Dependency tracking and filtering in distributed computations. Technical Report 851, IRISA, Campus de Beaulieu - 35042 Rennes Cedex - France, August 1994.
|
| |
6
|
{6} Thomas Kunz, James P. Black, David J. Taylor, and Twan Basten. POET: Targetsystem independent visualisations of complex distributed-application executions. The Computer Journal, 40(8):499-512, 1997.
|
 |
7
|
|
| |
8
|
{8} Friedemann Mattern. Virtual time and global states of distributed systems. In M. Cosnard et al., editor, Proceedings of the International Workshop on Parallel and Distributed Algorithms, pages 215-226, Chateau de Bonas, France, December 1988. Elsevier Science Publishers B. V. (North Holland).
|
| |
9
|
{9} Oystein Ore. Theory of Graphs, volume 38. Amer. Math. Soc. Colloq. Publ., Providence, R.I., 1962.
|
| |
10
|
|
| |
11
|
|
| |
12
|
|
| |
13
|
|
| |
14
|
{14} James Alexander Summers. Precedence-Preserving Abstraction for Distributed Debugging. Master's thesis, University of Waterloo, Waterloo, Ontario, 1992.
|
| |
15
|
{15} David J. Taylor. Event displays for debugging and managing distributed systems. In Proceedings of the International Workshop on Network and Systems Management, pages 112-124, August 1995.
|
| |
16
|
{16} William T. Trotter. Combinatorics and Partially Ordered Sets: Dimension Theory. Johns Hopkins University Press, Baltimore, MD, 1992.
|
| |
17
|
{17} Paul Ward. On the scalability of distributed debugging: Vector clock size. Technical Report CS98-29, Shoshin Distributed Systems Group, Department of Computer Science, The University of Waterloo, Waterloo, Ontario, Canada N2L 3G1, December 1998. Available at ftp://cs-archive.uwaterloo.ca/csarchive/CS-98-29/CS-98-29.ps.Z.
|
| |
18
|
|
| |
19
|
|
| |
20
|
{20} Mihalis Yannakakis. The complexity of the partial order dimension problem. SIAM Journal on Algebraic and Discrete Methods, 3(3):351-358, September 1982.
|
INDEX TERMS
Primary Classification:
D.
Software
D.2
SOFTWARE ENGINEERING
D.2.4
Software/Program Verification
Subjects:
Formal methods
Additional Classification:
D.
Software
D.2
SOFTWARE ENGINEERING
D.2.5
Testing and Debugging
Subjects:
Distributed debugging
H.
Information Systems
H.3
INFORMATION STORAGE AND RETRIEVAL
H.3.4
Systems and Software
Subjects:
Distributed systems
General Terms:
Algorithms,
Design,
Experimentation,
Theory,
Verification
Keywords:
Ore timestamp,
dimension,
distributed computation,
distributed-system observation,
partial order,
scalability,
vector timestamp
|