ACM Home Page
Please provide us with feedback. Feedback
On the weakest failure detector ever
Full text PdfPdf (235 KB)
Source
Annual ACM Symposium on Principles of Distributed Computing archive
Proceedings of the twenty-sixth annual ACM symposium on Principles of distributed computing table of contents
Portland, Oregon, USA
SESSION: Fault tolerance table of contents
Pages: 235 - 243  
Year of Publication: 2007
ISBN:978-1-59593-616-5
Authors
Rachid Guerraoui  IC: EPFL and CSAIL: MIT
Maurice Herlihy  Brown University
Petr Kouznetsov  MPI-SWS
Nancy Lynch  CSAIL: MIT
Calvin Newport  CSAIL: MIT
Sponsors
SIGOPS: ACM Special Interest Group on Operating Systems
ACM: Association for Computing Machinery
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 7
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1281100.1281135
What is a DOI?

ABSTRACT

Many problems in distributed computing are impossible when no information about process failures is available. It is common to ask what information about failures is necessary and sufficient to circumvent some specific impossibility, e.g., consensus, atomic commit, mutual exclusion, etc. This paper asks what information about failures is needed to circumvent any impossibility and sufficient to circumvent some impossibility. In other words, what is the minimal yet non-trivial failure informatio.

We present an abstraction, denoted Υ, that provides very little failure information. In every run of the distributed system, Υ eventually informs the processes that some set of processes in the system cannot be the set of correct processes in that run. Although seemingly weak, for it might provide random information for an arbitrarily long period of time, and it only excludes one possibility of correct set among many, Υ still captures non-trivial failure information. We show that Υ is sufficient to circumvent the fundamental wait-free set-agreement impossibility. While doing so, we (a) disprove previous conjectures about the weakest failure detector to solve set-agreement and we (b) prove that solving set-agreement with registers is strictly weaker than solving n+1-process consensus using n-process consensus. We prove that Υ is, in a precise sense, minimal to circumvent any wait-free impossibility. Roughly, we show that Υ is the weakest eventually stable failure detect or to circumvent any wait-free impossibility.

Our results are generalized through an abstraction Υf that we introduce and prove necessary to solve any problem that cannot be solved in an f-resilient manner, and yet sufficient to solve f-resilient f-set-agreement.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
4
 
5
6
 
7
8
 
9
R. Guerraoui and P. Kouznetsov. On failure detectors and type boosters. In Proceedings of the 17th International Symposium on Distributed Computing, pages 292--305. Springer-Verlag, 2003.
10
11
12
13
 
14
M. Raynal and C. Travers. In search of the holy grail: Looking for the weakest failure detector for wait-free set agreement. In OPODIS, pages 3--19, 2006.
15
16

CITED BY  7

Collaborative Colleagues:
Rachid Guerraoui: colleagues
Maurice Herlihy: colleagues
Petr Kouznetsov: colleagues
Nancy Lynch: colleagues
Calvin Newport: colleagues