ACM Home Page
Please provide us with feedback. Feedback
Fault-tolerance in the Borealis distributed stream processing system
Full text PdfPdf (612 KB)
Source International Conference on Management of Data archive
Proceedings of the 2005 ACM SIGMOD international conference on Management of data table of contents
Baltimore, Maryland
SESSION: Research papers: streams table of contents
Pages: 13 - 24  
Year of Publication: 2005
ISBN:1-59593-060-4
Authors
Magdalena Balazinska  MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA
Hari Balakrishnan  MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA
Samuel Madden  MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA
Michael Stonebraker  MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA
Sponsors
ACM: Association for Computing Machinery
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 62,   Citation Count: 20
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1066157.1066160
What is a DOI?

ABSTRACT

We present a replication-based approach to fault-tolerant distributed stream processing in the face of node failures, network failures, and network partitions. Our approach aims to reduce the degree of inconsistency in the system while guaranteeing that available inputs capable of being processed are processed within a specified time threshold. This threshold allows a user to trade availability for consistency: a larger time threshold decreases availability but limits inconsistency, while a smaller threshold increases availability but produces more inconsistent results based on partial data. In addition, when failures heal, our scheme corrects previously produced results, ensuring eventual consistency.Our scheme uses a data-serializing operator to ensure that all replicas process data in the same order, and thus remain consistent in the absence of failures. To regain consistency after a failure heals, we experimentally compare approaches based on checkpoint/redo and undo/redo techniques and illustrate the performance trade-offs between these schemes.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Abadi et al. The design of the Borealis stream processing engine. In CIDR, Jan. 2005.
 
3
Abadi et al. The design of the Borealis stream processing engine. Technical Report CS-04-08, Department of Computer Science, Brown University, Jan. 2005.
 
4
G. Alonso and C. Mohan. WFMS: The next generation of distributed processing tools. In S. Jajodia and L. Kerschberg, editors, Advanced Transaction Models and Architectures. Kluwer, 1997.
 
5
Alonso et al. Exotica/FMQM: A persistent message-based architecture for distributed workflow management. In Proc. of IFIP WG8.1 Working Conf. on Information Systems for Decentralized Organizations, Aug. 1995.
 
6
A. Arasu, S. Babu, and J. Widom. The CQL continuous query language: Semantic foundations and query execution. Technical Report 2003-67, Stanford University, Oct. 2003.
7
8
9
10
 
11
 
12
D. Carney, U. Çetintemel, A. Rasin, S. Zdonik, M. Cherniack, and M. Stonebraker. Operator scheduling in a data stream manager. In 29th VLDB, Sept. 2003.
 
13
S. Chandrasekaran and M. J. Franklin. Remembrance of streams past: Overload-sensitive management of archived streams. In 30th VLDB, Sept. 2004.
 
14
Chandrasekaran et al. TelegraphCQ: Continuous dataflow processing for an uncertain world. In CIDR, Jan. 2003.
 
15
Cherniack et al. Scalable distributed stream processing. In CIDR, Jan. 2003.
16
17
18
19
20
21
22
 
23
 
24
M. Hsu. Special issue on workflow systems. IEEE Data Eng. Bulletin, 18(1), Mar. 1995.
 
25
 
26
27
 
28
Y.-N. Law, H. Wang, and C. Zaniolo. Query languages and data models for database sequences and data streams. In 30th VLDB, Sept. 2004.
29
 
30
Motwani et al. Query processing, approximation, and resource management in a data stream management system. In CIDR, Jan. 2003.
 
31
Naughton et al. The Niagara Internet query system. IEEE Data Eng. Bulletin, 24(2), June 2001.
 
32
33
34
35
36
 
37
R. E. Strom. Fault-tolerance in the SMILE stateful publish-subscribe system. In DEBS, May 2004.
 
38
N. Tatbul, U. Çetintemel, S. Zdonik, M. Cherniack, and M. Stonebraker. Load shedding in a data stream manager. In 29th VLDB, Sept. 2003.
39
 
40
The NTP Project. NTP: The Network Time Protocol. http://www.ntp.org/.
 
41
P. A. Tucker and D. Maier. Dealing with disorder. In MPDS, June 2003.
 
42
R. Urbano. Oracle Streams Replication Administrator's Guide, 10g Release 1 (10.1). Oracle Corporation, Dec. 2003.

CITED BY  20
Collaborative Colleagues:
Magdalena Balazinska: colleagues
Hari Balakrishnan: colleagues
Samuel Madden: colleagues
Michael Stonebraker: colleagues