ACM Home Page
Please provide us with feedback. Feedback
Replica placement for high availability in distributed stream processing systems
Full text PdfPdf (526 KB)
Source Distributed event-based systems; Vol. 332 archive
Proceedings of the second international conference on Distributed event-based systems table of contents
Rome, Italy
SESSION: Availability and reliability of event-based systems table of contents
Pages 181-192  
Year of Publication: 2008
ISBN:978-1-60558-090-6
Authors
Thomas Repantis  University of California, Riverside, CA
Vana Kalogeraki  University of California, Riverside, CA
Sponsors
: IEEE
: ACM
: USENIX
IFIP : International Federation for Information Processing
SIGSOFT: ACM Special Interest Group on Software Engineering
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 23,   Downloads (12 Months): 202,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1385989.1386012
What is a DOI?

ABSTRACT

A significant number of emerging on-line data analysis applications require the processing of data streams, large amounts of data that get updated continuously, to generate outputs of interest or to identify meaningful events. Example domains include network traffic management, stock price monitoring, customized e-commerce websites, and analysis of sensor data. In this paper we look at the problem of high availability in such a distributed stream processing system. By taking into account the particular characteristics of stream processing applications we first identify design principles for a replica placement algorithm for high availability. We incorporate these principles in a decentralized replica placement protocol that aims to maximize availability, while respecting resource constraints, and making performance-aware placement decisions. We have integrated our replica placement protocol in Synergy, our distributed stream processing middleware. Our experimental comparison over PlanetLab with the current state of the art corroborates our claims that our techniques maximize availability while sustaining good performance.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
D. Abadi et al. The design of the Borealis stream processing engine. In Proceedings of 2nd Biennial Conference on Innovative Data Systems Research, CIDR, Asilomar, CA, USA, January 2005.
2
 
3
4
5
 
6
A. Bartoli, R. Jimenez-Peris, B. Kemme, C. Pautasso, S. Patarin, S. Wheater, and S. Woodman. The ADAPT framework for adaptable and composable web services. IEEE Distributed Systems On Line, September 2005.
 
7
8
 
9
N. Budhlraja, K. Marzullo, F. B. Schneider, and S. Toueg. Primary-Backup protocols: Lower bounds and optimal implementations. In Cornell University Technical Report TR-92-1265, January 1992.
 
10
Z. Cai, V. Kumar, B. Cooper, G. Eisenhauer, K. Schwan, and R. Strom. Utility-driven proactive management of availability in enterprise-scale information flows. In Proceedings of 7th Middleware, Melbourne, November 2006.
 
11
 
12
 
13
 
14
 
15
 
16
J. Hwang, Y. Xing, U. Çetintemel, and S. Zdonik. A cooperative, self-configuring high-availability solution for stream processing. In Proceedings of 23rd International Conference on Data Engineering, ICDE, Istanbul, Turkey, April 2007.
 
17
 
18
S. Krishnamurthy, W. Sanders, and M. Cukier. An adaptive quality of service aware middleware for replicated services. IEEE Transactions on Parallel and Distributed Systems, 14(11):1112--1125, November 2003.
 
19
 
20
 
21
Object Management Group. Fault tolerant CORBA. OMG Technical Committee Document formal / 02-06-59, Chapter 23, CORBA/IIOP 3.0.3, 2004.
22
 
23
 
24
C. Plattner, G. Alonso, and M. T. Özsu. DBFarm: A scalable cluster for multiple databases. In Proceedings of 7th Middleware, Melbourne, Australia, November 2006.
 
25
 
26
T. Repantis, X. Gu, and V. Kalogeraki. Synergy: Sharing-aware component composition for distributed stream processing systems. In Proceedings of 7th Middleware, Melbourne, Australia, November 2006.
 
27
T. Repantis and V. Kalogeraki. Alleviating hot-spots in peer-to-peer stream processing environments. In Proceedings of 5th International Workshop on Databases, Information Systems and Peer-to-Peer Computing, DBISP2P, Vienna, Austria, September 2007.
 
28
T. Repantis and V. Kalogeraki. Hot-spot prediction and alleviation in distributed stream processing applications. In Proceedings of 38th International Conference on Dependable Systems and Networks, DSN, Anchorage, AK, USA, June 2008.
 
29
 
30
F. Schintke and A. Reinefeld. Modeling replica availability in large data grids. Grid Computing, 1(2):219--227, June 2003.
31
32
33
 
34
35
 
36
 
37
38
 
39

Collaborative Colleagues:
Thomas Repantis: colleagues
Vana Kalogeraki: colleagues