|
ABSTRACT
A significant number of emerging on-line data analysis applications require the processing of data streams, large amounts of data that get updated continuously, to generate outputs of interest or to identify meaningful events. Example domains include network traffic management, stock price monitoring, customized e-commerce websites, and analysis of sensor data. In this paper we look at the problem of high availability in such a distributed stream processing system. By taking into account the particular characteristics of stream processing applications we first identify design principles for a replica placement algorithm for high availability. We incorporate these principles in a decentralized replica placement protocol that aims to maximize availability, while respecting resource constraints, and making performance-aware placement decisions. We have integrated our replica placement protocol in Synergy, our distributed stream processing middleware. Our experimental comparison over PlanetLab with the current state of the art corroborates our claims that our techniques maximize availability while sustaining good performance.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
D. Abadi et al. The design of the Borealis stream processing engine. In Proceedings of 2nd Biennial Conference on Innovative Data Systems Research, CIDR, Asilomar, CA, USA, January 2005.
|
 |
2
|
Atul Adya , William J. Bolosky , Miguel Castro , Gerald Cermak , Ronnie Chaiken , John R. Douceur , Jon Howell , Jacob R. Lorch , Marvin Theimer , Roger P. Wattenhofer, Farsite: federated, available, and reliable storage for an incompletely trusted environment, Proceedings of the 5th symposium on Operating systems design and implementation Due to copyright restrictions we are not able to make the PDFs for this conference available for downloading, December 09-11, 2002, Boston, Massachusetts
[doi> 10.1145/1060289.1060291]
|
| |
3
|
|
 |
4
|
Amitanand S. Aiyer , Lorenzo Alvisi , Allen Clement , Mike Dahlin , Jean-Philippe Martin , Carl Porth, BAR fault tolerance for cooperative services, Proceedings of the twentieth ACM symposium on Operating systems principles, October 23-26, 2005, Brighton, United Kingdom
|
 |
5
|
|
| |
6
|
A. Bartoli, R. Jimenez-Peris, B. Kemme, C. Pautasso, S. Patarin, S. Wheater, and S. Woodman. The ADAPT framework for adaptable and composable web services. IEEE Distributed Systems On Line, September 2005.
|
| |
7
|
Andy Bavier , Mic Bowman , Brent Chun , David Culler , Scott Karlin , Steve Muir , Larry Peterson , Timothy Roscoe , Tammo Spalink , Mike Wawrzoniak, Operating system support for planetary-scale network services, Proceedings of the 1st conference on Symposium on Networked Systems Design and Implementation, p.19-19, March 29-31, 2004, San Francisco, California
|
 |
8
|
|
| |
9
|
N. Budhlraja, K. Marzullo, F. B. Schneider, and S. Toueg. Primary-Backup protocols: Lower bounds and optimal implementations. In Cornell University Technical Report TR-92-1265, January 1992.
|
| |
10
|
Z. Cai, V. Kumar, B. Cooper, G. Eisenhauer, K. Schwan, and R. Strom. Utility-driven proactive management of availability in enterprise-scale information flows. In Proceedings of 7th Middleware, Melbourne, November 2006.
|
| |
11
|
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
| |
15
|
Jeong-Hyon Hwang , Magdalena Balazinska , Alexander Rasin , Ugur Cetintemel , Michael Stonebraker , Stan Zdonik, High-Availability Algorithms for Distributed Stream Processing, Proceedings of the 21st International Conference on Data Engineering, p.779-790, April 05-08, 2005
[doi> 10.1109/ICDE.2005.72]
|
| |
16
|
J. Hwang, Y. Xing, U. Çetintemel, and S. Zdonik. A cooperative, self-configuring high-availability solution for stream processing. In Proceedings of 23rd International Conference on Data Engineering, ICDE, Istanbul, Turkey, April 2007.
|
| |
17
|
|
| |
18
|
S. Krishnamurthy, W. Sanders, and M. Cukier. An adaptive quality of service aware middleware for replicated services. IEEE Transactions on Parallel and Distributed Systems, 14(11):1112--1125, November 2003.
|
| |
19
|
|
| |
20
|
Michael G. Merideth , Arun Iyengar , Thomas Mikalsen , Stefan Tai , Isabelle Rouvellou , Priya Narasimhan, Thema: Byzantine-Fault-Tolerant Middleware forWeb-Service Applications, Proceedings of the 24th IEEE Symposium on Reliable Distributed Systems, p.131-142, October 26-28, 2005
[doi> 10.1109/RELDIS.2005.28]
|
| |
21
|
Object Management Group. Fault tolerant CORBA. OMG Technical Committee Document formal / 02-06-59, Chapter 23, CORBA/IIOP 3.0.3, 2004.
|
 |
22
|
|
| |
23
|
Peter Pietzuch , Jonathan Ledlie , Jeffrey Shneidman , Mema Roussopoulos , Matt Welsh , Margo Seltzer, Network-Aware Operator Placement for Stream-Processing Systems, Proceedings of the 22nd International Conference on Data Engineering, p.49, April 03-07, 2006
[doi> 10.1109/ICDE.2006.105]
|
| |
24
|
C. Plattner, G. Alonso, and M. T. Özsu. DBFarm: A scalable cluster for multiple databases. In Proceedings of 7th Middleware, Melbourne, Australia, November 2006.
|
| |
25
|
Yansong (Jennifer) Ren , David E. Bakken , Tod Courtney , Michel Cukier , David A. Karr , Paul Rubel , Chetan Sabnis , William H. Sanders , Richard E. Schantz , Mouna Seri, AQuA: An Adaptive Architecture that Provides Dependable Distributed Objects, IEEE Transactions on Computers, v.52 n.1, p.31-50, January 2003
[doi> 10.1109/TC.2003.1159752]
|
| |
26
|
T. Repantis, X. Gu, and V. Kalogeraki. Synergy: Sharing-aware component composition for distributed stream processing systems. In Proceedings of 7th Middleware, Melbourne, Australia, November 2006.
|
| |
27
|
T. Repantis and V. Kalogeraki. Alleviating hot-spots in peer-to-peer stream processing environments. In Proceedings of 5th International Workshop on Databases, Information Systems and Peer-to-Peer Computing, DBISP2P, Vienna, Austria, September 2007.
|
| |
28
|
T. Repantis and V. Kalogeraki. Hot-spot prediction and alleviation in distributed stream processing applications. In Proceedings of 38th International Conference on Dependable Systems and Networks, DSN, Anchorage, AK, USA, June 2008.
|
| |
29
|
|
| |
30
|
F. Schintke and A. Reinefeld. Modeling replica availability in large data grids. Grid Computing, 1(2):219--227, June 2003.
|
 |
31
|
|
 |
32
|
Mehul A. Shah , Joseph M. Hellerstein , Eric Brewer, Highly available, fault-tolerant, parallel dataflows, Proceedings of the 2004 ACM SIGMOD international conference on Management of data, June 13-18, 2004, Paris, France
[doi> 10.1145/1007568.1007662]
|
 |
33
|
|
| |
34
|
|
 |
35
|
Bernard Wong , Aleksandrs Slivkins , Emin Gün Sirer, Meridian: a lightweight network location service without virtual coordinates, Proceedings of the 2005 conference on Applications, technologies, architectures, and protocols for computer communications, August 22-26, 2005, Philadelphia, Pennsylvania, USA
|
| |
36
|
|
| |
37
|
Kun-Lung Wu , Kirsten W. Hildrum , Wei Fan , Philip S. Yu , Charu C. Aggarwal , David A. George , Buǧra Gedik , Eric Bouillet , Xiaohui Gu , Gang Luo , Haixun Wang, Challenges and experience in prototyping a multi-modal stream analytic and monitoring application on System S, Proceedings of the 33rd international conference on Very large data bases, September 23-27, 2007, Vienna, Austria
|
 |
38
|
|
| |
39
|
|
|