ABSTRACT
Server and storage clustering has become a popular platform for hosting large-scale online services. Elements of the service clustering support are often constructed using centralized or hierarchical architectures in order to meet the performance and policy objectives of online applications. For instance, a central executive node can make efficient resource management decisions based on a complete view of cluster-wide resource availability and request demands. A functionally symmetric software architecture can enhance the robustness of cluster-based network services because it has no inherent points of vulnerability. However, such a design must still satisfy the performance requirements and policy objectives of online services. This paper argues for the improved robustness of functionally symmetric architectures and presents the designs of two specific clustering support elements: energy-conserving server consolidation and service availability management. Our emulation and experimentation on a 117-server cluster show that the proposed designs do not significantly compromise system performance or policy objectives compared with centralized approaches.