ABSTRACT
Client request rates for Internet services tend to be bursty, so it is important to maintain efficient resource utilization under a wide range of load conditions. Network service clients typically seek services interactively, and maintaining reasonable response time is often imperative for such services. In addition, providing differentiated service quality and resource allocation to multiple service classes can be desirable. This paper presents an integrated resource management framework (part of the Neptune system) that provides flexible service quality specification, efficient resource utilization, and service differentiation for cluster-based services. The framework introduces the metric of quality-aware service yield, which combines overall system efficiency and individual service response time in one flexible model. Resources are managed through a two-level request distribution and scheduling scheme. At the cluster level, a fully decentralized request distribution architecture is employed to achieve high scalability and availability. Inside each service node, an adaptive scheduling policy maintains efficient resource utilization under a wide range of load conditions. Our trace-driven evaluations demonstrate the performance, scalability, and service differentiation achieved by the proposed techniques.
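As a rough illustration of how a single metric can combine system efficiency with per-request response time, the sketch below models a yield that depends on response time and sums it over completed requests. The abstract does not give the form of the yield function, so the piecewise-linear shape, the two deadlines, and all names here (`ServiceClass`, `request_yield`, `aggregate_yield`) are assumptions made for illustration, not the paper's definition.

```python
from dataclasses import dataclass
from typing import Iterable, Tuple


@dataclass(frozen=True)
class ServiceClass:
    """Hypothetical per-class quality specification (not from the paper)."""
    name: str
    full_yield: float     # yield earned when the response is fast enough
    soft_deadline: float  # seconds; full yield up to this response time
    hard_deadline: float  # seconds; zero yield at or beyond this response time


def request_yield(service_class: ServiceClass, response_time: float) -> float:
    """Assumed yield shape: full before the soft deadline, zero at or after
    the hard deadline, and linearly decreasing in between."""
    if response_time <= service_class.soft_deadline:
        return service_class.full_yield
    if response_time >= service_class.hard_deadline:
        return 0.0
    window = service_class.hard_deadline - service_class.soft_deadline
    remaining = (service_class.hard_deadline - response_time) / window
    return service_class.full_yield * remaining


def aggregate_yield(completed: Iterable[Tuple[ServiceClass, float]]) -> float:
    """System-wide yield: the sum of per-request yields."""
    return sum(request_yield(c, t) for c, t in completed)
```

Under these assumptions, giving one class a larger `full_yield` or a tighter deadline is one way service differentiation could be expressed, while the aggregate captures how efficiently the system as a whole is serving requests.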