ACM Home Page
Please provide us with feedback. Feedback
Integrated resource management for cluster-based Internet services
Full text PdfPdf (1.73 MB)
Source ACM SIGOPS Operating Systems Review archive
Volume 36 ,  Issue SI  (Winter 2002) table of contents
OSDI '02: Proceedings of the 5th symposium on Operating systems design and implementation
SPECIAL ISSUE: Cluster resource management table of contents
Pages: 225 - 238  
Year of Publication: 2002
ISSN:0163-5980
Authors
Kai Shen  University of Rochester, Rochester, NY
Hong Tang  University of California, Santa Barbara, CA
Tao Yang  University of California, Santa Barbara, CA
Lingkun Chu  University of California, Santa Barbara, CA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 59,   Citation Count: 5
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/844128.844150
What is a DOI?

ABSTRACT

Client request rates for Internet services tend to be bursty and thus it is important to maintain efficient resource utilization under a wide range of load conditions. Network service clients typically seek services interactively and maintaining reasonable response time is often imperative for such services. In addition, providing differentiated service qualities and resource allocation to multiple service classes can also be desirable at times. This paper presents an integrated resource management framework (part of Neptune system) that provides flexible service quality specification, efficient resource utilization, and service differentiation for cluster-based services. This framework introduces the metric of quality-aware service yield to combine the overall system efficiency and individual service response time in one flexible model. Resources are managed through a two-level request distribution and scheduling scheme. At the cluster level, a fully decentralized request distribution architecture is employed to achieve high scalability and availability. Inside each service node, an adaptive scheduling policy maintains efficient resource utilization under a wide range of load conditions. Our trace-driven evaluations demonstrate the performance, scalability, and service differentiation achieved by the proposed techniques.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
T. F. Abdelzaher and N. Bhatti. Web Server QoS Management by Adaptive Content Delivery. In International Workshop on Quality of Service, London, UK, June 1999.
 
2
J. Almeida, M. Dabu, A. Manikutty, and P. Cao. Providing Differentiated Levels of Service in Web Content Hosting. In Proc. of SIGMETRICS Workshop on Internet Server Performance, Madison, WI, June 1998.
3
 
4
M. Aron, D. Sanders, P. Druschel, and W. Zwaenepoel. Scalable Content-aware Request Distribution in Cluster-based Network Services. In Proc. of the 2000 USENIX Annual Technical Conf., San Diego, CA, June 2000.
 
5
Ask jeeves search, http://www.ask.com.
 
6
 
7
N. Bhatti and R. Friedrich. Web Server Support for Tiered Services. IEEE Network, 13(5):64--71, September 1999.
 
8
J. Bruno, E. Gabber, B. Ozden, and A. Silberschatz. The Eclipse Operating System: Providing Quality of Service via Reservation Domains. In Proc. of USENIX Annual Technical Conf., pages 235--246, Orleans, LA, June 1998.
 
9
S. Chandra, C. S. Ellis, and A. Vahdat. Differentiated Multimedia Web Services Using Quality Aware Transcoding. In Proc. of IEEE INFOCOM'2000, TelAviv, Israel, March 2000.
10
 
11
12
13
 
14
J. Postel Ed. Transmission Control Protocol Specification. SRI International, Menlo Park, CA, September 1981. RFC-793.
 
15
A. Feldmann. Characteristics of TCP Connection Arrivals. Technical report, AT&T Labs Research, 1998.
 
16
17
 
18
Google search, http://www.google.com.
 
19
S. D. Gribble, E. A. Brewer, J. M. Hellerstein, and D. Culler. Scalable, Distributed Data Structures for Internet Service Construction. In Proc. of the 4th USENIX Symposium on Operating Systems Design and Implementation, San Diego, CA, October 2000.
 
20
S. D. Gribble, M. Welsh, E. A. Brewer, and D. Culler. The MultiSpace: An Evolutionary Platform for Infrastructural Services. In Proc. of the USENIX Annual Technical Conf., Monterey, CA, June 1999.
21
 
22
 
23
J. Huang, J. Stankovic, D. Towsley, and K. Ramamritham. Experimental Evaluation of Real-Time Transaction Processing. In Proc. of the Tenth IEEE Real-Time System Symposium, pages 144--153, Santa Monica, CA, 1989.
24
 
25
R. M. Karp. Reducibility among combinatorial problems. In Complexity of Computer Computations, pages 85--103, March 1972.
26
 
27
K. Li and S. Jamin. A Measurement-Based Admission-Controlled Web Server. In Proc. of IEEE INFOCOM'2000, pages 651--659, Tel-Aviv, Israel, March 2000.
28
 
29
J. Mogul and K. K. Ramakrishnan. Eliminating Receive Livelock in an Interrupt-driven Kernel. In Proc. of USENIX Annual Technical Conf., San Diego, CA, January 1996.
 
30
31
32
33
 
34
 
35
K. Shen, T. Yang, L. Chu, J. L. Holliday, D. A. Kuschner, and H. Zhu. Neptune: Scalable Replication Management and Programming Support for Cluster-based Network Services. In Proc. of the 3rd USENIX Symposium on Internet Technologies and Systems, pages 197--208, San Francisco, CA, March 2001.
 
36
 
37
I. Stoica and H. Zhang. LIRA: An Approach for Service Differentiation in the Internet. In Proc. of Nossdav, June 1998.
 
38
D. G. Sullivan and M. I. Seltzer. Isolation with Flexibility: A Resource Management Framework for Central Servers. In Proc. of the 2000 USENIX Annual Technical Conf., San Diego, CA, June 2000.
 
39
 
40
C. A. Waldspurger and W. E. Weihl. Lottery Scheduling: Flexible Proportional-Share Resource Management. In Proc. of USENIX Operating Systems Design and Implementation Symposium, pages 1--11, Monterey, CA, November 1994.
 
41
H. Zhu, H. Tang, and T. Yang. Demand-driven Service Differentiation for Cluster-based Network Servers. In Proc. of IEEE INFOCOM'2001, Anchorage, AK, April 2001.


Collaborative Colleagues:
Kai Shen: colleagues
Hong Tang: colleagues
Tao Yang: colleagues
Lingkun Chu: colleagues