Resource overbooking and application profiling in a shared Internet hosting platform
Source
ACM Transactions on Internet Technology (TOIT)
Volume 9, Issue 1 (February 2009)
Article No. 1  
Year of Publication: 2009
ISSN:1533-5399
Authors
Bhuvan Urgaonkar  The Penn State University, University Park, PA
Prashant Shenoy  University of Massachusetts, Amherst, MA
Timothy Roscoe  ETH Zürich, Zürich, Switzerland
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/1462159.1462160

ABSTRACT

In this article, we present techniques for provisioning CPU and network resources in shared Internet hosting platforms running potentially antagonistic third-party applications. The primary contribution of our work is to demonstrate the feasibility and benefits of overbooking resources in shared Internet platforms. Since an accurate estimate of an application's resource needs is necessary when overbooking resources, we present techniques to profile applications on dedicated nodes, possibly while in service, and use these profiles to guide the placement of application components onto shared nodes. We then propose techniques to overbook cluster resources in a controlled fashion. We outline an empirical approach to determine the degree of overbooking that allows a platform to achieve improvements in revenue while providing performance guarantees to Internet applications. We show how our techniques can be combined with commonly used QoS resource allocation mechanisms to provide application isolation and performance guarantees at run-time. We implement our techniques in a Linux cluster and evaluate them using common server applications. We find that the efficiency (and consequently revenue) benefits from controlled overbooking of resources can be dramatic. Specifically, we find that by overbooking resources by as little as 1%, we can increase the utilization of the cluster by a factor of two, and that a 5% overbooking yields a 300% to 500% improvement, while still providing useful resource guarantees to applications.
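
The abstract does not spell out how an overbooking percentage is turned into a reservation. As a rough illustration only, the sketch below assumes one plausible reading: a profile is a list of measured resource-usage samples, and overbooking by p% means reserving the smallest amount that covers all but p% of those samples. The function names, the nearest-rank percentile rule, and the sample values are hypothetical and are not taken from the article.

```python
import math


def reservation(samples, overbook_pct):
    """Return the amount to reserve so that at most overbook_pct percent
    of the profiled samples exceed it (nearest-rank percentile).

    Assumption: this percentile rule is an illustration, not the
    article's stated method.
    """
    if not samples:
        raise ValueError("profile must contain at least one sample")
    if not 0 <= overbook_pct < 100:
        raise ValueError("overbook_pct must be in [0, 100)")
    ordered = sorted(samples)
    n = len(ordered)
    # Smallest rank such that (100 - overbook_pct)% of samples are covered.
    rank = max(1, math.ceil((100 - overbook_pct) * n / 100))
    return ordered[rank - 1]


def apps_per_node(node_capacity, samples, overbook_pct):
    """Count how many identical applications fit on one node when each
    is given the reservation computed above."""
    return int(node_capacity // reservation(samples, overbook_pct))


# Hypothetical bursty profile: usage is 10 units in 95% of the samples
# and 100 units in the remaining 5%.
profile = [10] * 95 + [100] * 5

worst_case = apps_per_node(100, profile, 0)   # reserve for the peak
overbooked = apps_per_node(100, profile, 5)   # tolerate 5% violations
```

With this made-up profile, reserving for the peak fits one application per node, while tolerating violations in 5% of the samples fits ten. The numbers only show why bursty workloads make small overbooking percentages pay off; they do not reproduce the article's measurements.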


