|
ABSTRACT
Real production applications ranging from enterprise applications to large e-commerce sites share a crucial but seldom-noted characteristic: The relative frequencies of transaction types in their workloads are nonstationary, i.e., the transaction mix changes over time. Accurately predicting application-level performance in business-critical production applications is an increasingly important problem. However, transaction mix nonstationarity casts doubt on the practical usefulness of prediction methods that ignore this phenomenon. This paper demonstrates that transaction mix nonstationarity enables a new approach to predicting application-level performance as a function of transaction mix. We exploit nonstationarity to circumvent the need for invasive instrumentation and controlled benchmarking during model calibration; our approach relies solely on lightweight passive measurements that are routinely collected in today's production environments. We evaluate predictive accuracy on two real business-critical production applications. The accuracy of our response time predictions ranges from 10% to 16% on these applications, and our models generalize well to workloads very different from those used for calibration. We apply our technique to the challenging problem of predicting the impact of application consolidation on transaction response times. We calibrate models of two testbed applications running on dedicated machines, then use the models to predict their performance when they run together on a shared machine and serve very different workloads. Our predictions are accurate to within 4% to 14%. Existing approaches to consolidation decision support predict post-consolidation resource utilizations. Our method allows application-level performance to guide consolidation decisions.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
A. Andrzejak, M. Arlitt, and J. A. Rolia. Bounding the resource savings of utility computing models. Technical Report HPL-2002-339, HP Labs, Dec. 2002.
|
 |
2
|
|
 |
3
|
|
 |
4
|
|
| |
5
|
|
 |
6
|
|
| |
7
|
Paul Barham , Austin Donnelly , Rebecca Isaacs , Richard Mortier, Using magpie for request extraction and workload modelling, Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation, p.18-18, December 06-08, 2004, San Francisco, CA
|
| |
8
|
I. Barrodale and F. Roberts. An improved algorithm for discrete LI linear approximations. SIAM Journal of Numerical Analysis, 10:839--848, 1973.
|
 |
9
|
Forest Baskett , K. Mani Chandy , Richard R. Muntz , Fernando G. Palacios, Open, Closed, and Mixed Networks of Queues with Different Classes of Customers, Journal of the ACM (JACM), v.22 n.2, p.248-260, April 1975
[doi> 10.1145/321879.321887]
|
| |
10
|
|
| |
11
|
L. Breslau, P. Cao, L. Fan, G. Phillips, and S. Shenker. Web caching and Zipf-like distributions: Evidence and implications. In INFOCOM, Mar. 1999.
|
 |
12
|
Jin Cao , William S. Cleveland , Dong Lin , Don X. Sun, On the nonstationarity of Internet traffic, Proceedings of the 2001 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, p.102-112, June 2001, Cambridge, Massachusetts, United States
|
 |
13
|
|
| |
14
|
|
| |
15
|
Ira Cohen , Moises Goldszmidt , Terence Kelly , Julie Symons , Jeffrey S. Chase, Correlating instrumentation data to system states: a building block for automated diagnosis and control, Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation, p.16-16, December 06-08, 2004, San Francisco, CA
|
 |
16
|
Ira Cohen , Steve Zhang , Moises Goldszmidt , Julie Symons , Terence Kelly , Armando Fox, Capturing, indexing, clustering, and retrieving system history, Proceedings of the twentieth ACM symposium on Operating systems principles, October 23-26, 2005, Brighton, United Kingdom
|
 |
17
|
|
 |
18
|
|
| |
19
|
Hewlett-Packard. OpenView Transaction Analyzer, Sept. 2006. http://h20229.www2.hp.com/products/tran/.
|
| |
20
|
R. Jain. The Art of Computer Systems Performance Analysis. John Wiley & Sons, 1991.
|
| |
21
|
The JBoss J2EE Application Server. http://www.jboss.com.
|
 |
22
|
|
| |
23
|
|
| |
24
|
T. Kelly and A. Zhang. Predicting performance in distributed enterprise applications. Technical Report HPL-2006-76, HP Labs, May 2006. http://www.hpl.hp.com/techreports/2006/HPL-2006-76.html.
|
| |
25
|
D. Krishnamurthy, J. A. Rolia, and S. Majumdar. SWAT: A tool for stress testing session-based Web applications. In Computer Measurement Group Conf., Dec. 2003.
|
| |
26
|
|
 |
27
|
|
| |
28
|
|
 |
29
|
Daniel Menascé , Virgílio Almeida , Rudolf Riedi , Flávia Ribeiro , Rodrigo Fonseca , Wagner Meira, Jr., In search of invariants for e-business workloads, Proceedings of the 2nd ACM conference on Electronic commerce, p.56-65, October 17-20, 2000, Minneapolis, Minnesota, United States
[doi> 10.1145/352871.352878]
|
 |
30
|
Daniel A. Menascé , Virgilio A. F. Almeida , Rodrigo Fonseca , Marco A. Mendes, A methodology for workload characterization of E-commerce sites, Proceedings of the 1st ACM conference on Electronic commerce, p.119-128, November 03-05, 1999, Denver, Colorado, United States
[doi> 10.1145/336992.337024]
|
| |
31
|
Mercury Interactive. Loadrunner load tester, Sept. 2006. http://www.mercury.com/us/products/performance-center/loadrunner/.
|
| |
32
|
MySQL database. http://www.mysql.com.
|
| |
33
|
J. Neter, M. H. Kutner, C. J. Nachtsheim, and W. Wasserman. Applied Linear Statistical Models. Irwin, fourth edition, 1996.
|
| |
34
|
C. R. Rao and H. Toutenburg. Linear Models: Least Squares and Alternatives. Springer, 1999.
|
 |
35
|
|
| |
36
|
|
| |
37
|
J. A. Rolia, X. Zhu, and M. Arlitt. Resource access management for a resource utility for commercial applications. In Int'l Sympos. on Integrated Network Mgmt. (IM), Mar. 2003.
|
| |
38
|
Rice University Bidding System (RUBiS), Mar. 2004. http://rubis.objectweb.org/.
|
| |
39
|
|
| |
40
|
|
| |
41
|
|
| |
42
|
The Stock Online Benchmark. http://forge.objectweb.org/projects/stock-online.
|
| |
43
|
D. W. Stroock. An Introduction to Markov Processes. Springer, May 2005.
|
| |
44
|
R. Talaber. Server consolidation assessments with VMware CapacityPlanner, Oct. 2005. http://downloads.vmware.com/vmworld/2005/pac196.pdf.
|
| |
45
|
The Open Group. Application response measurement (ARM), Sept. 2006. http://www.opengroup.org/management/arm.
|
 |
46
|
Eno Thereska , Brandon Salmon , John Strunk , Matthew Wachs , Michael Abd-El-Malek , Julio Lopez , Gregory R. Ganger, Stardust: tracking activity in a distributed storage system, Proceedings of the joint international conference on Measurement and modeling of computer systems, June 26-30, 2006, Saint Malo, France
|
| |
47
|
Transaction Processing Performance Council. TPC-W benchmark, Apr. 2005. http://www.tpc.org/tpcw/.
|
 |
48
|
Bhuvan Urgaonkar , Giovanni Pacifici , Prashant Shenoy , Mike Spreitzer , Asser Tantawi, An analytical model for multi-tier internet services and its applications, Proceedings of the 2005 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, June 06-10, 2005, Banff, Alberta, Canada
|
 |
49
|
|
| |
50
|
VMWare. Server capacity planning and consolidation, Oct. 2005. http://www.vmware.com/news/releases/vac_services.html.
|
 |
51
|
|
| |
52
|
R. R. Wilcox. Introduction to Robust Estimation and Hypothesis Testing. Elsevier, second edition, 2005.
|
CITED BY 13
|
|
Christopher Stewart , Terence Kelly , Alex Zhang , Kai Shen, A dollar from 15 cents: cross-platform management for internet services, USENIX 2008 Annual Technical Conference on Annual Technical Conference, p.199-212, June 22-27, 2008, Boston, Massachusetts
|
|
|
|
|
|
|
|
|
Mumtaz Ahmad , Ashraf Aboulnaga , Shivnath Babu , Kamesh Munagala, Modeling and exploiting query interactions in database systems, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
Qi Zhang , Ludmila Cherkasova , Guy Mathews , Wayne Greene , Evgenia Smirni, R-Capriccio: a capacity planning and anomaly detection tool for enterprise services with live workloads, Proceedings of the ACM/IFIP/USENIX 2007 International Conference on Middleware, November 26-30, 2007, Newport Beach, California
|
|
|
|
|
|
Kai Shen , Christopher Stewart , Chuanpeng Li , Xin Li, Reference-driven performance anomaly identification, Proceedings of the eleventh international joint conference on Measurement and modeling of computer systems, June 15-19, 2009, Seattle, WA, USA
|
|
|
|
|
|
|
|
|
|
INDEX TERMS
Primary Classification:
C.
Computer Systems Organization
C.4
PERFORMANCE OF SYSTEMS
General Terms:
Experimentation,
Management,
Measurement,
Performance
Keywords:
LAR regression,
enterprise,
internet services,
mutli-tier,
noninvasive,
nonstationarity,
performance prediction,
realistic workloads
|