ACM Home Page
Please provide us with feedback. Feedback
Reference-driven performance anomaly identification
Full text PdfPdf (593 KB)
Source
Joint International Conference on Measurement and Modeling of Computer Systems archive
Proceedings of the eleventh international joint conference on Measurement and modeling of computer systems table of contents
Seattle, WA, USA
SESSION: Computing and switching table of contents
Pages 85-96  
Year of Publication: 2009
ISBN:978-1-60558-511-6
Authors
Kai Shen  University of Rochester, Rochester, NY, USA
Christopher Stewart  University of Rochester, Rochester, NY, USA
Chuanpeng Li  University of Rochester, Rochester, NY, USA
Xin Li  University of Rochester, Rochester, NY, USA
Sponsors
ACM: Association for Computing Machinery
SIGMETRICS: ACM Special Interest Group on Measurement and Evaluation
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 36,   Downloads (12 Months): 104,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1555349.1555360
What is a DOI?

ABSTRACT

Complex system software allows a variety of execution conditions on system configurations and workload properties. This paper explores a principled use of reference executions--those of similar execution conditions from the target--to help identify the symptoms and causes of performance anomalies. First, to identify anomaly symptoms, we construct change profiles that probabilistically characterize expected performance deviations between target and reference executions. By synthesizing several single-parameter change profiles, we can scalably identify anomalous reference-to-target changes in a complex system with multiple execution parameters. Second, to narrow the scope of anomaly root cause analysis, we filter anomaly-related low-level system metrics as those that manifest very differently between target and reference executions. Our anomaly identification approach requires little expert knowledge or detailed models on system internals and consequently it can be easily deployed. Using empirical case studies on the Linux I/O subsystem and a J2EE-based distributed online service, we demonstrate our approach's effectiveness in identifying performance anomalies over a wide range of execution conditions as well as multiple system software versions. In particular, we discovered five previously unknown performance anomaly causes in the Linux 2.6.23 kernel. Additionally, our preliminary results suggest that online anomaly detection and system reconfiguration may help evade performance anomalies in complex online systems.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Realistic nonstationary online workloads. http://www.cs.rochester.edu/u/stewart/models.html.
 
2
MySQL JDBC driver. http://www.mysql.com/products/connector.
 
3
R.A. Fisher. The arrangement of field experiments. J. of the Ministry of Agriculture of Great Britain, 33:503--513, 1926.
 
4
M. Grindal, J. Offutt, and S.F. Andler. Combination testing strategies: A survey. Software Testing, Verification and Reliability, 15(3):167--199, Mar. 2005.
5
 
6
 
7
8
 
9
Linux kernel bug tracker. http://bugzilla.kernel.org/.
 
10
Linux kernel bug tracker on "many pre-mature anticipation timeouts in anticipatory I/O scheduler". http://bugzilla.kernel.org/show_bug.cgi?id=10756.
11
 
12
 
13
RUBiS: Rice University bidding system. http://rubis.objectweb.org.
 
14
 
15
16
 
17
18
19
20
 
21
22

Collaborative Colleagues:
Kai Shen: colleagues
Christopher Stewart: colleagues
Chuanpeng Li: colleagues
Xin Li: colleagues