| Classification of software behaviors for failure detection: a discriminative pattern mining approach |
| Full text |
Mov
(17:26),
Pdf
(579 KB)
|
Source
|
International Conference on Knowledge Discovery and Data Mining
archive
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
table of contents
Paris, France
SESSION: Research track papers
table of contents
Pages 557-566
Year of Publication: 2009
ISBN:978-1-60558-495-9
|
|
Authors
|
|
David Lo
|
Singapore Management University, Singapore, Singapore
|
|
Hong Cheng
|
Chinese University of Hong Kong, Hong Kong, China
|
|
Jiawei Han
|
University of Illinois at Urbana-Champaign, Urbana-Champaign, USA
|
|
Siau-Cheng Khoo
|
National University of Singapore, Singapore, Singapore
|
|
Chengnian Sun
|
National University of Singapore, Singapore, Singapore
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 50, Downloads (12 Months): 149, Citation Count: 0
|
|
|
ABSTRACT
Software is a ubiquitous component of our daily life. We often depend on the correct working of software systems. Due to the difficulty and complexity of software systems, bugs and anomalies are prevalent. Bugs have caused billions of dollars loss, in addition to privacy and security threats. In this work, we address software reliability issues by proposing a novel method to classify software behaviors based on past history or runs. With the technique, it is possible to generalize past known errors and mistakes to capture failures and anomalies. Our technique first mines a set of discriminative features capturing repetitive series of events from program execution traces. It then performs feature selection to select the best features for classification. These features are then used to train a classifier to detect failures. Experiments and case studies on traces of several benchmark software systems and a real-life concurrency bug from MySQL server show the utility of the technique in capturing failures and anomalies. On average, our pattern-based classification technique outperforms the baseline approach by 24.68% in accuracy.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
]]Mysql atomicity violation. http://bugs.mysql.com/bug.php?id=169.
|
| |
2
|
]]http://www.mysmu.edu/faculty/davidlo/kdd09.htm, 2009.
|
| |
3
|
|
| |
4
|
|
 |
5
|
|
 |
6
|
|
 |
7
|
|
| |
8
|
]]C-C. Chang and C-J. Lin. LIBSVM: a library for support vector machines, 2001. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm.
|
| |
9
|
]]H. Cheng, X. Yan, J. Han, and C. Hsu. Discriminative frequent pattern analysis for effective classification. In ICDE, pages 716--725, 2007.
|
| |
10
|
|
| |
11
|
William Dickinson , David Leon , Andy Podgurski, Finding failures by cluster analysis of execution profiles, Proceedings of the 23rd International Conference on Software Engineering, p.339-348, May 12-19, 2001, Toronto, Ontario, Canada
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
| |
15
|
Monica Hutchins , Herb Foster , Tarak Goradia , Thomas Ostrand, Experiments of the effectiveness of dataflow- and controlflow-based test adequacy criteria, Proceedings of the 16th international conference on Software engineering, p.191-200, May 16-21, 1994, Sorrento, Italy
|
| |
16
|
]]ITU-T. ITU-T Recommendation Z.120: Message Sequence Chart (MSC). 1999.
|
| |
17
|
|
| |
18
|
]]C. Liu, X. Yan, H. Yu, J. Han, and P.S. Yu. Mining behavior graphs for "backtrace" of noncrashing bugs. In SDM, 2005.
|
| |
19
|
|
 |
20
|
|
 |
21
|
|
 |
22
|
Shan Lu , Soyeon Park , Eunsoo Seo , Yuanyuan Zhou, Learning from mistakes: a comprehensive study on real world concurrency bug characteristics, Proceedings of the 13th international conference on Architectural support for programming languages and operating systems, March 01-05, 2008, Seattle, WA, USA
|
| |
23
|
|
| |
24
|
]]Java Trans. API Spec. java.sun.com/products/jta/.
|
| |
25
|
]]G. Tassey. The economic impacts of inadequate infrastructure for software testing. National Institute of Standards and Technology. Planning Report, 2002.
|
| |
26
|
]]The Apache Software Foundation. Jakarta commons/net. http://jakarta.apache.org/commons/net/.
|
| |
27
|
]]C. Warrender, S. Forrest, and B. Pearlmutter. Detecting intrusions using system calls: Alternative data models. In IEEE Symp. on Security and Privacy, 1999.
|
| |
28
|
]]W. Weimer and G. Necula. Mining temporal specifications for error detection. In Proc. of Int. Conf. on Tools and Algo. for the Const. and Ana. of Sys., 2005.
|
 |
29
|
|
|