| An integrated framework on mining logs files for computing system management |
| Full text |
Pdf
(894 KB)
|
| Source
|
International Conference on Knowledge Discovery and Data Mining
archive
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
table of contents
Chicago, Illinois, USA
POSTER SESSION: Industry/government track poster
table of contents
Pages: 776 - 781
Year of Publication: 2005
ISBN:1-59593-135-X
|
|
Authors
|
|
Tao Li
|
Florida International University, Miami, FL
|
|
Feng Liang
|
Duke University, Durham, NC
|
|
Sheng Ma
|
IBM T.J. Watson Research Center, Hawthorne, NY
|
|
Wei Peng
|
Florida International University, Miami, FL
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 14, Downloads (12 Months): 93, Citation Count: 6
|
|
|
ABSTRACT
Traditional approaches to system management have been largely based on domain experts through a knowledge acquisition process that translates domain knowledge into operating rules and policies. This has been well known and experienced as a cumbersome, labor intensive, and error prone process. In addition, this process is difficult to keep up with the rapidly changing environments. In this paper, we will describe our research efforts on establishing an integrated framework for mining system log files for automatic management. In particular, we apply text mining techniques to categorize messages in log files into common situations, improve categorization accuracy by considering the temporal characteristics of log messages, develop temporal mining techniques to discover the relationships between different events, and utilize visualization tools to evaluate and validate the interesting temporal patterns for system management.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Mark Berman. Testing for spatial association between a point process and another stochastic process. Applied Statistics, 35(1):54--62, 1986.
|
| |
2
|
M. Chessell. Specification: Common base event, 2003. http://www-106.ibm.com /developerworks/webservices/library/ws-cbe/.
|
| |
3
|
Noel A.C. Cressie. Statistics for spatial data. John Wiley & Sons, 1991.
|
| |
4
|
Joseph L. Hellerstein, Sheng Ma, and Chang shing Perng. Discover actionable patterns in event data. IBM System Journal, 41(3):475--493, 2002.
|
| |
5
|
|
| |
6
|
|
| |
7
|
Nicholas Kushmerick, Edward Johnston, and Stephen McGuinness. Information extraction by text classification. Proceedings of the IJCAI-01 Workshop on Adaptive Text Extraction and Mining, 2001.
|
| |
8
|
T. R. Leek. Information extraction using hidden markov models. Master's thesis, UC San Diego, 1997.
|
| |
9
|
|
| |
10
|
|
| |
11
|
Feng Liang, Sheng Ma, and Joseph L. Hellerstein. Discovering fully dependent patterns. In SIAM DM, 2002.
|
| |
12
|
|
| |
13
|
|
| |
14
|
Heikki Mannila, Hannu Toivonen, and A. Inkeri Verkamo. Discovering frequent episodes in sequences. In Proceedings of the First International Conference on Knowledge Discovery and Data Mining (SIGKDD'95), pages 210--215. AAAI Press, 1995.
|
| |
15
|
A. McCallum and K. Nigam. A comparison of event models for naive Bayes text classification. In AAAI-98 Workshop on Learning for Text Categorization, 1998.
|
| |
16
|
|
| |
17
|
|
| |
18
|
L. R. Rabiner. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of IEEE, 77(2):257--286, 1989.
|
| |
19
|
IBM Market Research. Autonomic computing core technology study, 2003.
|
| |
20
|
Irina Rish. An empirical study of the naive Bayes classifier. In Proceedings of IJCAI-01 workshop on Empirical Methods in AI, pages 41--46, 2001.
|
 |
21
|
|
| |
22
|
|
| |
23
|
D. Stoyan, W.S. Kendall, and J. Mecke. Stochastic Geometry and its Applications. John wiley and Sons, 1995.
|
| |
24
|
Brad Topol, David Ogle, Donna Pierson, Jim Thoensen, John Sweitzer, Marie Chow, Mary Ann Hoffmann, Pamela Durham, Ric Telford, Sulabha Sheth, and Thomas Studwell. Automating problem determination: A first step toward self-healing computing systems. IBM White Paper, October 2003. http://www-106.ibm.com/developerworks/autonomic/library/ac-summary/ac-prob.html.
|
CITED BY 6
|
|
Chris Ding , Tao Li , Wei Peng , Haesun Park, Orthogonal nonnegative matrix t-factorizations for clustering, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, August 20-23, 2006, Philadelphia, PA, USA
|
|
|
Sivan Sabato , Elad Yom-Tov , Aviad Tsherniak , Saharon Rosset, Analyzing system logs: a new view of what's important, Proceedings of the 2nd USENIX workshop on Tackling computer systems problems with machine learning techniques, p.1-7, April 10, 2007, Cambridge, MA
|
|
|
|
|
|
|
|
|
Wei Peng , Charles Perng , Tao Li , Haixun Wang, Event summarization for system management, Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, August 12-15, 2007, San Jose, California, USA
|
|
|
|
|