ACM Home Page
Please provide us with feedback. Feedback
Visually mining and monitoring massive time series
Full text PdfPdf (923 KB)
Source International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Seattle, WA, USA
SESSION: Industry/government track papers table of contents
Pages: 460 - 469  
Year of Publication: 2004
ISBN:1-58113-888-1
Authors
Jessica Lin  University of California - Riverside, Riverside, CA
Eamonn Keogh  University of California - Riverside, Riverside, CA
Stefano Lonardi  University of California - Riverside, Riverside, CA
Jeffrey P. Lankford  The Aerospace Corporation, El Segundo, CA
Donna M. Nystrom  The Aerospace Corporation, El Segundo, CA
Sponsors
SIGMOD: ACM Special Interest Group on Management of Data
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 33,   Downloads (12 Months): 187,   Citation Count: 11
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1014052.1014104
What is a DOI?

ABSTRACT

Moments before the launch of every space vehicle, engineering discipline specialists must make a critical go/no-go decision. The cost of a false positive, allowing a launch in spite of a fault, or a false negative, stopping a potentially successful launch, can be measured in the tens of millions of dollars, not including the cost in morale and other more intangible detriments. The Aerospace Corporation is responsible for providing engineering assessments critical to the go/no-go decision for every Department of Defense space vehicle. These assessments are made by constantly monitoring streaming telemetry data in the hours before launch. We will introduce VizTree, a novel time-series visualization tool to aid the Aerospace analysts who must make these engineering assessments. VizTree was developed at the University of California, Riverside and is unique in that the same tool is used for mining archival data and monitoring incoming live telemetry. The use of a single tool for both aspects of the task allows a natural and intuitive transfer of mined knowledge to the monitoring task. Our visualization approach works by transforming the time series into a symbolic representation, and encoding the data in a modified suffix tree in which the frequency and other properties of patterns are mapped onto colors and other visual properties. We demonstrate the utility of our system by comparing it with state-of-the-art batch algorithms on several real and synthetic datasets.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
 
4
Cardle, M. (2004). Ph.D Thesis, in progress. University of Cambridge.
 
5
 
6
Chen, L., Ozsu, T. & Oria, V. (2003). Symbolic Representation and Retrieval of Moving Object Trajectories. University of Waterloo. 2003.
7
 
8
Crochemore, M., Czumaj, A., Gasjeniec, L., Jarominek, S., Lecroq, T., Plandowski, W. & Rytter, W. (1994). Speeding Up Two String-Matching Algorithms. Algorithmica. vol. 12(4/5). pp. 247-267.
 
9
Dasgupta, D. & Forrest, S. (1999). Novelty Detection in Time Series Data Using Ideas from Immunology. In proceedings of the 8th Int'l Conference on Intelligent Systems. Denver, CO, Jun 24-26.
 
10
Durbin, R., Eddy, S., Krogh, A. & Mitchison, G. (1998). Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids. Cambridge University Press.
11
 
12
Gabglio, A. (1888). Theoria Generale Della Statistica, 2nd ed. Milan.
 
13
Goldberger, A. L., Amaral, L. A., Glass, L., Hausdorff, J. M., Ivanov, P. C., Mark, R. G., Mietus, J. E., Moody, G. B., Peng, C. K. & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Componenets of a New Research Resource for Complex Physiologic Signals. Circulation. vol. 101(23), June 13. pp. e215--e220. {Circulation Electronic Pages; http://circ.ahajournals.org/cgi/content/full/101/23/e215}
 
14
15
 
16
Huettel, S., Mack, P. B. & McCarthy, G. (2002). Perceiving Patterns in Random Series: Dynamic Processing of Sequence in Prefrontal Cortex. Nature Neuroscience. vol. 5. pp. 485--490.
 
17
Isaac, D. & Lynnes, C. (2003). Automated Data Quality Assessment in the Intelligent Archive, White Paper prepared for the Intelligent Data Understanding program. 2003. pp. 17.
 
18
 
19
 
20
Keogh, E. The UCR Time Series Data Mining Archive. http://www.cs.ucr.edu/~eamonn/tsdma/index.html
21
22
 
23
Keogh, E. & Lin, J. (2004). Clustering of Time Series Subsequences is Meaningless: Implications for Previous and Future Research. Knowledge and Information Systems Journal. To Appear
24
 
25
Kim, S., Kim, Y., Ahn, T., Nam, H. K., Han, B. J. & Kim, S. M. (2000). Visualysis: A Tool for Biological Sequence Analysis. In proceedings of the 4th Int'l Conference on Computational Molecular Biology. Tokyo, Japan, Apr 8--11.
 
26
Lankford, J. P. & Quan, A. (2002). Evolution of Knowledge-Based Applications for Launch Support. In proceedings of Ground System Architecture Workshop. El Segundo, CA.
 
27
Lin, J. VizTree Website. http://www.cs.ucr.edu/~jessica/viztree.htm
28
 
29
Lin, J., Keogh, E., Patel, P. & Lonardi, S. (2002). Finding Motifs in Time Series. In the 2nd Workshop on Temporal Data Mining, the 8th ACM Int'l Conference on Knowledge Discovery and Data Mining. Edmonton, Alberta, Canada. July 23-26, 2002.
30
31
 
32
 
33
Ohsaki, M., Sato, Y., Yokoi, H. & Yamaguchi, T. (2003). A Rule Discovery Support System for Sequential Medical Data, in the Case Study of a Chronic Hepatitis Dataset. In Discovery Challenge Workshop, the 14th European Conference on Machine Learning/the 7th European Conference on Principles and Practice of Knowledge Discovery in Databases. Cavtat-Dubrovnik, Croatia. Sep 22-26, 2003.
 
34
 
35
Reinert, G., Schbath, S. & Waterman, M. S. (2000). Probabilistic and Statistical Properties of Words: An Overview. Journal of Computational Biology. vol. 7. pp. 1--46.
 
36
 
37
 
38
Tanaka, Y. & Uehara, K. (2003). Discover Motifs in Multi Dimensional Time-Series Using the Principal Component Analysis and the MDL Principle. In proceedings of the 3rd Int'l Conference on Machine Learning and Data Mining in Pattern Recognition. Leipzig, Germany, Jul 5-7. pp. 252--265.
 
39
 
40
 
41

CITED BY  11

Collaborative Colleagues:
Jessica Lin: colleagues
Eamonn Keogh: colleagues
Stefano Lonardi: colleagues
Jeffrey P. Lankford: colleagues
Donna M. Nystrom: colleagues