ACM Home Page
Please provide us with feedback. Feedback
Querying and mining of time series data: experimental comparison of representations and distance measures
Full text PdfPdf (1.64 MB)
Source
Proceedings of the VLDB Endowment archive
Volume 1 ,  Issue 2  (August 2008) table of contents
SESSION: EXPERIMENTS AND ANALYSES table of contents
Pages 1542-1552  
Year of Publication: 2008
ISSN:2150-8097
Authors
Hui Ding  Northwestern University, Evanston, IL
Goce Trajcevski  Northwestern University, Evanston, IL
Peter Scheuermann  Northwestern University, Evanston, IL
Xiaoyue Wang  University of California, Riverside, CA
Eamonn Keogh  University of California, Riverside, CA
Publisher
Bibliometrics
Downloads (6 Weeks): 38,   Downloads (12 Months): 271,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1454159.1454226
What is a DOI?

ABSTRACT

The last decade has witnessed a tremendous growths of interests in applications that deal with querying and mining of time series data. Numerous representation methods for dimensionality reduction and similarity measures geared towards time series have been introduced. Each individual work introducing a particular method has made specific claims and, aside from the occasional theoretical justifications, provided quantitative experimental observations. However, for the most part, the comparative aspects of these experiments were too narrowly focused on demonstrating the benefits of the proposed methods over some of the previously introduced ones. In order to provide a comprehensive validation, we conducted an extensive set of time series experiments re-implementing 8 different representation methods and 9 similarity measures and their variants, and testing their effectiveness on 38 time series data sets from a wide variety of application domains. In this paper, we give an overview of these different techniques and present our comparative experimental findings regarding their effectiveness. Our experiments have provided both a unified validation of some of the existing achievements, and in some cases, suggested that certain claims in the literature may be unduly optimistic.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Additional Experiment Results for Representation and Similarity Measures of Time Series. http://www.ece.northwestern.edu/~hdi117/tsim.htm.
 
2
R. T. Ng (2006), Note of Caution. http://www.cs.ubc.ca/~rng/psdepository/chebyReport2.pdf.
 
3
H. André-Jönsson and D. Z. Badal. Using signature files for querying time-series data. In PKDD, 1997.
 
4
J. Aßfalg, H.-P. Kriegel, P. Kröger, P. Kunath, A. Pryakhin, and M. Renz. Similarity search on time series based on threshold queries. In EDBT, 2006.
 
5
D. J. Berndt and J. Clifford. Using dynamic time warping to find patterns in time series. In KDD Workshop, 1994.
6
 
7
M. Cardle. Automated motion editing. In Technical Report, Computer Laboratory, University of Cambridge, UK, 2004.
 
8
9
 
10
 
11
 
12
Y. Chen, M. A. Nascimento, B. C. Ooi, and A. K. H. Tung. SpADe: On Shape-based Pattern Detection in Streaming Time Series. In ICDE, 2007.
13
 
14
E. Frentzos, K. Gratsias, and Y. Theodoridis. Index-based most similar trajectory search. In ICDE, 2007.
 
15
 
16
P. Geurts. Contributions to Decision Tree Induction: bias/variance tradeoff and time series classification. PhD thesis, University of Lige, Belgium, May 2002.
 
17
 
18
 
19
 
20
E. Keogh, X. Xi, L. Wei, and C. Ratanamahatana. The UCR Time Series dataset. In http://www.cs.ucr.edu/~eamonn/time_series_data/, 2006.
 
21
 
22
23
 
24
E. J. Keogh, K. Chakrabarti, M. J. Pazzani, and S. Mehrotra. Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases. Knowl. Inf. Syst., 3(3), 2001.
 
25
 
26
 
27
 
28
R. Kohavi. A study of cross-validation and bootstrap for accuracy estimation and model selection. In IJCAI, 1995.
29
 
30
31
 
32
 
33
K. pong Chan and A. W.-C. Fu. Efficient Time Series Matching by Wavelets. In ICDE, 1999.
 
34
I. Popivanov and R. J. Miller. Similarity Search Over Time-Series Data Using Wavelets. In ICDE, 2002.
 
35
C. A. Ratanamahatana and E. J. Keogh. Three myths about dynamic time warping data mining. In SDM, 2005.
 
36
Richard O. Duda and Peter E. Hart. Pattern Classification and Scene Analysis. John Wiley & Sons, 1973.
 
37
38
 
39
 
40
41
42
 
43
 
44
45


Collaborative Colleagues:
Hui Ding: colleagues
Goce Trajcevski: colleagues
Peter Scheuermann: colleagues
Xiaoyue Wang: colleagues
Eamonn Keogh: colleagues