ACM Home Page
Please provide us with feedback. Feedback
Approximate embedding-based subsequence matching of time series
Full text PdfPdf (472 KB)
Source
International Conference on Management of Data archive
Proceedings of the 2008 ACM SIGMOD international conference on Management of data table of contents
Vancouver, Canada
SESSION: Research Session 9: Strings and Time table of contents
Pages 365-378  
Year of Publication: 2008
ISBN:978-1-60558-102-6
Authors
Vassilis Athitsos  University of Texas at Arlington, Arlington, TX, USA
Panagiotis Papapetrou  Boston University, Boston, MA, USA
Michalis Potamias  Boston University, Boston, MA, USA
George Kollios  Boston University, Boston, MA, USA
Dimitrios Gunopulos  University of Athens, Athens, Greece
Sponsors
ACM: Association for Computing Machinery
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 13,   Downloads (12 Months): 186,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1376616.1376656
What is a DOI?

ABSTRACT

A method for approximate subsequence matching is introduced, that significantly improves the efficiency of subsequence matching in large time series data sets under the dynamic time warping (DTW) distance measure. Our method is called EBSM, shorthand for Embedding-Based Subsequence Matching. The key idea is to convert subsequence matching to vector matching using an embedding. This embedding maps each database time series into a sequence of vectors, so that every step of every time series in the database is mapped to a vector. The embedding is computed by applying full dynamic time warping between reference objects and each database time series. At runtime, given a query object, an embedding of that object is computed in the same manner, by running dynamic time warping between the reference objects and the query. Comparing the embedding of the query with the database vectors is used to efficiently identify relatively few areas of interest in the database sequences. Those areas of interest are then fully explored using the exact DTW-based subsequence matching algorithm. Experiments on a large, public time series data set produce speedups of over one order of magnitude compared to brute-force search, with very small losses (< 1%) in retrieval accuracy.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
J. Alon, V. Athitsos, and S. Sclaroff. Accurate and efficient gesture spotting via pruning and subgesture reasoning. In IEEE Workshop on Human Computer Interaction, pages 189--198, 2005.
 
2
3
4
 
5
6
7
8
 
9
 
10
 
11
 
12
13
 
14
15
 
16
17
 
18
E. Keogh, X. Xi, L. Wei, and C. A. Ratanamahatana. The UCR time series classification/clustering homepage: www.cs.ucr.edu/~eamonn/time_series_data/, 2006.
 
19
 
20
J. B. Kruskall and M. Liberman. The symmetric time warping algorithm: From continuous to discrete. In Time Warps. Addison-Wesley, 1983.
 
21
 
22
23
 
24
 
25
P. Morguet and M. Lang. Spotting dynamic hand gestures in video image sequences using hidden Markov models. In IEEE International Conference on Image Processing, pages 193--197, 1998.
 
26
R. Oka. Spotting method for classification of real world data. The Computer Journal, 41(8):559--565, July 1998.
 
27
28
29
 
30
T. M. Rath and R. Manmatha. Word image matching using dynamic time warping. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), volume 2, pages 521--527, 2003.
 
31
Y. Sakurai, C. Faloutsos, and M. Yamamuro. Stream monitoring under the time warping distance. In IEEE International Conference on Data Engineering (ICDE), 2007.
32
 
33
 
34
35
 
36
37
 
38
X. Wang, J. T. L. Wang, K. I. Lin, D. Shasha, B. A. Shapiro, and K. Zhang. An index structure for data mining and clustering. Knowledge and Information Systems, 2(2):161--184, 2000.
 
39
 
40
 
41
D. A. White and R. Jain. Similarity indexing: Algorithms and performance. In Storage and Retrieval for Image and Video Databases (SPIE), pages 62--73, 1996.
42
 
43
44


Collaborative Colleagues:
Vassilis Athitsos: colleagues
Panagiotis Papapetrou: colleagues
Michalis Potamias: colleagues
George Kollios: colleagues
Dimitrios Gunopulos: colleagues