| Approximate embedding-based subsequence matching of time series |
| Full text |
Pdf
(472 KB)
|
Source
|
International Conference on Management of Data
archive
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
table of contents
Vancouver, Canada
SESSION: Research Session 9: Strings and Time
table of contents
Pages 365-378
Year of Publication: 2008
ISBN:978-1-60558-102-6
|
|
Authors
|
|
Vassilis Athitsos
|
University of Texas at Arlington, Arlington, TX, USA
|
|
Panagiotis Papapetrou
|
Boston University, Boston, MA, USA
|
|
Michalis Potamias
|
Boston University, Boston, MA, USA
|
|
George Kollios
|
Boston University, Boston, MA, USA
|
|
Dimitrios Gunopulos
|
University of Athens, Athens, Greece
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 13, Downloads (12 Months): 186, Citation Count: 1
|
|
|
ABSTRACT
A method for approximate subsequence matching is introduced, that significantly improves the efficiency of subsequence matching in large time series data sets under the dynamic time warping (DTW) distance measure. Our method is called EBSM, shorthand for Embedding-Based Subsequence Matching. The key idea is to convert subsequence matching to vector matching using an embedding. This embedding maps each database time series into a sequence of vectors, so that every step of every time series in the database is mapped to a vector. The embedding is computed by applying full dynamic time warping between reference objects and each database time series. At runtime, given a query object, an embedding of that object is computed in the same manner, by running dynamic time warping between the reference objects and the query. Comparing the embedding of the query with the database vectors is used to efficiently identify relatively few areas of interest in the database sequences. Those areas of interest are then fully explored using the exact DTW-based subsequence matching algorithm. Experiments on a large, public time series data set produce speedups of over one order of magnitude compared to brute-force search, with very small losses (< 1%) in retrieval accuracy.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
J. Alon, V. Athitsos, and S. Sclaroff. Accurate and efficient gesture spotting via pruning and subgesture reasoning. In IEEE Workshop on Human Computer Interaction, pages 189--198, 2005.
|
| |
2
|
|
 |
3
|
|
 |
4
|
|
| |
5
|
|
 |
6
|
|
 |
7
|
Christos Faloutsos , King-Ip Lin, FastMap: a fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets, Proceedings of the 1995 ACM SIGMOD international conference on Management of data, p.163-174, May 22-25, 1995, San Jose, California, United States
|
 |
8
|
Christos Faloutsos , M. Ranganathan , Yannis Manolopoulos, Fast subsequence matching in time-series databases, Proceedings of the 1994 ACM SIGMOD international conference on Management of data, p.419-429, May 24-27, 1994, Minneapolis, Minnesota, United States
|
| |
9
|
|
| |
10
|
|
| |
11
|
|
| |
12
|
|
 |
13
|
|
| |
14
|
|
 |
15
|
K. V. Ravi Kanth , Divyakant Agrawal , Ambuj Singh, Dimensionality reduction for similarity searching in dynamic databases, Proceedings of the 1998 ACM SIGMOD international conference on Management of data, p.166-176, June 01-04, 1998, Seattle, Washington, United States
|
| |
16
|
|
 |
17
|
|
| |
18
|
E. Keogh, X. Xi, L. Wei, and C. A. Ratanamahatana. The UCR time series classification/clustering homepage: www.cs.ucr.edu/~eamonn/time_series_data/, 2006.
|
| |
19
|
|
| |
20
|
J. B. Kruskall and M. Liberman. The symmetric time warping algorithm: From continuous to discrete. In Time Warps. Addison-Wesley, 1983.
|
| |
21
|
|
| |
22
|
|
 |
23
|
|
| |
24
|
|
| |
25
|
P. Morguet and M. Lang. Spotting dynamic hand gestures in video image sequences using hidden Markov models. In IEEE International Conference on Image Processing, pages 193--197, 1998.
|
| |
26
|
R. Oka. Spotting method for classification of real world data. The Computer Journal, 41(8):559--565, July 1998.
|
| |
27
|
|
 |
28
|
|
 |
29
|
|
| |
30
|
T. M. Rath and R. Manmatha. Word image matching using dynamic time warping. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), volume 2, pages 521--527, 2003.
|
| |
31
|
Y. Sakurai, C. Faloutsos, and M. Yamamuro. Stream monitoring under the time warping distance. In IEEE International Conference on Data Engineering (ICDE), 2007.
|
 |
32
|
|
| |
33
|
|
| |
34
|
|
 |
35
|
|
| |
36
|
|
 |
37
|
|
| |
38
|
X. Wang, J. T. L. Wang, K. I. Lin, D. Shasha, B. A. Shapiro, and K. Zhang. An index structure for data mining and clustering. Knowledge and Information Systems, 2(2):161--184, 2000.
|
| |
39
|
|
| |
40
|
|
| |
41
|
D. A. White and R. Jain. Similarity indexing: Algorithms and performance. In Storage and Retrieval for Image and Video Databases (SPIE), pages 62--73, 1996.
|
 |
42
|
Huanmei Wu , Betty Salzberg , Gregory C Sharp , Steve B Jiang , Hiroki Shirato , David Kaeli, Subsequence matching on structured time series data, Proceedings of the 2005 ACM SIGMOD international conference on Management of data, June 14-16, 2005, Baltimore, Maryland
[doi> 10.1145/1066157.1066235]
|
| |
43
|
|
 |
44
|
|
|