|
ABSTRACT
Continuous growth in sensor data and other temporal data increases the importance of retrieval and similarity search in time series data. Efficient time series query processing is crucial for interactive applications. Existing multidimensional indexes like the R-tree provide efficient querying for only relatively few dimensions. Time series are typically long which corresponds to extremely high dimensional data in multidimensional indexes. Due to massive overlap of index descriptors, multidimensional indexes degenerate for high dimensions and access the entire data by random I/O. Consequently, the efficiency benefits of indexing are lost. In this paper, we propose the TS-tree (time series tree), an index structure for efficient time series retrieval and similarity search. Exploiting inherent properties of time series quantization and dimensionality reduction, the TS-tree indexes high-dimensional data in an overlap-free manner. During query processing, powerful pruning via quantized separator and meta data information greatly reduces the number of pages which have to be accessed, resulting in substantial speed-up. In thorough experiments on synthetic and real world time series data we demonstrate that our TS-tree outperforms existing approaches like the R*-tree or the quantized A-tree.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
R. Bayer and E. M. McCreight. Organization and maintenance of large ordered indexes. In ACM SIGFIDET Workshop, pages 107--141, 1970.
|
| |
3
|
R. Bayer and E. M. McCreight. Organization and maintenance of large ordered indices. Acta Inform., 1:173--189, 1972.
|
 |
4
|
|
 |
5
|
Norbert Beckmann , Hans-Peter Kriegel , Ralf Schneider , Bernhard Seeger, The R*-tree: an efficient and robust access method for points and rectangles, Proceedings of the 1990 ACM SIGMOD international conference on Management of data, p.322-331, May 23-26, 1990, Atlantic City, New Jersey, United States
|
| |
6
|
|
| |
7
|
|
| |
8
|
D. J. Berndt and J. Clifford. Using dynamic time warping to find patterns in time series. In AAAI KDD Workshop, pages 229--248, 1994.
|
| |
9
|
|
 |
10
|
|
| |
11
|
C. Chatfield. The Analysis of Time Series: an Introduction. Chapman and Hall, 1996.
|
 |
12
|
|
| |
13
|
|
 |
14
|
|
 |
15
|
|
| |
16
|
|
| |
17
|
E. Keogh. UCR time series archive, http://www.cs.ucr.edu/~eamonn/TSDMA/datasets.html, 2006.
|
| |
18
|
E. Keogh, K. Chakrabarti, M. Pazzani, and Mehrotra. Dimensionality reduction for fast similarity search in large time series databases. KAIS, pages 263--286, 2000.
|
 |
19
|
|
 |
20
|
Jessica Lin , Eamonn Keogh , Stefano Lonardi , Bill Chiu, A symbolic representation of time series, with implications for streaming algorithms, Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery, June 13-13, 2003, San Diego, California
[doi> 10.1145/882082.882086]
|
| |
21
|
|
 |
22
|
|
| |
23
|
H. Sakoe and S. Chiba. A dynamic programming approach to continuous speech recognition. In Proc. ICA, pages 65--68, 1971.
|
| |
24
|
|
 |
25
|
|
| |
26
|
|
 |
27
|
Michail Vlachos , Christopher Meek , Zografoula Vagena , Dimitrios Gunopulos, Identifying similarities, periodicities and bursts for online search queries, Proceedings of the 2004 ACM SIGMOD international conference on Management of data, June 13-18, 2004, Paris, France
[doi> 10.1145/1007568.1007586]
|
| |
28
|
|
| |
29
|
|
 |
30
|
|
|