|
ABSTRACT
We consider single and multiple string matching in small space and optimal average time. Our algorithm is based on the combination of compressed self-indexes and Backward-DAWG-Matching (BDM) algorithm. We consider several implementation techniques having different space/time and implementation complexity trade-offs. The experimental results show that our approach has much smaller space requirements than BDM, while being much faster and easier to implement. We show that some of our techniques can boost the search speed of compressed self-indexes as well, using a small amount of additional space.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
Allauzen, C. and Raffinot, M. 1999. Factor oracle of a set of words. Tech. rep. 99--11, Institut Gaspard-Monge, Université de Marne-la-Vallée.
|
 |
3
|
|
 |
4
|
|
| |
5
|
Burrows, M. and Wheeler, D. 1994. A block sorting lossless data compression algorithm. Tech. rep. 124, Digital Equipment Corporation.
|
| |
6
|
|
| |
7
|
Crochemore, M., Czumaj, A., Gasieniec, L., Jarominek, S., Lecroq, T., Plandowski, W., and Rytter, W. 1994. Speeding up two string matching algorithms. Algorithmica 12, 4/5, 247--267.
|
 |
8
|
|
| |
9
|
|
| |
10
|
Crochemore, M. and Rytter, W. 1995. Squares, cubes and time-space efficient string-searching. Algorithmica 13, 5, 405--425.
|
| |
11
|
Crochemore, M. and Rytter, W. 2002. Jewels of Stringology. World Scientific.
|
| |
12
|
|
 |
13
|
|
 |
14
|
|
| |
15
|
Fredriksson, K. 2007. Succinct pattern matching automata. Tech, rep. A-2007-1, Department of Computer Science, University of Joensuu.
|
| |
16
|
Fredriksson, K. and Nikitin, F. 2007. Simple compression code supporting random access and fast string matching. In Proceedings of the 6th Workshop on Efficient and Experimental Algorithms (WEA'07). LNCS 4525. Springer--Verlag, 203--216.
|
| |
17
|
Galil, Z. and Seiferas, J. 1981. Linear-time string matching using only a fixed number of local storage locations. Theor. Comput. Sci. 13, 3, 331--336.
|
 |
18
|
|
| |
19
|
González, R., Grabowski, S., M&akinenuml;, V., and Navarro, G. 2005. Practical implementation of rank and select queries. In Poster Proceedings Volume of 4th Workshop on Efficient and Experimental Algorithms (WEA'05). CTI Press and Ellinika Grammata, Greece, 27--38.
|
| |
20
|
|
| |
21
|
|
| |
22
|
Horspool, R. N. 1980. Practical fast searching in strings. Softw. Pract. Exp. 10, 6, 501--506.
|
| |
23
|
Huffman, D. A. 1951. A method for the construction of minimum redundancy codes. Proceedings of the I.R.E. 40, 1098--1101.
|
 |
24
|
|
| |
25
|
|
| |
26
|
|
| |
27
|
Kim, D. K., Na, J. C., Kim, J. E., and Park., K. 2005. Efficient implementation of rank and select functions for succinct representation. In Proceedings of the 4th Workshop on Efficient and Experimental Algorithms (WEA'05). Springer, Berlin, Germany, 315--327.
|
| |
28
|
Knuth, D. E., Morris, Jr, J. H., and Pratt, V. R. 1977. Fast pattern matching in strings. SIAM J. Comput. 6, 1, 323--350.
|
| |
29
|
|
| |
30
|
|
| |
31
|
|
 |
32
|
|
 |
33
|
|
| |
34
|
|
| |
35
|
Okanohara, D. and Sadakane, K. 2007. Practical entropy-compressed rank/select dictionary. In Proceedings of ALENEX'07. ACM Press, New York.
|
| |
36
|
|
 |
37
|
|
 |
38
|
|
| |
39
|
|
| |
40
|
|
 |
41
|
|
| |
42
|
Yao, A. C. 1979. The complexity of pattern matching for a random string. SIAM J. Comput. 8, 3, 368--387.
|
|