|
ABSTRACT
Several methods are presented for adaptive, invertible data compression in the style of Lempel's and Ziv's first textual substitution proposal. For the first two methods, the article describes modifications of McCreight's suffix tree data structure that support cyclic maintenance of a window on the most recent source characters. A percolating update is used to keep node positions within the window, and the updating process is shown to have constant amortized cost. Other methods explore the tradeoffs between compression time, expansion time, data structure size, and amount of compression achieved. The article includes a graph-theoretic analysis of the compression penalty incurred by our codeword selection policy in comparison with an optimal policy, and it includes empirical studies of the performance of various adaptive compressors from the literature.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Abramson, N. Information Theory and Coding. McGraw-Hill, 1963, 61.
|
| |
2
|
Basharin, G.P. On a statistical estimate for the entropy of a sequence of independent random variables. Theory Prob. Appl. 4, (1959), 333-336.
|
| |
3
|
Bell, T.C. Better OPM/L text compression. IEEE Trans. Commun. C0M-34, 12 (1986), 1176-1182.
|
| |
4
|
|
 |
5
|
|
| |
6
|
Cleary, J.G., and Witten, I.H. Data compression using adaptive coding and partial string matching. IEEE Trans. Commun. COM-32, 4 (1984), 396-402.
|
| |
7
|
|
| |
8
|
Dantzig, G.B., Blattner, W.O., and Rao, M.R. Finding a Cycle in a Graph with Minimum Cost to Time Ratio with Application to a Ship Routing Problem. Theory of Graphs, P. Rosenstiehl, ed. Gordon and Breach, 1966.
|
| |
9
|
Elias, P. Universal codeword sets and representations of the integers. /GEE Trans. Info. Theory IT-21, 2 (1975), 194-203.
|
 |
10
|
|
| |
11
|
Gallager, R.G. Variations on a theme by Huffman. IEEE Trans. lnfo. Theory IT-24, 6 (1978), 668-674.
|
| |
12
|
Gasbarro, }. An Architecture for High-Performance VLSI Testers. Ph.D. dissertation, Department of Electrical Engineering, Stanford University, 1988.
|
| |
13
|
Golomb, S.W. Run-Length Encodings. IEEE Trans. Info. Theory IT-12 (1966), 399-401.
|
| |
14
|
Guazzo, M. A General Minimum Redundancy Source-coding Algorithm. IEEE Trans. Info. Theory IT-26, I (1980), 15-25.
|
| |
15
|
|
| |
16
|
Horspool, R.N., and Cormack, G.V. Dynamic Markov Modelling-- A Prediction Technique. In Proceedings of the 19th Annual Hawaii International Conference on System Sciences (1986), 700-707.
|
| |
17
|
Huffman, D.A. A method for the construction of minimum-redundancy codes. In Proceedings of the I.R.E. 40 (1952), 1098-1101.
|
| |
18
|
Hunter, R., and Robinson, A.H. International digital facsimile coding standards. In Proceedings of the IEEE 68, 7 (1980), 854-867.
|
| |
19
|
|
| |
20
|
Jones, C.B. An efficient coding system for long source sequences. IEEE Trans. Info. Theory IT-27, 3 (1981), 280-291.
|
| |
21
|
Kempf, M., Bayer, R., and Gfintzer, U. Time Optimal Left to Right Construction of Position Trees. Acta Informatica, 24 (1987), 461-474.
|
| |
22
|
|
| |
23
|
|
| |
24
|
Langdon, G.G., Jr. A note on the Ziv-Lempel model for compressing individual sequences IEEE Trans. Info. Theory IT-29, 2 (1983}, 284-287.
|
| |
25
|
Langdon, G.G., Jr., and Rissanen, J. Compression of Black-White Images with Arithmetic Coding. IEEE Trans. Commun. COM-29, 6 (1981), 858-867.
|
| |
26
|
Langdon, G.G., Jr., and Rissanen, J. A Double Adaptive File Compression Algorithm. IEEE Trans. Commun. COM-31, 11 (1983), 1253-1255.
|
| |
27
|
Lawler, E.L. Combinatorial Optimization: Networks and Matroids. Holt, Rinehart and Winston, 1976.
|
 |
28
|
|
| |
29
|
Miller, V.S., and Wegman, M.N. Variations on a theme by Ziv and Lempel. IBM Res. Rep. RC 10630 (#47798), 1984. Combinatorial Algorithms on Words, NATO ASI Series F, 12 (1985), 131-140.
|
 |
30
|
|
| |
31
|
|
| |
32
|
Rissanen, J., and Langdon, G.G.. Jr. Arithmetic Coding. IBM Journal of Research and Development 23, 2 (1979), 149-162.
|
| |
33
|
Rissanen, J., and Langdon, G.G., Jr. Universal Modeling and Coding. IEEE Trans. Info. Theory IT-27, 1 (1981}, 12-23.
|
 |
34
|
|
| |
35
|
Shannon, C.E. A mathematical theory of communication. The Bell System Technical Journal 27, 3, 379-423 and 27, 4 (1948), 623-656.
|
 |
36
|
|
| |
37
|
Vitter, J.S. Design and analysis of dynamic Huffman coding. Brown University Technical Report No. CS-85-13, 1985.
|
| |
38
|
Wether, P. Linear Pattern Matching Algorithms. Fourteenth Annual Symposium on Switching and Automata Theory, 1-11, 1973.
|
| |
39
|
Welch, T.A. A technique for high performance data compression. IEEE Comp. 17, 6 (1984}, 8-19.
|
 |
40
|
|
| |
41
|
Ziv, J. Coding theorems for individual sequences IEEE Trans. Info. Theory IT-24,4 (1978), 405-412.
|
| |
42
|
Ziv, J., and Lempel, A. A universal algorithm for sequential data compression. IEEE Trans. Info. Theory IT-23, 3 (1977), 337-343.
|
| |
43
|
Ziv, J., and Lempel, A. Compression of individual sequences via variable-rate coding. IEEE Trans. Info. Theory IT-24, 5 (1978}, 530-536.
|
CITED BY 24
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Marshall Bern , Daniel Greene , Arvind Raghunathan, On-line algorithms for cache sharing, Proceedings of the twenty-fifth annual ACM symposium on Theory of computing, p.422-430, May 16-18, 1993, San Diego, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
M. Bern , D. H. Greene , A. Raghunathan , M. Sudan, Online algorithms for locating checkpoints, Proceedings of the twenty-second annual ACM symposium on Theory of computing, p.359-368, May 13-17, 1990, Baltimore, Maryland, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Natthapon Punthong , Athasit Surarerks, Online prefix-free encoding algorithm, Proceedings of the 24th IASTED international conference on Signal processing, pattern recognition, and applications, p.165-170, February 15-17, 2006, Innsbruck, Austria
|
|
|
Sarav Bhatia , Curtis Dahn , Jason Chong Lee , Miten Sampat , D. Scott McCrickard, VTAssist: a location-based feedback notification system for the disabled, Proceedings of the 44th annual southeast regional conference, March 10-12, 2006, Melbourne, Florida
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
REVIEW
"R. Nigel Horspool : Reviewer"
The research of Ziv and Lempel led to a family of compression methods based
on the idea of replacing a group of characters by a reference to an earlier
occurrence of that same group. The authors explore one particular form
of this compression me
more...
|