ACM Home Page
Please provide us with feedback. Feedback
Data compression with finite windows
Full text PdfPdf (1.89 MB)
Source
Communications of the ACM archive
Volume 32 ,  Issue 4  (April 1989) table of contents
Pages: 490 - 505  
Year of Publication: 1989
ISSN:0001-0782
Authors
E. R. Fiala  Xerox PARC, Palo Alto, CA
D. H. Greene  Xerox PARC, Palo Alto, CA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 74,   Citation Count: 24
Additional Information:

abstract   references   cited by   index terms   review   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/63334.63341
What is a DOI?

ABSTRACT

Several methods are presented for adaptive, invertible data compression in the style of Lempel's and Ziv's first textual substitution proposal. For the first two methods, the article describes modifications of McCreight's suffix tree data structure that support cyclic maintenance of a window on the most recent source characters. A percolating update is used to keep node positions within the window, and the updating process is shown to have constant amortized cost. Other methods explore the tradeoffs between compression time, expansion time, data structure size, and amount of compression achieved. The article includes a graph-theoretic analysis of the compression penalty incurred by our codeword selection policy in comparison with an optimal policy, and it includes empirical studies of the performance of various adaptive compressors from the literature.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Abramson, N. Information Theory and Coding. McGraw-Hill, 1963, 61.
 
2
Basharin, G.P. On a statistical estimate for the entropy of a sequence of independent random variables. Theory Prob. Appl. 4, (1959), 333-336.
 
3
Bell, T.C. Better OPM/L text compression. IEEE Trans. Commun. C0M-34, 12 (1986), 1176-1182.
 
4
5
 
6
Cleary, J.G., and Witten, I.H. Data compression using adaptive coding and partial string matching. IEEE Trans. Commun. COM-32, 4 (1984), 396-402.
 
7
 
8
Dantzig, G.B., Blattner, W.O., and Rao, M.R. Finding a Cycle in a Graph with Minimum Cost to Time Ratio with Application to a Ship Routing Problem. Theory of Graphs, P. Rosenstiehl, ed. Gordon and Breach, 1966.
 
9
Elias, P. Universal codeword sets and representations of the integers. /GEE Trans. Info. Theory IT-21, 2 (1975), 194-203.
10
 
11
Gallager, R.G. Variations on a theme by Huffman. IEEE Trans. lnfo. Theory IT-24, 6 (1978), 668-674.
 
12
Gasbarro, }. An Architecture for High-Performance VLSI Testers. Ph.D. dissertation, Department of Electrical Engineering, Stanford University, 1988.
 
13
Golomb, S.W. Run-Length Encodings. IEEE Trans. Info. Theory IT-12 (1966), 399-401.
 
14
Guazzo, M. A General Minimum Redundancy Source-coding Algorithm. IEEE Trans. Info. Theory IT-26, I (1980), 15-25.
 
15
 
16
Horspool, R.N., and Cormack, G.V. Dynamic Markov Modelling-- A Prediction Technique. In Proceedings of the 19th Annual Hawaii International Conference on System Sciences (1986), 700-707.
 
17
Huffman, D.A. A method for the construction of minimum-redundancy codes. In Proceedings of the I.R.E. 40 (1952), 1098-1101.
 
18
Hunter, R., and Robinson, A.H. International digital facsimile coding standards. In Proceedings of the IEEE 68, 7 (1980), 854-867.
 
19
 
20
Jones, C.B. An efficient coding system for long source sequences. IEEE Trans. Info. Theory IT-27, 3 (1981), 280-291.
 
21
Kempf, M., Bayer, R., and Gfintzer, U. Time Optimal Left to Right Construction of Position Trees. Acta Informatica, 24 (1987), 461-474.
 
22
 
23
 
24
Langdon, G.G., Jr. A note on the Ziv-Lempel model for compressing individual sequences IEEE Trans. Info. Theory IT-29, 2 (1983}, 284-287.
 
25
Langdon, G.G., Jr., and Rissanen, J. Compression of Black-White Images with Arithmetic Coding. IEEE Trans. Commun. COM-29, 6 (1981), 858-867.
 
26
Langdon, G.G., Jr., and Rissanen, J. A Double Adaptive File Compression Algorithm. IEEE Trans. Commun. COM-31, 11 (1983), 1253-1255.
 
27
Lawler, E.L. Combinatorial Optimization: Networks and Matroids. Holt, Rinehart and Winston, 1976.
28
 
29
Miller, V.S., and Wegman, M.N. Variations on a theme by Ziv and Lempel. IBM Res. Rep. RC 10630 (#47798), 1984. Combinatorial Algorithms on Words, NATO ASI Series F, 12 (1985), 131-140.
30
 
31
 
32
Rissanen, J., and Langdon, G.G.. Jr. Arithmetic Coding. IBM Journal of Research and Development 23, 2 (1979), 149-162.
 
33
Rissanen, J., and Langdon, G.G., Jr. Universal Modeling and Coding. IEEE Trans. Info. Theory IT-27, 1 (1981}, 12-23.
34
 
35
Shannon, C.E. A mathematical theory of communication. The Bell System Technical Journal 27, 3, 379-423 and 27, 4 (1948), 623-656.
36
 
37
Vitter, J.S. Design and analysis of dynamic Huffman coding. Brown University Technical Report No. CS-85-13, 1985.
 
38
Wether, P. Linear Pattern Matching Algorithms. Fourteenth Annual Symposium on Switching and Automata Theory, 1-11, 1973.
 
39
Welch, T.A. A technique for high performance data compression. IEEE Comp. 17, 6 (1984}, 8-19.
40
 
41
Ziv, J. Coding theorems for individual sequences IEEE Trans. Info. Theory IT-24,4 (1978), 405-412.
 
42
Ziv, J., and Lempel, A. A universal algorithm for sequential data compression. IEEE Trans. Info. Theory IT-23, 3 (1977), 337-343.
 
43
Ziv, J., and Lempel, A. Compression of individual sequences via variable-rate coding. IEEE Trans. Info. Theory IT-24, 5 (1978}, 530-536.

CITED BY  24


REVIEW

"R. Nigel Horspool : Reviewer"

The research of Ziv and Lempel led to a family of compression methods based on the idea of replacing a group of characters by a reference to an earlier occurrence of that same group. The authors explore one particular form of this compression me  more...

Collaborative Colleagues:
E. R. Fiala: colleagues
D. H. Greene: colleagues