ACM Home Page
Please provide us with feedback. Feedback
Huffman coding with unequal letter costs
Full text PdfPdf (160 KB)
Source Annual ACM Symposium on Theory of Computing archive
Proceedings of the thiry-fourth annual ACM symposium on Theory of computing table of contents
Montreal, Quebec, Canada
SESSION: Session 12B table of contents
Pages: 785 - 791  
Year of Publication: 2002
ISBN:1-58113-495-9
Authors
Mordecai J. Golin  Hong Kong UST, Kowloon, Hong Kong
Claire Kenyon  Université Paris-Sud, France
Neal E. Young  Akamai Technologies, Cambridge, MA
Sponsor
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 73,   Citation Count: 6
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/509907.510020
What is a DOI?

ABSTRACT

(MATH) In the standard Huffman coding problem, one is given a set of words and for each word a positive frequency. The goal is to encode each word w as a codeword c(w) over a given alphabet. The encoding must be prefix free (no codeword is a prefix of any other) and should minimize the weighted average codeword size &Sgr;w freq w, &124;c(w)&124;. The problem has a well-known polynomial-time algorithm due to Huffman [15].Here we consider the generalization in which the letters of the encoding alphabet may have non-uniform lengths. The goal is to minimize the weighted average codeword length &Sgr;w freq (w) cost(c(w)), where cost s is the sum of the (possibly non-uniform) lengths of the letters in s. Despite much previous work, the problem is not known to be NP-hard, nor was it previously known to have a polynomial-time approximation algorithm. Here we describe a polynomial-time approximation scheme (PTAS) for the problem.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Shimon Even Abraham~Lempel and Martin Cohen. An algorithm for optimal prefix parsing of a noiseless and memoryless channel. IEEE Transactions on Information Theory, 19(2):208--214, March 1973.
 
2
Julia Abrahams. Code and parse trees for lossless source encoding. Communications in Information and Systems, 1(2):113--146, April 2001.
3
 
4
Toby Berger and Raymond~W. Yeung. Optimum 1-ended binary prefix codes. IEEE Transactions on Information Theory, 36(6):1434--1441, November 1990.
 
5
N. M. Blachman. Minimum cost coding of information. IRE Transactions on Information Theory, PGIT-3:139--149, 1954.
 
6
N. M. Blachman. Minimum cost coding of information. IRE Transactions on Information Theory, PGIT-3:139--149, 1954.
 
7
N. Cot. Complexity of the variable-length encoding problem. In Proc. 6th Southeast Conference on Combinatorics, Graph Theory and Computing, pages 211--244, 1975.
 
8
Norbert Cott. Characterization and Design of Optimal Prefix Codes. PhD Thesis, Stanford University, Department of Computer Science, June 1957.
 
9
I. Csisz'ar. Simple proofs of some theorems on noiseless channels. Inform. Contr., 514:285--298, 1969.
 
10
E. N. Gilbert. How good is morse code. Inform Control, 14:585--565, 1969.
 
11
E. N. Gilbert. Coding with digits of unequal costs. IEEE Trans. Inform. Theory, 41:596--600, 1995.
 
12
M. Golin and G. Rote. A dynamic programming algorithm for constructing optimal prefix-free codes for unequal letter costs. IEEE Transactions on Information Theory, 44(5):1770--1781, 1998.
 
13
 
14
K. Hinderer. On dichotomous search with direction-dependent costs for a uniformly hidden objec. Optimization, 21(2):215--229, 1990.
 
15
D. A. Huffman. A method for the construction of minimum redundancy codes. In Proc. IRE 40}, volume 10, pages 1098--1101, September 1952.
 
16
K. A. S. Immink. Codes for Mass Data Storage Systems. Shannon Foundations Publishers, 1999.
 
17
I. Itai. Optimal alphabetic trees. SIAM J. Computing, 5:9--18, 1976.
18
 
19
Richard Karp. Minimum-redundancy coding for the discrete noiseless channel. IRE Transactions on Information Theory, IT-7:27--39, January 1961.
 
20
 
21
R. M. Krause. Channels which transmit letters of unequal duration. Inform. Contr., 5:13--24, 1962.
 
22
R.S. Marcus. Discrete Noiseless Coding. M.S. Thesis, MIT E.E. Dept, 1957.
 
23
K. Mehlhorn. An efficient algorithm for constructing nearly optimal prefix codes. IEEE Trans. Inform. Theory, 26:513--517, September 1980.
 
24
25
 
26
A. De Santis R. M. Capocelli and G. Persiano. Binary prefix codes ending in a 1. IEEE Transactions on Information Theory, 40(4):1296--1302, July 1994.
27
28
 
29
B. Varn. Optimal variable length codes (arbitrary symbol cost and equal code word probability). Information Control, 19:289--301, 1971.


Collaborative Colleagues:
Mordecai J. Golin: colleagues
Claire Kenyon: colleagues
Neal E. Young: colleagues