| Compression boosting in optimal linear time using the Burrows-Wheeler Transform |
| Full text |
Pdf
(277 KB)
|
| Source
|
Symposium on Discrete Algorithms
archive
Proceedings of the fifteenth annual ACM-SIAM symposium on Discrete algorithms
table of contents
New Orleans, Louisiana
SESSION: Session 8A
table of contents
Pages: 655 - 663
Year of Publication: 2004
ISBN:0-89871-558-X
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
Society for Industrial and Applied Mathematics
Philadelphia, PA, USA
|
| Bibliometrics |
Downloads (6 Weeks): 14, Downloads (12 Months): 49, Citation Count: 3
|
|
|
ABSTRACT
In this paper we provide the first compression booster that turns a zeroth order compressor into a more effiective k-th order compressor without any loss in time efficiency. More precisely, let A be an algorithm that compresses a string s within λ|s|H*0(s)+μ bits of storage in O(T (|s|)) time, where H*0(s) is the zeroth order entropy of the string s. Our booster improves A by compressing s within λ|s|H*0(S) + log2 |s| + gk bits still using O(T (|s|)) time, where H*k(s) is the k-th order entropy of s.The idea of a "compression booster" has been very recently introduced by Giancarlo and Sciortino in [7]. They combined the Burrows-Wheeler Transform [3] with dynamic programming and achieved our same compression bound but with running time O(T (|s|)) + Ω(|s|2). We start from the same premises of [7], but instead of using dynamic programming we design a linear time optimization algorithm based on novel structural properties of the Burrows-Wheeler Transform.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
M. Burrows and D. Wheeler. A block sorting lossless data compression algorithm. Technical Report 124, Digital Equipment Corporation, 1994.
|
| |
4
|
|
| |
5
|
P. Fenwick. The Burrows-Wheeler transform for block sorting text compression: principles and improvements. The Computer Journal, 39(9):731--740, 1996.
|
| |
6
|
|
| |
7
|
R. Giancarlo and M. Sciortino. Optimal partitions of strings: A new class of Burrows-Wheeler compression algorithms. In Combinatorial Pattern Matching Conference (CPM '03), pages 129--143, 2003.
|
| |
8
|
|
 |
9
|
|
 |
10
|
|
| |
11
|
J. Seward. The bzip2 home page, 1997. http://sources.redhat.com/bzip2.
|
| |
12
|
|
|