ACM Home Page
Please provide us with feedback. Feedback
Compression boosting in optimal linear time using the Burrows-Wheeler Transform
Full text PdfPdf (277 KB)
Source Symposium on Discrete Algorithms archive
Proceedings of the fifteenth annual ACM-SIAM symposium on Discrete algorithms table of contents
New Orleans, Louisiana
SESSION: Session 8A table of contents
Pages: 655 - 663  
Year of Publication: 2004
ISBN:0-89871-558-X
Authors
Paolo Ferragina  Università di Pisa, Italy
Giovanni Manzini  Università del Piemonte Orientale, Alessandria, Italy
Sponsor
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
Society for Industrial and Applied Mathematics  Philadelphia, PA, USA
Bibliometrics
Downloads (6 Weeks): 14,   Downloads (12 Months): 49,   Citation Count: 3
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

In this paper we provide the first compression booster that turns a zeroth order compressor into a more effiective k-th order compressor without any loss in time efficiency. More precisely, let A be an algorithm that compresses a string s within λ|s|H*0(s)+μ bits of storage in O(T (|s|)) time, where H*0(s) is the zeroth order entropy of the string s. Our booster improves A by compressing s within λ|s|H*0(S) + log2 |s| + gk bits still using O(T (|s|)) time, where H*k(s) is the k-th order entropy of s.The idea of a "compression booster" has been very recently introduced by Giancarlo and Sciortino in [7]. They combined the Burrows-Wheeler Transform [3] with dynamic programming and achieved our same compression bound but with running time O(T (|s|)) + Ω(|s|2). We start from the same premises of [7], but instead of using dynamic programming we design a linear time optimization algorithm based on novel structural properties of the Burrows-Wheeler Transform.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
M. Burrows and D. Wheeler. A block sorting lossless data compression algorithm. Technical Report 124, Digital Equipment Corporation, 1994.
 
4
 
5
P. Fenwick. The Burrows-Wheeler transform for block sorting text compression: principles and improvements. The Computer Journal, 39(9):731--740, 1996.
 
6
 
7
R. Giancarlo and M. Sciortino. Optimal partitions of strings: A new class of Burrows-Wheeler compression algorithms. In Combinatorial Pattern Matching Conference (CPM '03), pages 129--143, 2003.
 
8
9
10
 
11
J. Seward. The bzip2 home page, 1997. http://sources.redhat.com/bzip2.
 
12

Collaborative Colleagues:
Paolo Ferragina: colleagues
Giovanni Manzini: colleagues