ACM Home Page
Please provide us with feedback. Feedback
Histogram-aware sorting for enhanced word-aligned compression in bitmap indexes
Full text PdfPdf (824 KB)
Source
Data Warehousing and OLAP archive
Proceeding of the ACM 11th international workshop on Data warehousing and OLAP table of contents
Napa Valley, California, USA
SESSION: Performance optimization and tuning table of contents
Pages 1-8  
Year of Publication: 2008
ISBN:978-1-60558-250-4
Authors
Owen Kaser  University of New Brunswick, Saint John, NB, Canada
Daniel Lemire  Université du Québec à Montréal, Montreal, PQ, Canada
Kamel Aouiche  Université du Québec à Montréal, Montreal, PQ, Canada
Sponsors
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 17,   Downloads (12 Months): 109,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1458432.1458434
What is a DOI?

ABSTRACT

Bitmap indexes must be compressed to reduce input/output costs and minimize CPU usage. To accelerate logical operations (AND, OR, XOR) over bitmaps, we use techniques based on run-length encoding (RLE), such as Word-Aligned Hybrid (WAH) compression. These techniques are sensitive to the order of the rows: a simple lexicographical sort can divide the index size by 9 and make indexes several times faster. We investigate reordering heuristics based on computed attribute-value histograms. Simply permuting the columns of the table based on these histograms can increase the sorting efficiency by 40%.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
K. Aouiche, D. Lemire, and O. Kaser. Tri de la table de faits et compression des index bitmaps avec alignement sur les mots. available from http://arxiv.org/abs/0805.3339.
 
3
L. Bellatreche, R. Missaoui, H. Necir, and H. Drias. Selection and pruning algorithms for bitmap index selection problem using data mining. LNCS, 4654:221, 2007.
 
4
G. Canahuate, H. Ferhatosmanoglu, and A. Pinar. Improving bitmap index compression by data reorganization. http://hpcrd.lbl.gov/~apinar/papers/TKDE06.pdf (checked 2008-05-30), 2006.
5
6
7
 
8
K. Davis and A. Gupta. Data Warehouses and OLAP: Concepts, Architectures, and Solutions, chapter Indexing in Data Warehouses. IRM Press, 2007.
 
9
S. Hettich and S. D. Bay. The UCI KDD archive. http://kdd.ics.uci.edu (checked 2008-04-28), 2000.
 
10
M. Hirabayashi. QDBM: Quick database manager. http://qdbm.sourceforge.net/ (checked 2008-02-22), 2006.
 
11
O. Kaser, S. Keith, and D. Lemire. The LitOLAP project: Data warehousing with literature. In CaSTA'06, 2006.
12
 
13
Netflix, Inc. Nexflix prize. http://www.netflixprize.com (checked 2008-04-28), 2007.
 
14
 
15
 
16
Project Gutenberg Literary Archive Foundation. Project Gutenberg. http://www.gutenberg.org/ (checked 2007-05-30), 2007.
 
17
 
18
Y. Sharma and N. Goyal. An efficient multi-component indexing embedded bitmap compression for data reorganization. Information Technology Journal, 7(1):160--164, 2008.
19
 
20
K. Stockinger, K. Wu, and A. Shoshani. Evaluation strategies for bitmap indices with binning. In DEXA 2004, 2004.
 
21
TPC. DBGEN 2.4.0. http://www.tpc.org/tpch/ (checked 2007-12-4), 2006.
 
22
 
23
24
25

Collaborative Colleagues:
Owen Kaser: colleagues
Daniel Lemire: colleagues
Kamel Aouiche: colleagues