| Histogram-aware sorting for enhanced word-aligned compression in bitmap indexes |
| Full text |
Pdf
(824 KB)
|
Source
|
Data Warehousing and OLAP
archive
Proceeding of the ACM 11th international workshop on Data warehousing and OLAP
table of contents
Napa Valley, California, USA
SESSION: Performance optimization and tuning
table of contents
Pages 1-8
Year of Publication: 2008
ISBN:978-1-60558-250-4
|
|
Authors
|
|
Owen Kaser
|
University of New Brunswick, Saint John, NB, Canada
|
|
Daniel Lemire
|
Université du Québec à Montréal, Montreal, PQ, Canada
|
|
Kamel Aouiche
|
Université du Québec à Montréal, Montreal, PQ, Canada
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 17, Downloads (12 Months): 109, Citation Count: 0
|
|
|
ABSTRACT
Bitmap indexes must be compressed to reduce input/output costs and minimize CPU usage. To accelerate logical operations (AND, OR, XOR) over bitmaps, we use techniques based on run-length encoding (RLE), such as Word-Aligned Hybrid (WAH) compression. These techniques are sensitive to the order of the rows: a simple lexicographical sort can divide the index size by 9 and make indexes several times faster. We investigate reordering heuristics based on computed attribute-value histograms. Simply permuting the columns of the table based on these histograms can increase the sorting efficiency by 40%.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
K. Aouiche, D. Lemire, and O. Kaser. Tri de la table de faits et compression des index bitmaps avec alignement sur les mots. available from http://arxiv.org/abs/0805.3339.
|
| |
3
|
L. Bellatreche, R. Missaoui, H. Necir, and H. Drias. Selection and pruning algorithms for bitmap index selection problem using data mining. LNCS, 4654:221, 2007.
|
| |
4
|
G. Canahuate, H. Ferhatosmanoglu, and A. Pinar. Improving bitmap index compression by data reorganization. http://hpcrd.lbl.gov/~apinar/papers/TKDE06.pdf (checked 2008-05-30), 2006.
|
 |
5
|
|
 |
6
|
|
 |
7
|
|
| |
8
|
K. Davis and A. Gupta. Data Warehouses and OLAP: Concepts, Architectures, and Solutions, chapter Indexing in Data Warehouses. IRM Press, 2007.
|
| |
9
|
S. Hettich and S. D. Bay. The UCI KDD archive. http://kdd.ics.uci.edu (checked 2008-04-28), 2000.
|
| |
10
|
M. Hirabayashi. QDBM: Quick database manager. http://qdbm.sourceforge.net/ (checked 2008-02-22), 2006.
|
| |
11
|
O. Kaser, S. Keith, and D. Lemire. The LitOLAP project: Data warehousing with literature. In CaSTA'06, 2006.
|
 |
12
|
|
| |
13
|
Netflix, Inc. Nexflix prize. http://www.netflixprize.com (checked 2008-04-28), 2007.
|
| |
14
|
|
| |
15
|
|
| |
16
|
Project Gutenberg Literary Archive Foundation. Project Gutenberg. http://www.gutenberg.org/ (checked 2007-05-30), 2007.
|
| |
17
|
|
| |
18
|
Y. Sharma and N. Goyal. An efficient multi-component indexing embedded bitmap compression for data reorganization. Information Technology Journal, 7(1):160--164, 2008.
|
 |
19
|
Kurt Stockinger , Kesheng Wu , Arie Shoshani, Strategies for processing ad hoc queries on large data warehouses, Proceedings of the 5th ACM international workshop on Data Warehousing and OLAP, p.72-79, November 08-08, 2002, McLean, Virginia, USA
[doi> 10.1145/583890.583901]
|
| |
20
|
K. Stockinger, K. Wu, and A. Shoshani. Evaluation strategies for bitmap indices with binning. In DEXA 2004, 2004.
|
| |
21
|
TPC. DBGEN 2.4.0. http://www.tpc.org/tpch/ (checked 2007-12-4), 2006.
|
| |
22
|
|
| |
23
|
Harry K. T. Wong , Hsiu-Fen Liu , Frank Olken , Doron Rotem , Linda Wong, Bit transposed files, Proceedings of the 11th international conference on Very Large Data Bases, p.448-457, August 21-23, 1985, Stockholm, Sweden
|
 |
24
|
|
 |
25
|
|
|