| How to barter bits for chronons: compression and bandwidth trade offs for database scans |
| Full text |
Pdf
(268 KB)
|
Source
|
International Conference on Management of Data
archive
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
table of contents
Beijing, China
SESSION: Storage engine and access methods
table of contents
Pages: 389 - 400
Year of Publication: 2007
ISBN:978-1-59593-686-8
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 10, Downloads (12 Months): 107, Citation Count: 5
|
|
|
ABSTRACT
Two trends are converging to make the CPU cost of a table scan a more important component of database performance. First, table scans are becoming a larger fraction of the query processing workload, and second, large memories and compression are making table scans CPU, rather than disk bandwidth, bound. Data warehouse systems have found that they can avoid the unpredictability of joins and indexing and achieve good performance by using massive parallel processing to perform scans over compressed vertical partitions of a denormalized schema. In this paper we present a study of how to make such scans faster by the use of a scan code generator that produces code tuned to the database schema, the compression dictionaries, the queries being evaluated and the target CPU architecture. We investigate a variety of compression formats and propose two novel optimizations: tuple length quantization and a field length lookup table, for efficiently processing variable length fields and tuples. We present a detailed experimental study of the performance of generated scans against these compression formats, and use this to explore the trade off between compression quality and scan speed. We also introduce new strategies for removing instruction-level dependencies and increasing instruction-level parallelism, allowing for greater exploitation of multi-issue processors.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
D. Abadi, S. Madden, and M. Ferreira. Integrating compression and execution in column-oriented database systems. In SIGMOD, 2006.
|
| |
2
|
G. Antoshenkov, D. Lomet, and J. Murray. Order preserving string compression. In ICDE, 1996.
|
| |
3
|
P. Boncz, M. Zukowski, and N. Nes. MonetDB/X100: Hyper-pipelining query execution. In CIDR, 2005.
|
| |
4
|
Z. Chen, J. Gehrke, and F. Korn. Query optimization in compressed database systems. In SIGMOD, 2001.
|
 |
5
|
|
| |
6
|
J. Goldstein, R. Ramakrishnan, and U. Shaft. Compressing relations and indexes. In ICDE, 1998.
|
| |
7
|
G. Graefe and L. D. Shapiro. Data compression and database performance. In Proc. ACM/IEEE-CS Symp. on Applied Computing, 1991.
|
| |
8
|
S. Harizopoulos, V. Liang, D. Abadi, and S. Madden. Performance tradeoffs in read-optimized databases. In VLDB, 2006.
|
| |
9
|
D. Huffman. A method for the construction of minimum-redundancy codes. In Proceedings of the I.R.E., pages 1098--1102, 1952.
|
| |
10
|
B. R. Iyer and D. Wilhite. Data compression support in databases. In VLDB, 1994.
|
| |
11
|
T. Lehman and M. Carey. Query processing in main memory database management systems. In SIGMOD, 1986.
|
| |
12
|
R. MacNicol and B. French. Sybase IQ Multiplex-Designed for analytics. In VLDB, 2004.
|
 |
13
|
|
| |
14
|
MPöss and D. Potapov. Data compression in Oracle. In VLDB, 2003.
|
| |
15
|
V. Raman and G. Swart. Entropy compression of relations and querying of compressed relations. In VLDB, 2006.
|
| |
16
|
G. Ray, J. Haritsa, and S. Seshadri. Database compression: A performance enhancement tool. In COMAD, 1995.
|
| |
17
|
M. Stonebraker et al. C-store: a column-oriented DBMS. In VLDB, 2005.
|
 |
18
|
|
 |
19
|
|
| |
20
|
A. Zandi, B. Iyer, and G. Langdon. Sort order preserving data compression for extended alphabets. In Data Compression Conference, 1993.
|
| |
21
|
M. Zukowski, S. Heman, N. Nes, and P. A. Boncz. Super-Scalar RAM-CPU Cache Compression. In ICDE, April 2006.
|
|