| Efficient bulk insertion into a distributed ordered table |
| Full text |
Pdf
(525 KB)
|
Source
|
International Conference on Management of Data
archive
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
table of contents
Vancouver, Canada
SESSION: Research Session 16: Transactions and Distribution
table of contents
Pages 765-778
Year of Publication: 2008
ISBN:978-1-60558-102-6
|
|
Authors
|
|
Adam Silberstein
|
Yahoo! Research, Santa Clara, CA, USA
|
|
Brian F. Cooper
|
Yahoo! Research, Santa Clara, CA, USA
|
|
Utkarsh Srivastava
|
Yahoo! Research, Santa Clara, CA, USA
|
|
Erik Vee
|
Yahoo! Research, Santa Clara, CA, USA
|
|
Ramana Yerneni
|
Yahoo! Research, Santa Clara, CA, USA
|
|
Raghu Ramakrishnan
|
Yahoo! Research, Santa Clara, CA, USA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 27, Downloads (12 Months): 275, Citation Count: 2
|
|
|
ABSTRACT
We study the problem of bulk-inserting records into tables in a system that horizontally range-partitions data over a large cluster of shared-nothing machines. Each table partition contains a contiguous portion of the table's key range, and must accept all records inserted into that range. Examples of such systems include BigTable[8] at Google, and PNUTS [15] at Yahoo! During bulk inserts into an existing table, if most of the inserted records end up going into a small number of data partitions, the obtained throughput may be very poor due to ineffective use of cluster parallelism. We propose a novel approach in which a planning phase is invoked before the actual insertions. By creating new partitions and intelligently distributing partitions across machines, the planning phase ensures that the insertion load will be well-balanced. Since there is a tradeoff between the cost of moving partitions and the resulting throughput gain, the planning phase must minimize the sum of partition movement time and insertion time. We show that this problem is a variation of NP-hard bin-packing, reduce it to a problem of packing vectors, and then give a solution with provable approximation guarantees. We evaluate our approach on a prototype system deployed on a cluster of 50 machines, and show that it yields significant improvements over more naïve techniques.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
GridFTP. http://www.globus.org/grid_software/data/gridftp.php.
|
| |
2
|
Hacmp for system p servers. http://www-03.ibm.com/systems/p/software/hacmp/index.html.
|
| |
3
|
Oracle real application clusters 11g. http://www.oracle.com/technology/products/database/clustering/index.html.
|
| |
4
|
Scalability and performance with oracle 11g database. http://www.oracle.com/technology/deploy/performance/pdf/db_perfscale_11gr1_twp.pdf, 2007.
|
 |
5
|
Marcos K. Aguilera , Arif Merchant , Mehul Shah , Alistair Veitch , Christos Karamanolis, Sinfonia: a new paradigm for building scalable distributed systems, Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles, October 14-17, 2007, Stevenson, Washington, USA
|
| |
6
|
P. Bernstein, N. Dani, B. Khessib, R. Manne, and D. Shutt. Data management issues in supporting large-scale web services. IEEE Data Engineering Bulletin, December 2006.
|
| |
7
|
L. Breslau, P. Cao, L. Fan, G. Phillips, and S. Shenker. Web caching and zipf-like distributions: Evidence and implications. In Proc. INFOCOM, 1999.
|
| |
8
|
Fay Chang , Jeffrey Dean , Sanjay Ghemawat , Wilson C. Hsieh , Deborah A. Wallach , Mike Burrows , Tushar Chandra , Andrew Fikes , Robert E. Gruber, Bigtable: a distributed storage system for structured data, Proceedings of the 7th symposium on Operating systems design and implementation, November 06-08, 2006, Seattle, Washington
|
| |
9
|
|
| |
10
|
B. F. Cooper, R. Ramakrishnan, U. Srivastava, A. Silberstein, P. Bohannon, H. Jacobsen, N. Puz, D. Weaver, and R. Yerneni. PNUTS: Yahoo!'s hosted data serving platform. Technical report, Yahoo! Research, 2008.
|
| |
11
|
|
 |
12
|
Giuseppe DeCandia , Deniz Hastorun , Madan Jampani , Gunavardhan Kakulapati , Avinash Lakshman , Alex Pilchin , Swaminathan Sivasubramanian , Peter Vosshall , Werner Vogels, Dynamo: amazon's highly available key-value store, Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles, October 14-17, 2007, Stevenson, Washington, USA
|
| |
13
|
|
 |
14
|
|
 |
15
|
|
| |
16
|
S. Hung and J. Fisk. An algorithm for 0-1 multiple knapsack problems. In Naval Res. Logist. Quart., pages 571--579, 1978.
|
| |
17
|
|
| |
18
|
R. Loulou and E. Michaelides. New greedy-like heuristics for the multidimensional 0-1 knapsack problem. Operations Research, 27:1101--1114, 1979.
|
| |
19
|
|
| |
20
|
S. Mishra. Loading bulk data into a partitioned table. http://www.microsoft.com/technet/prodtechnol/sql/bestpractice/loading_bulk_data_partitioned_table.mspx, September 2006.
|
 |
21
|
|
| |
22
|
|
| |
23
|
S. Senju and Y. Toyoda. An approach to linear programming with 0-1 variables. Management Science, 15(4):B196--B207, 1968.
|
| |
24
|
|
| |
25
|
|
| |
26
|
M. Vasquez and J. Hao. A hybrid approach for the 01 multidimensional knapsack problem. In IJCAI, pages 328--333, 2001.
|
| |
27
|
Sage A. Weil , Scott A. Brandt , Ethan L. Miller , Darrell D. E. Long , Carlos Maltzahn, Ceph: a scalable, high-performance distributed file system, Proceedings of the 7th symposium on Operating systems design and implementation, November 06-08, 2006, Seattle, Washington
|
| |
28
|
|
CITED BY 2
|
|
|
|
|
Brian F. Cooper , Raghu Ramakrishnan , Utkarsh Srivastava , Adam Silberstein , Philip Bohannon , Hans-Arno Jacobsen , Nick Puz , Daniel Weaver , Ramana Yerneni, PNUTS: Yahoo!'s hosted data serving platform, Proceedings of the VLDB Endowment, v.1 n.2, August 2008
|
|