| On parallel execution of multiple pipelined hash joins |
| Full text |
Pdf
(1.24 MB)
|
| Source
|
International Conference on Management of Data
archive
Proceedings of the 1994 ACM SIGMOD international conference on Management of data
table of contents
Minneapolis, Minnesota, United States
Pages: 185 - 196
Year of Publication: 1994
ISBN:0-89791-639-5
Also published in ...
|
|
Authors
|
|
Hui-I Hsiao
|
IBM Thomas J. Watson Research Center, P.O. Box 704, Yorktown Heights, NY
|
|
Ming-Syan Chen
|
IBM Thomas J. Watson Research Center, P.O. Box 704, Yorktown Heights, NY
|
|
Philip S. Yu
|
IBM Thomas J. Watson Research Center, P.O. Box 704, Yorktown Heights, NY
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): , Downloads (12 Months): , Citation Count: 12
|
|
|
ABSTRACT
In this paper we study parallel execution of multiple pipelined hash joins. Specifically, we deal with two issues, processor allocation and the use of hash filters, to improve parallel execution of hash joins. We first present a scheme to transform a bushy execution tree to an allocation tree, where each node denotes a pipeline. Then, processors are allocated to the nodes in the allocation tree based on the concept of synchronous execution time such that inner relations (i.e., hash tables) in a pipeline can be made available approximately the same time. In addition, the approach of hash filtering is investigated to further improve the overall performance. Performance studies are conducted via simulation to demonstrate the importance of processor allocation and to evaluate various schemes using hash filters. Simulation results indicate that processor allocation based on the allocation tree significantly outperforms that based on the original bushy tree, and that the effect of hash filtering becomes prominent as the number of relations in a query increases.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
H. Boral , W. Alexander , L. Clay , G. Copeland , S. Danforth , M. Franklin , B. Hart , M. Smith , P. Valduriez, Prototyping Bubba, A Highly Parallel Database System, IEEE Transactions on Knowledge and Data Engineering, v.2 n.1, p.4-24, March 1990
[doi> 10.1109/69.50903]
|
| |
3
|
|
| |
4
|
|
| |
5
|
|
| |
6
|
D. J. DeWitt and R. Gerber. Multiprocessor Hash-Based Join Algorithms. Proceedings of the 11#h International Conference on Very Large Data Bases, pages 151-162, August 1985.
|
| |
7
|
D. J. Dewitt , S. Ghandeharizadeh , D. A. Schneider , A. Bricker , H. -I. Hsiao , R. Rasmussen, The Gamma Database Machine Project, IEEE Transactions on Knowledge and Data Engineering, v.2 n.1, p.44-62, March 1990
[doi> 10.1109/69.50905]
|
 |
8
|
|
 |
9
|
Sumit Ganguly , Waqar Hasan , Ravi Krishnamurthy, Query optimization for parallel execution, Proceedings of the 1992 ACM SIGMOD international conference on Management of data, p.9-18, June 02-05, 1992, San Diego, California, United States
|
 |
10
|
|
| |
11
|
|
| |
12
|
|
| |
13
|
|
 |
14
|
|
 |
15
|
Ming-Ling Lo , Ming-Syan Syan Chen , C. V. Ravishankar , Philip S. Yu, On optimal processor allocation to support pipelined hash joins, Proceedings of the 1993 ACM SIGMOD international conference on Management of data, p.69-78, May 25-28, 1993, Washington, D.C., United States
|
| |
16
|
|
| |
17
|
|
| |
18
|
|
| |
19
|
D. Schneider. Complex Query Processing in Multiprocessor Database Machines. Technical Report Tech. Rep. 965, Computer Science Department# University of Wisconsin-Madison, September 1990.
|
 |
20
|
|
| |
21
|
|
 |
22
|
P. Griffiths Selinger , M. M. Astrahan , D. D. Chamberlin , R. A. Lorie , T. G. Price, Access path selection in a relational database management system, Proceedings of the 1979 ACM SIGMOD international conference on Management of data, May 30-June 01, 1979, Boston, Massachusetts
[doi> 10.1145/582095.582099]
|
| |
23
|
|
| |
24
|
|
 |
25
|
|
| |
26
|
|
 |
27
|
Joel L. Wolf , John Turek , Ming-Syan Chen , Philip S. Yu, Scheduling multiple queries on a parallel machine, Proceedings of the 1994 ACM SIGMETRICS conference on Measurement and modeling of computer systems, p.45-55, May 16-20, 1994, Nashville, Tennessee, United States
|
| |
28
|
|
CITED BY 12
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
N. Tomov , E. Dempster , M. H. Williams , A. Burger , H. Taylor , P. J. B. King , P. Broughton, Analytical response time estimation in parallel relational database systems, Parallel Computing, v.30 n.2, p.249-283, February 2004
|
|
|
|
|
|
|
|
|
|
|
|
|
|