ABSTRACT
Aggregation and duplicate removal are common in SQL queries. However, aggregate processing has received surprisingly little attention in the parallel query processing literature; furthermore, for each of the traditional parallel aggregation algorithms, there is a range of grouping selectivities where the algorithm performs poorly. In this work, we propose new algorithms that adapt dynamically, at query evaluation time, to the observed grouping selectivities. Performance analysis via analytical modeling and an implementation on a workstation cluster shows that the proposed algorithms perform well for all grouping selectivities. Finally, we study the effect of data skew and show that for certain data sets the proposed algorithms can even outperform the best of the traditional approaches.
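The abstract gives no algorithmic detail, so the following is only an illustrative sketch of what adapting to the observed grouping selectivity could look like, not the algorithms proposed in the paper. It assumes a simulated cluster in which each node pre-aggregates its partition in a bounded table and, once it has seen more distinct groups than the table can hold, stops pre-aggregating and forwards raw tuples to the node that owns the key. The function name `adaptive_sum`, the parameter `max_local_groups`, and the choice of SUM as the aggregate are all hypothetical.

```python
def adaptive_sum(partitions, max_local_groups):
    """Compute SUM(value) GROUP BY key over simulated node partitions.

    Each node starts with local pre-aggregation. If it observes more
    distinct keys than max_local_groups (an assumed memory bound), it
    switches to forwarding raw tuples for the rest of its input.
    """
    n = len(partitions)
    inboxes = [[] for _ in range(n)]  # tuples routed to each owning node
    switched = []                     # per node: did it stop pre-aggregating?

    for part in partitions:
        local = {}            # bounded table of partial sums
        forwarding = False    # True once the node has adapted
        for key, value in part:
            fits = key in local or len(local) < max_local_groups
            if not forwarding and fits:
                local[key] = local.get(key, 0) + value
            else:
                # Too many groups observed: send the raw tuple to its owner.
                forwarding = True
                inboxes[hash(key) % n].append((key, value))
        # Ship whatever partial sums were accumulated before (or without) the switch.
        for key, partial in local.items():
            inboxes[hash(key) % n].append((key, partial))
        switched.append(forwarding)

    # Final phase: each owner merges partial sums and raw values alike.
    result = {}
    for inbox in inboxes:
        for key, value in inbox:
            result[key] = result.get(key, 0) + value
    return result, switched


# Node 0 sees few groups and keeps pre-aggregating; node 1 sees many and adapts.
partitions = [
    [(1, 10), (1, 5), (2, 1)],
    [(k, 1) for k in range(100)],
]
result, switched = adaptive_sum(partitions, max_local_groups=4)
```

Because SUM is distributive, partial sums and raw values can be merged by the same addition in the final phase, which is what lets a node change strategy partway through its input without affecting the result.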
CITED BY 19
Themistoklis Palpanas , Richard Sidle , Roberta Cochrane , Hamid Pirahesh, Incremental maintenance for non-distributive aggregate functions, Proceedings of the 28th international conference on Very Large Data Bases, p.802-813, August 20-23, 2002, Hong Kong, China
Sameet Agarwal , Rakesh Agrawal , Prasad Deshpande , Ashish Gupta , Jeffrey F. Naughton , Raghu Ramakrishnan , Sunita Sarawagi, On the Computation of Multidimensional Aggregates, Proceedings of the 22nd International Conference on Very Large Data Bases, p.506-521, September 03-06, 1996
Tahsin Kurc , Feng Lee , Gagan Agrawal , Umit Catalyurek , Renato Ferreira , Joel Saltz, Optimizing Reduction Computations In a Distributed Environment, Proceedings of the 2003 ACM/IEEE conference on Supercomputing, p.9, November 15-21, 2003