Orthogonal optimization of subqueries and aggregation
Source
International Conference on Management of Data
Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Santa Barbara, California, United States
Pages: 571 - 581
Year of Publication: 2001
ISBN:1-58113-332-4
Bibliometrics
Downloads (6 Weeks): 13, Downloads (12 Months): 97, Citation Count: 22
ABSTRACT
There is considerable overlap between the strategies proposed for subquery evaluation and those proposed for grouping and aggregation. In this paper we show how a number of small, independent primitives generate a rich set of efficient execution strategies, covering the standard proposals for subquery evaluation suggested in earlier literature. These small primitives fall into two main, orthogonal areas: correlation removal, and efficient processing of outerjoins and GroupBy. An optimization approach based on these pieces provides syntax-independence of query processing with respect to subqueries, i.e., equivalent queries written with or without subqueries produce the same efficient plan.
We describe techniques implemented in Microsoft SQL Server (releases 7.0 and 8.0) for queries containing subqueries and/or aggregations, based on a number of orthogonal optimizations. We concentrate separately on removing correlated subqueries, also called “query flattening,” and on efficient execution of queries with aggregations. The end result is a modular, flexible implementation, which produces very efficient execution plans. To demonstrate the validity of our approach, we present results for some queries from the TPC-H benchmark. Among all published TPC-H results at the 300GB scale, at the time of writing (November 2000), SQL Server has the fastest results on those queries, even when running on a fraction of the processors used by other systems.
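As an illustration of what "query flattening" means, the sketch below writes the same question two ways: once with a correlated scalar subquery, and once with the subquery replaced by an outerjoin followed by GroupBy. It is a minimal example under stated assumptions, not the implementation described in the paper: the table names, columns, data, and threshold are hypothetical, and it runs on SQLite through Python's standard `sqlite3` module rather than on SQL Server. It only checks that the two formulations return the same rows; it says nothing about the execution plans an optimizer would choose.

```python
import sqlite3

# Hypothetical schema and data, chosen only for illustration.
def build_db():
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE customer (c_custkey INTEGER PRIMARY KEY, c_name TEXT);
        CREATE TABLE orders (
            o_orderkey   INTEGER PRIMARY KEY,
            o_custkey    INTEGER,
            o_totalprice REAL
        );
        INSERT INTO customer VALUES (1, 'alpha'), (2, 'beta'), (3, 'gamma');
        INSERT INTO orders VALUES (10, 1, 700.0), (11, 1, 600.0), (12, 2, 400.0);
    """)
    return conn

# Formulation 1: correlated scalar subquery, evaluated conceptually
# once per customer row.
CORRELATED = """
SELECT c_custkey
FROM customer
WHERE 1000 < (SELECT SUM(o_totalprice)
              FROM orders
              WHERE o_custkey = c_custkey)
ORDER BY c_custkey
"""

# Formulation 2: the same question with the correlation removed,
# expressed as an outerjoin followed by GroupBy.
FLATTENED = """
SELECT c_custkey
FROM customer LEFT OUTER JOIN orders ON o_custkey = c_custkey
GROUP BY c_custkey
HAVING 1000 < SUM(o_totalprice)
ORDER BY c_custkey
"""

def run(conn, sql):
    # Return the first column of every result row as a list.
    return [row[0] for row in conn.execute(sql)]

conn = build_db()
correlated_result = run(conn, CORRELATED)
flattened_result = run(conn, FLATTENED)
print(correlated_result, flattened_result)
```

Customer 3 has no orders, so `SUM` yields NULL in both formulations and the comparison filters that row out either way. The outerjoin preserves customers without orders up to the GroupBy, which is what keeps the flattened form equivalent for aggregates that do not return NULL on empty input.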
CITED BY 22
Albrecht Schmidt, Florian Waas, Martin Kersten, Daniela Florescu, Michael J. Carey, Ioana Manolescu, Ralph Busse, Why and how to benchmark XML databases, ACM SIGMOD Record, v.30 n.3, September 2001
Calisto Zuzarte, Hamid Pirahesh, Wenbin Ma, Qi Cheng, Linqi Liu, Kwai Wong, WinMagic: subquery elimination using window aggregation, Proceedings of the 2003 ACM SIGMOD international conference on Management of data, June 09-12, 2003, San Diego, California
Shankar Pal, Istvan Cseri, Oliver Seeliger, Michael Rys, Gideon Schaller, Wei Yu, Dragan Tomic, Adrian Baras, Brandon Berg, Denis Churin, Eugene Kogan, XQuery implementation in a relational database system, Proceedings of the 31st international conference on Very large data bases, August 30-September 02, 2005, Trondheim, Norway
Nikos Karayannidis, Aris Tsois, Timos Sellis, Roland Pieringer, Volker Markl, Frank Ramsak, Robert Fenk, Klaus Elhardt, Rudolf Bayer, Processing star queries on hierarchically-clustered fact tables, Proceedings of the 28th international conference on Very Large Data Bases, p.730-741, August 20-23, 2002, Hong Kong, China
Loredana Afanasiev, Torsten Grust, Maarten Marx, Jan Rittinger, Jens Teubner, Recursion in XQuery: put your distributivity safety belt on, Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, March 24-26, 2009, Saint Petersburg, Russia
Mihnea Andrei, Xun Cheng, Sudipto Chowdhuri, Curtis Johnson, Edwin Seputis, Ordering, distinctness, aggregation, partitioning and DQP optimization in sybase ASE 15, Proceedings of the 35th SIGMOD international conference on Management of data, June 29-July 02, 2009, Providence, Rhode Island, USA