|
ABSTRACT
This paper describes the Query Rewrite facility of the Starburst extensible database system, a novel phase of query optimization. We present a suite of rewrite rules used in Starburst to transform queries into equivalent queries for faster execution, and also describe the production rule engine which is used by Starburst to choose and execute these rules. Examples are provided demonstrating that these Query Rewrite transformations lead to query execution time improvements of orders of magnitude, suggesting that Query Rewrite in general—and these rewrite rules in particular—are an essential step in query optimization for modern database systems.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
ABC+76
|
M. M. Astrahan , M. W. Blasgen , D. D. Chamberlin , K. P. Eswaran , J. N. Gray , P. P. Griffiths , W. F. King , R. A. Lorie , P. R. McJones , J. W. Mehl , G. R. Putzolu , I. L. Traiger , B. W. Wade , V. Watson, System R: relational approach to database management, ACM Transactions on Database Systems (TODS), v.1 n.2, p.97-137, June 1976
[doi> 10.1145/320455.320457]
|
| |
Anf89
|
Ole Jirgen Anfindsen.A Study of Access Path Selection in DB2. Technical report, Norwegian Telecommunications Administration and University of Oslo, Norway, October 1989.
|
| |
BFKM85
|
|
| |
BTA90
|
Jose Blakeley, Craig Thompson, and Abdallah Alashqur. Strawman reference model for object query language. In First OODB Standardization Workshop, X3/SPARC/DBSSG/OODBTG, Atlantic City, New Jersey, May 1990.
|
| |
Day87
|
|
 |
GW87
|
|
| |
HCL+90
|
L. M. Haas , W. Chang , G. M. Lohman , J. McPherson , P. F. Wilms , G. Lapis , B. Lindsay , H. Pirahesh , M. J. Carey , E. Shekita, Starburst Mid-Flight: As the Dust Clears, IEEE Transactions on Knowledge and Data Engineering, v.2 n.1, p.143-160, March 1990
[doi> 10.1109/69.50910]
|
| |
HH91
|
Joseph M. Hellerstein and Meichun Hsu. Determinism in Partially Ordered Production Systems. Research Report RJ 8089, IBM Almaden Research Center, March 1991.
|
| |
HP88
|
Waqar Hasan and Hamid Pirahesh. Query Rewrite Optimization in Starburst. Research Report RJ 6367, IBM Almaden Research Center, August 1988.
|
| |
HSS88
|
T. Harder , H. Schoning , A. Sikeler, Parallelism in processing queries on complex objects, Proceedings of the first international symposium on Databases in parallel and distributed systems, p.131-143, December 05-07, 1988, Austin, Texas, United States
|
| |
ISO91
|
ISO_ANSI. Database Language SQL ISOBEC 9075:1992, 1991.
|
 |
Kim82
|
|
 |
LLOW91
|
|
 |
LLPS91
|
Guy M. Lohman , Bruce Lindsay , Hamid Pirahesh , K. Bernhard Schiefer, Extensions to Starburst: objects, types, functions, and rules, Communications of the ACM, v.34 n.10, p.94-109, Oct. 1991
[doi> 10.1145/125223.125266]
|
| |
Loo86
|
Chris Loosley. Measuring IBM Database 2 Release 2 - The Benchmark Game. InfoDB, 1(2), 1986.
|
| |
MF78
|
J. McDermott and C. Forgy. Production System Conflict Resolution Strategies. In D.A. Waterman and Fredrick Hayes-Roth, editors, Pattern Directed Inference Systems, pages 177-199. Academic Press, 1978.
|
 |
MFPR90a
|
I. S. Mumick , S. J. Finkelstein , Hamid Pirahesh , Raghu Ramakrishnan, Magic is relevant, Proceedings of the 1990 ACM SIGMOD international conference on Management of data, p.247-258, May 23-26, 1990, Atlantic City, New Jersey, United States
|
 |
MFPR90b
|
Inderpal Singh Mumick , Sheldon J. Finkelstein , Hamid Pirahesh , Raghu Ramakrishnan, Magic conditions, Proceedings of the ninth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, p.314-330, April 02-04, 1990, Nashville, Tennessee, United States
[doi> 10.1145/298514.298584]
|
| |
MPR90
|
|
| |
O'N89
|
P. O'Neil. Revisiting DBMS Benchmarks. Datamation, pages 47-54, September 15, 1989.
|
| |
Pir89
|
Hamid Pirahesh. Early Experience with Rule-Based Query Rewrite Optimization. In G. Graefe, editor, Workshop on Database Query Optimization, CSE Technical Report 89-005. Oregon Graduate Center, May 1989.
|
| |
Pro90
|
Proc. ACM-SIGMOD International Conference on Management of Data, Atlantic City, May 1990.
|
| |
Ras90
|
Louiqa Raschid. Maintaining Consistency in a Stratified Production System Program. In Proc. AAAI National Conference on Artificial Intelligence, 1990.
|
 |
RGL90
|
Arnon Rosenthal , Cesar Galindo-Legaria, Query graphs, implementing trees, and freely-reorderable outerjoins, Proceedings of the 1990 ACM SIGMOD international conference on Management of data, p.291-299, May 23-26, 1990, Atlantic City, New Jersey, United States
|
 |
SAC+79
|
P. Griffiths Selinger , M. M. Astrahan , D. D. Chamberlin , R. A. Lorie , T. G. Price, Access path selection in a relational database management system, Proceedings of the 1979 ACM SIGMOD international conference on Management of data, May 30-June 01, 1979, Boston, Massachusetts
[doi> 10.1145/582095.582099]
|
 |
SJGP90
|
Michael Stonebraker , Anant Jhingran , Jeffrey Goh , Spyros Potamianos, On rules, procedure, caching and views in data base systems, Proceedings of the 1990 ACM SIGMOD international conference on Management of data, p.281-290, May 23-26, 1990, Atlantic City, New Jersey, United States
|
 |
SWK76
|
|
| |
TOB89
|
C. Turbyfill, C. Orji, and Dina Bitton. AS3AP - A Comparative Relational Database Benchmark. In Proc. IEEE Compcon Spring '89, February 1989.
|
| |
ZH90
|
|
CITED BY 80
|
|
|
|
|
|
|
|
|
|
|
Lane B. Warshaw , Daniel P. Miranker, Rule-based query optimization, revisited, Proceedings of the eighth international conference on Information and knowledge management, p.267-275, November 02-06, 1999, Kansas City, Missouri, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Francis Chu , Joseph Y. Halpern , Praveen Seshadri, Least expected cost query optimization: an exercise in utility, Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, p.138-147, May 31-June 03, 1999, Philadelphia, Pennsylvania, United States
|
|
|
Gautam Bhargava , Piyush Goel , Balakrishna R. Iyer, No regression algorithm for the enumeration of projections in SQL queries with joins and outer joins, Proceedings of the 1995 conference of the Centre for Advanced Studies on Collaborative research, p.8, November 07-09, 1995, Toronto, Ontario, Canada
|
|
|
Gautam Bhargava , Piyush Goel , Balakrishna R. Iyer, No regression algorithm for the enumeration of projections in SQL queries with joins and outer joins, Proceedings of the 1995 conference of the Centre for Advanced Studies on Collaborative research, p.8, November 07-09, 1995, Toronto, Ontario, Canada
|
|
|
Gene Fuh , Jyh-Herng Chow , Nelson Mattos , Brian Tran, Supporting procedural constructs in existing SQL compilers, Proceedings of the 1996 conference of the Centre for Advanced Studies on Collaborative research, p.11, November 12-14, 1996, Toronto, Ontario, Canada
|
|
|
H. V. Jagadish , Alberto O. Mendelzon , Inderpal Singh Mumick, Managing conflicts between rules (extended abstract), Proceedings of the fifteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, p.192-201, June 04-06, 1996, Montreal, Quebec, Canada
|
|
|
|
|
|
Gautam Bhargava , Piyush Goel , Balakrishna R. Iyer, Simplification of outer joins, Proceedings of the 1995 conference of the Centre for Advanced Studies on Collaborative research, p.7, November 07-09, 1995, Toronto, Ontario, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
R. Telford , R. Horman , S. Lightstone , N. Markov , S. O'Connell , G. Lohman, Usability and design considerations for an autonomic relational database management system, IBM Systems Journal, v.42 n.4, p.568-581, October 2003
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Kevin Beyer , Roberta J. Cochrane , Vanja Josifovski , Jim Kleewein , George Lapis , Guy Lohman , Bob Lyle , Fatma Özcan , Hamid Pirahesh , Normen Seemann , Tuong Truong , Bert Van der Linden , Brian Vickery , Chun Zhang, System RX: one part relational, one part XML, Proceedings of the 2005 ACM SIGMOD international conference on Management of data, June 14-16, 2005, Baltimore, Maryland
|
|
|
|
|
|
|
|
|
|
|
|
A. Balmin , T. Eliaz , J. Hornibrook , L. Lim , G. M. Lohman , D. Simmen , M. Wang , C. Zhang, Cost-based optimization in DB2 XML, IBM Systems Journal, v.45 n.2, p.299-319, January 2006
|
|
|
K. Beyer , R. Cochrane , M. Hvizdos , V. Josifovski , J. Kleewein , G. Lapis , G. Lohman , R. Lyle , M. Nicola , F. Özcan , H. Pirahesh , N. Seemann , A. Singh , T. Truong , R. C. Van der Linden , B. Vickery , C. Zhang , G. Zhang, DB2 goes hybrid: integratng native XML and XQuery with relational data and SQL, IBM Systems Journal, v.45 n.2, p.271-298, January 2006
|
|
|
|
|
|
Irina Botan , Donald Kossmann , Peter M. Fischer , Tim Kraska , Dana Florescu , Rokas Tamosevicius, Extending XQuery with window functions, Proceedings of the 33rd international conference on Very large data bases, September 23-27, 2007, Vienna, Austria
|
|
|
|
|
|
|
|
|
Rafi Ahmed , Allison Lee , Andrew Witkowski , Dinesh Das , Hong Su , Mohamed Zait , Thierry Cruanes, Cost-based query transformation in Oracle, Proceedings of the 32nd international conference on Very large data bases, September 12-15, 2006, Seoul, Korea
|
|
|
Lewis Girod , Kyle Jamieson , Yuan Mei , Ryan Newton , Stanislav Rost , Arvind Thiagarajan , Hari Balakrishnan , Samuel Madden, WaveScope: a signal-oriented data stream management system, Proceedings of the 4th international conference on Embedded networked sensor systems, October 31-November 03, 2006, Boulder, Colorado, USA
|
|
|
|
|
|
Rakesh Agrawal , Roberto Bayardo , Christos Faloutsos , Jerry Kiernan , Ralf Rantzau , Ramakrishnan Srikant, Auditing compliance with a Hippocratic database, Proceedings of the Thirtieth international conference on Very large data bases, p.516-527, August 31-September 03, 2004, Toronto, Canada
|
|
|
|
|
|
Benoit Dageville , Dinesh Das , Karl Dias , Khaled Yagoub , Mohamed Zait , Mohamed Ziauddin, Automatic SQL tuning in oracle 10g, Proceedings of the Thirtieth international conference on Very large data bases, p.1098-1109, August 31-September 03, 2004, Toronto, Canada
|
|
|
Tobias Kraft , Holger Schwarz , Ralf Rantzau , Bernhard Mitschang, Coarse-grained optimization: techniques for rewriting SQL statement sequences, Proceedings of the 29th international conference on Very large data bases, p.488-499, September 09-12, 2003, Berlin, Germany
|
|
|
|
|
|
Jayavel Shanmugasundaram , Eugene Shekita , Rimon Barr , Michael Carey , Bruce Lindsay , Hamid Pirahesh , Berthold Reinwald, Efficiently publishing relational data as XML documents, The VLDB Journal — The International Journal on Very Large Data Bases, v.10 n.2-3, p.133-154, September 2001
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Qi Cheng , Jarek Gryz , Fred Koo , T. Y. Cliff Leung , Linqi Liu , Xiaoyan Qian , K. Bernhard Schiefer, Implementation of Two Semantic Query Optimization Techniques in DB2 Universal Database, Proceedings of the 25th International Conference on Very Large Data Bases, p.687-698, September 07-10, 1999
|
|
|
|
|
|
|
|
|
|
|
|
Marko Vrhovnik , Holger Schwarz , Oliver Suhre , Bernhard Mitschang , Volker Markl , Albert Maier , Tobias Kraft, An approach to optimize data processing in business processes, Proceedings of the 33rd international conference on Very large data bases, September 23-27, 2007, Vienna, Austria
|
|
|
Andrew Witkowski , Srikanth Bellamkonda , Hua-Gang Li , Vince Liang , Lei Sheng , Wayne Smith , Sankar Subramanian , James Terry , Tsae-Feng Yu, Continuous queries in oracle, Proceedings of the 33rd international conference on Very large data bases, September 23-27, 2007, Vienna, Austria
|
|
|
|
|
|
|
|
|
|
|
|
Guangjun Xie , Qi Cheng , Jarek Gryz , Calisto Zuzarte, Some rewrite optimizations of DB2 XQuery navigation, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
|
|
|
Mingsheng Hong , Mirek Riedewald , Christoph Koch , Johannes Gehrke , Alan Demers, Rule-based multi-query optimization, Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, March 24-26, 2009, Saint Petersburg, Russia
|
|
|
|
|
|
|
|
|
|
|