| Modeling and exploiting query interactions in database systems |
| Full text |
Pdf
(2.71 MB)
|
Source
|
Conference on Information and Knowledge Management
archive
Proceeding of the 17th ACM conference on Information and knowledge management
table of contents
Napa Valley, California, USA
SESSION: DB: efficient maintenance and query optimization
table of contents
Pages 183-192
Year of Publication: 2008
ISBN:978-1-59593-991-3
|
|
Authors
|
|
Mumtaz Ahmad
|
University of Waterloo, Waterloo, ON, Canada
|
|
Ashraf Aboulnaga
|
University of Waterloo, Waterloo, ON, Canada
|
|
Shivnath Babu
|
Duke University, Durham, NC, USA
|
|
Kamesh Munagala
|
Duke University, Durham, NC, USA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 13, Downloads (12 Months): 151, Citation Count: 1
|
|
|
ABSTRACT
The typical workload in a database system consists of a mixture of multiple queries of different types, running concurrently and interacting with each other. Hence, optimizing performance requires reasoning about query mixes and their interactions, rather than considering individual queries or query types. In this paper, we show the significant impact that query interactions can have on workload performance. We present a new approach based on planning experiments and statistical modeling to capture the impact of query interactions. This approach requires no prior assumptions about the internal workings of the database system or the nature or cause of query interactions, making it portable across systems. As a concrete demonstration of the potential of capturing, modeling, and exploiting query interactions, we develop a novel interaction-aware query scheduler that targets report-generation workloads in Business Intelligence (BI) settings. Under certain assumptions, the schedule found by this scheduler is within a constant factor of optimal. An experimental evaluation with TPC-H queries on IBM DB2 demonstrates that our scheduler consistently outperforms (up to 4x) conventional schedulers that do not account for query interactions.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
 |
3
|
|
| |
4
|
M. Ahmad, A. Aboulnaga, S. Babu, and K. Munagala. Qshuffler: Getting the query mix right. In ICDE, 2008.
|
 |
5
|
Stefan Aulbach , Torsten Grust , Dean Jacobs , Alfons Kemper , Jan Rittinger, Multi-tenant databases for software as a service: schema-mapping techniques, Proceedings of the 2008 ACM SIGMOD international conference on Management of data, June 09-12, 2008, Vancouver, Canada
[doi> 10.1145/1376616.1376736]
|
| |
6
|
|
| |
7
|
Business objects. http://www.businessobjects.com/.
|
| |
8
|
|
| |
9
|
Cognos. http://www.cognos.com/.
|
| |
10
|
R. H. Conway, W. L. Maxwell, and M. L. W. Theory of scheduling. Addison-Wesley, 1967.
|
| |
11
|
CPLEX. http://www.ilog.com/products/cplex/.
|
 |
12
|
Sameh Elnikety , Erich Nahum , John Tracey , Willy Zwaenepoel, A method for transparent admission control and request scheduling in e-commerce web sites, Proceedings of the 13th international conference on World Wide Web, May 17-20, 2004, New York, NY, USA
[doi> 10.1145/988672.988710]
|
| |
13
|
|
| |
14
|
|
 |
15
|
|
| |
16
|
|
| |
17
|
J. Kyoung-Don Kang Son, S.H. Stankovic. Service differentiation in real-time main memory databases. In ISORC, 2002.
|
| |
18
|
W.-Y. Loh. Regression trees with unbiased variable selection and interaction detection. Statistica Sinica, 12:361--386, 2002.
|
| |
19
|
|
| |
20
|
|
 |
21
|
|
| |
22
|
|
| |
23
|
|
| |
24
|
|
 |
25
|
Prasan Roy , S. Seshadri , S. Sudarshan , Siddhesh Bhobe, Efficient and extensible algorithms for multi query optimization, Proceedings of the 2000 ACM SIGMOD international conference on Management of data, p.249-260, May 15-18, 2000, Dallas, Texas, United States
|
| |
26
|
H. J. Ryser. Combinatorial Mathematics. The Mathematical Association of America, 1963.
|
 |
27
|
|
| |
28
|
|
 |
29
|
|
| |
30
|
|
| |
31
|
Skewed TPC-D data generator. ftp://ftp.research.microsoft.com/users/viveknar/TPCDSkew/.
|
 |
32
|
|
| |
33
|
M. Welsh and D. Culler. Adaptive overload control for busy internet servers. In USITS, 2003.
|
| |
34
|
|
| |
35
|
Q. Zhang, L. Cherkasova, G. Mathews, W. Greene, and E. Smirni. R-capriccio: A capacity planning and anomaly detection tool for enterprise services with live workloads. In Middleware, 2007.
|
| |
36
|
|
|