ACM Home Page
Please provide us with feedback. Feedback
Modeling and exploiting query interactions in database systems
Full text PdfPdf (2.71 MB)
Source
Conference on Information and Knowledge Management archive
Proceeding of the 17th ACM conference on Information and knowledge management table of contents
Napa Valley, California, USA
SESSION: DB: efficient maintenance and query optimization table of contents
Pages 183-192  
Year of Publication: 2008
ISBN:978-1-59593-991-3
Authors
Mumtaz Ahmad  University of Waterloo, Waterloo, ON, Canada
Ashraf Aboulnaga  University of Waterloo, Waterloo, ON, Canada
Shivnath Babu  Duke University, Durham, NC, USA
Kamesh Munagala  Duke University, Durham, NC, USA
Sponsors
ACM: Association for Computing Machinery
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 20,   Downloads (12 Months): 153,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1458082.1458109
What is a DOI?

ABSTRACT

The typical workload in a database system consists of a mixture of multiple queries of different types, running concurrently and interacting with each other. Hence, optimizing performance requires reasoning about query mixes and their interactions, rather than considering individual queries or query types. In this paper, we show the significant impact that query interactions can have on workload performance. We present a new approach based on planning experiments and statistical modeling to capture the impact of query interactions. This approach requires no prior assumptions about the internal workings of the database system or the nature or cause of query interactions, making it portable across systems. As a concrete demonstration of the potential of capturing, modeling, and exploiting query interactions, we develop a novel interaction-aware query scheduler that targets report-generation workloads in Business Intelligence (BI) settings. Under certain assumptions, the schedule found by this scheduler is within a constant factor of optimal. An experimental evaluation with TPC-H queries on IBM DB2 demonstrates that our scheduler consistently outperforms (up to 4x) conventional schedulers that do not account for query interactions.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
3
 
4
M. Ahmad, A. Aboulnaga, S. Babu, and K. Munagala. Qshuffler: Getting the query mix right. In ICDE, 2008.
5
 
6
 
7
Business objects. http://www.businessobjects.com/.
 
8
 
9
Cognos. http://www.cognos.com/.
 
10
R. H. Conway, W. L. Maxwell, and M. L. W. Theory of scheduling. Addison-Wesley, 1967.
 
11
CPLEX. http://www.ilog.com/products/cplex/.
12
 
13
 
14
15
 
16
 
17
J. Kyoung-Don Kang Son, S.H. Stankovic. Service differentiation in real-time main memory databases. In ISORC, 2002.
 
18
W.-Y. Loh. Regression trees with unbiased variable selection and interaction detection. Statistica Sinica, 12:361--386, 2002.
 
19
 
20
21
 
22
 
23
 
24
25
 
26
H. J. Ryser. Combinatorial Mathematics. The Mathematical Association of America, 1963.
27
 
28
29
 
30
 
31
Skewed TPC-D data generator. ftp://ftp.research.microsoft.com/users/viveknar/TPCDSkew/.
32
 
33
M. Welsh and D. Culler. Adaptive overload control for busy internet servers. In USITS, 2003.
 
34
 
35
Q. Zhang, L. Cherkasova, G. Mathews, W. Greene, and E. Smirni. R-capriccio: A capacity planning and anomaly detection tool for enterprise services with live workloads. In Middleware, 2007.
 
36


Collaborative Colleagues:
Mumtaz Ahmad: colleagues
Ashraf Aboulnaga: colleagues
Shivnath Babu: colleagues
Kamesh Munagala: colleagues