|
ABSTRACT
In real-life applications, different subsets of data may have distinct statistical properties, e.g., various websites may have diverse visitation rates, different categories of stocks may have dissimilar price fluctuation patterns. For such applications, it can be fruitful to eliminate the commonly made single execution plan assumption and instead execute a query using several plans, each optimally serving a subset of data with particular statistical properties. Furthermore, in dynamic environments, data properties may change continuously, thus calling for adaptivity. The intriguing question is: can we have an execution strategy that (1) is plan-based to leverage on all the benefits of traditional plan-based systems, (2) supports multiple plans each customized for different subset of data, and yet (3) is as adaptive as "plan-less" systems like Eddies? While the recently proposed Query Mesh (QM) approach provides a foundation for such an execution paradigm, it does not address the question of adaptivity required for highly dynamic environments. In this work, we fill this gap by proposing a Self-Tuning Query Mesh (ST-QM) --- an adaptive solution for content-based multi-plan execution engines. ST-QM addresses adaptive query processing by abstracting it as a concept drift problem --- a well-known subject in machine learning. Such abstraction allows to discard adaptivity candidates (i.e., the cases indicating a change in the environment) early in the process if they are insignificant or not "worthwhile" to adapt to, and thus minimize the adaptivity overhead. A unique feature of our aproach is that all logical transformations to the execution strategy get translated into a single inexpensive physical operation --- the classifier change. Our experimental evaluation using a continuous query engine shows the performance benefits of ST-QM approach over the alternatives, namely the non-adaptive and the Eddies-based solutions.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Ms sql server. http://www.microsoft.com/sql/default.mspx.
|
 |
2
|
Arvind Arasu , Brian Babcock , Shivnath Babu , Jon McAlister , Jennifer Widom, Characterizing memory requirements for queries over continuous data streams, Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, June 03-05, 2002, Madison, Wisconsin
[doi> 10.1145/543613.543642]
|
| |
3
|
|
| |
4
|
|
| |
5
|
|
 |
6
|
Brian Babcock , Shivnath Babu , Mayur Datar , Rajeev Motwani , Jennifer Widom, Models and issues in data stream systems, Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, June 03-05, 2002, Madison, Wisconsin
[doi> 10.1145/543613.543615]
|
| |
7
|
I. M. Chakravarti and et.al. Handbook of Methods of Applied Statistics, volume I. John Wiley & Sons, 1967.
|
 |
8
|
|
| |
9
|
|
| |
10
|
|
| |
11
|
DB2. http://www.ibm.com/software/data/db2/.
|
 |
12
|
|
| |
13
|
Elke A. Rundensteiner , Luping Ding , Timothy Sutherland , Yali Zhu , Brad Pielech , Nishant Mehta, CAPE: continuous query engine with heterogeneous-grained adaptivity, Proceedings of the Thirtieth international conference on Very large data bases, p.1353-1356, August 31-September 03, 2004, Toronto, Canada
|
| |
14
|
|
 |
15
|
|
| |
16
|
|
 |
17
|
|
| |
18
|
|
| |
19
|
|
| |
20
|
Jeffrey C. Schlimmer et.al. Beyond incremental processing: Tracking concept drift. In AAAI, pages 502--507, 1986.
|
| |
21
|
K. Claypool et.al. Teddies: Trained eddies for reactive stream processing. In DASFAA, pages 220--234, 2008.
|
| |
22
|
|
| |
23
|
|
| |
24
|
|
 |
25
|
|
| |
26
|
|
| |
27
|
|
 |
28
|
|
| |
29
|
Oracle. http://www.oracle.com/index.html.
|
| |
30
|
|
| |
31
|
Q. Li et.al. Adaptively reordering joins during query execution. In ICDE, pages 26--35, 2007.
|
| |
32
|
|
 |
33
|
|
| |
34
|
|
| |
35
|
R. Nehme et.al. Query mesh: An efficient multi-route approach to query optimization. Technical Report CSD TR #08--009, Purdue University, April 2008.
|
 |
36
|
Shivnath Babu , Rajeev Motwani , Kamesh Munagala , Itaru Nishizawa , Jennifer Widom, Adaptive ordering of pipelined stream filters, Proceedings of the 2004 ACM SIGMOD international conference on Management of data, June 13-18, 2004, Paris, France
[doi> 10.1145/1007568.1007615]
|
| |
37
|
|
 |
38
|
|
 |
39
|
|
 |
40
|
|
| |
41
|
|
| |
42
|
A. Tsymbal. The problem of concept drift: definitions and related work. Technical Report TCD-CS-2004-15, The University of Dublin, Trinity College, 2004.
|
| |
43
|
V. Raman et.al. Using state modules for adaptive query processing. In ICDE, pages 353--365, 2003.
|
 |
44
|
|
| |
45
|
H. Wang and J. Pei. A random method for quantifying changing distributions in data streams. In PKDD, pages 684--691, 2005.
|
| |
46
|
|
|