ACM Home Page
Please provide us with feedback. Feedback
Cost-based query scrambling for initial delays
Full text PdfPdf (1.81 MB)
Source International Conference on Management of Data archive
Proceedings of the 1998 ACM SIGMOD international conference on Management of data table of contents
Seattle, Washington, United States
Pages: 130 - 141  
Year of Publication: 1998
ISBN:0-89791-995-5
Also published in ...
Authors
Tolga Urhan  University of Maryland
Michael J. Franklin  University of Maryland
Laurent Amsaleg  IRISA
Sponsors
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
SIGART: ACM Special Interest Group on Artificial Intelligence
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 43,   Citation Count: 52
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/276304.276317
What is a DOI?

ABSTRACT

Remote data access from disparate sources across a wide-area network such as the Internet is problematic due to the unpredictable nature of the communications medium and the lack of knowledge about the load and potential delays at remote sites. Traditional, static, query processing approaches break down in this environment because they are unable to adapt in response to unexpected delays. Query scrambling has been proposed to address this problem. Scrambling modifies query execution plans on-the-fly when delays are encountered during runtime. In its original formulation, scrambling was based on simple heuristics, which although providing good performance in many cases, were also shown to be susceptible to problems resulting from bad scrambling decisions. In this paper we address these shortcomings by investigating ways to exploit query optimization technology to aid in making intelligent scrambling choices. We propose three different approaches to using query optimization for scrambling. These approaches vary, for example, in whether they optimize for total work or response-time, and whether they construct partial or complete alternative plans. Using a two-phase randomized query optimizer, a distributed query processing simulator, and a workload derived from queries of the TPCD benchmark, we evaluate these different approaches and compare their ability to cope with initial delays in accessing remote sources. The results show that cost-based scrambling can effectively hide initial delays, but that in the absence of good predictions of expected delay durations, there are fundamental tradeoffs between risk aversion and effectiveness.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
ABF+97
L. Amsaleg, P. Bonnet, M. Franklin, A. Tomasic, and T. Urhan Improving Responsiveness for Wide-Area Data Access. IEEE Data Engineering Bulletin, Vol. 20, No. 3.
 
AFT98
 
AFTU96
 
Ant93
 
Bro92
K. Brown. Prpl: A database workload specification language. Master's thesis, University of Winsconsin, Madison, Winsconsin, 1992.
 
CBTY89
A. Chen, D. Brill, M. Templeton, and C. Yu. Distributed Query Processing in a Multiple Database System. IEEE Journal on Selected Areas in Communications, 7(3), 1989.
 
CD96
CG94
DSD95
GHK92
Gra93
IK90
 
INSS92
IW87
 
Kim95
 
MMM97
ML86
 
ONK+97
E Ozcan, S. Nural, P. Koksal, C. Evrendilek, A. Dogac. Dynamic Query optimization in Multidatabases. Data Engineering Bulletin, Vol. 20, No. 3, September, 1997.
RAH+96
SAC+79
 
SAD+95
Sha86
 
SLR97
 
THMB95
 
Tra97
Transaction Processing Council. TPC Benchmark D Standard Specification, Rev. 1.2.3.
 
TRV96

CITED BY  52

Collaborative Colleagues:
Tolga Urhan: colleagues
Michael J. Franklin: colleagues
Laurent Amsaleg: colleagues