|
ABSTRACT
Query processing and optimization in mediator systems that access distributed non-proprietary sources pose many novel problems. Cost-based query optimization is hard because the mediator does not have access to source statistics information and furthermore it may not be easy to model the source's performance. At the same time, querying remote sources may be very expensive because of high connection overhead, long computation time, financial charges, and temporary unavailability. We propose a cost-based optimization technique that caches statistics of actual calls to the sources and consequently estimates the cost of the possible execution plans based on the statistics cache. We investigate issues pertaining to the design of the statistics cache and experimentally analyze various tradeoffs. We also present a query result caching mechanism that allows us to effectively use results of prior queries when the source is not readily available. We employ the novel invariants mechanism, which shows how semantic information about data sources may be used to discover cached query results of interest.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
[4] S. Adah and V.S. Subrahmanian. (1994) Amalgamating Knowledge Bases, III: Algorithms, data structures and query processing. Technical Report CS-TR-3124, Computer Science Department, University of Maryland, Aug. 1993. Accepted for publication in Journal of Logic Programming. (http://www.cs.umd.edu/projects/hermes/publications/abstracts/akbiii.ps)
|
| |
5
|
|
| |
6
|
[6] S. Adah and V.S. Subrahmanian. (1995) Intelligent Caching in Heterogeneous Reasoning and Mediator Systems, Proc. of the Second International Conference on Building and Sharing of Very Large-Scale Knowledge Bases (cd. N. Mars), pps 247-256, IOS Press, Twente, The Netherlands, May 1995.
|
 |
7
|
|
| |
8
|
|
| |
9
|
|
| |
10
|
[10] U. Dayal and H. Hwang. (1984) View definition and generalization for database integration in a multidatabase system., IEEE Trans. Software Eng., SE- 10(6):628-644, 1984.
|
| |
11
|
|
| |
12
|
[12] Ashish Gupta, Dinesh Katiyar, and Inderpal Singh Mumick. (1992) Counting Solutions to the View Maintenance Problem., In Workshop on Deductive Databases, JICSLP, 1992.
|
 |
13
|
Ashish Gupta , Inderpal Singh Mumick , V. S. Subrahmanian, Maintaining views incrementally, Proceedings of the 1993 ACM SIGMOD international conference on Management of data, p.157-166, May 25-28, 1993, Washington, D.C., United States
|
 |
14
|
|
| |
15
|
|
| |
16
|
[16] Laks V.S. Lakshmanan, F. Sadri amd I.N. Subramanian. (1993) On the logical foundations of schema integration and evolution in Heterogeneous Database Systems. , Proc. DOOD-93, Phoenix, Arizona.
|
| |
17
|
[17] Laks V.S. Lakshmanan, F. Sadri and I.N. Subramanian. (1995) Logic and Algebraic Languages for Interoperability in Multidatabase Systems, submitted to Journal of Logic Programming.
|
| |
18
|
|
 |
19
|
James J. Lu , Guido Moerkotte , Joachim Schue , V. S. Subrahmanian, Efficient maintenance of materialized mediated views, Proceedings of the 1995 ACM SIGMOD international conference on Management of data, p.340-351, May 22-25, 1995, San Jose, California, United States
|
| |
20
|
|
| |
21
|
|
| |
22
|
|
| |
23
|
[23] M. Scholl, C. Laasch, and M. Tresch. (1991) Updatable views in object-oriented databases., In Proc. Int. Conf. on Deductive and Object-Oriented Databases (DOOD), 1991.
|
 |
24
|
|
 |
25
|
|
 |
26
|
Michael Stonebraker , Anant Jhingran , Jeffrey Goh , Spyros Potamianos, On rules, procedure, caching and views in data base systems, Proceedings of the 1990 ACM SIGMOD international conference on Management of data, p.281-290, May 23-26, 1990, Atlantic City, New Jersey, United States
|
 |
27
|
|
| |
28
|
[28] V.S. Subrahmanian, S. Adah, A. Brink, R. Emery, J. Lu, A. Rajput, T.J. Rogers, R. Ross. (1994) HERMES: A Heterogeneous Reasoning and Mediator System, submitted for publication. (http://www.cs.umd.edu/projects/hermes/ overview/paper)
|
| |
29
|
|
| |
30
|
[30] G. Wiederhold, S. Jajodia, and W. Litwin. (1993) Integrating temporal data in a heterogeneous environment. , In Temporal Databases Benjamin/Cummings, Jan. 1993.
|
 |
31
|
P. Griffiths Selinger , M. M. Astrahan , D. D. Chamberlin , R. A. Lorie , T. G. Price, Access path selection in a relational database management system, Proceedings of the 1979 ACM SIGMOD international conference on Management of data, May 30-June 01, 1979, Boston, Massachusetts
[doi> 10.1145/582095.582099]
|
| |
32
|
|
| |
33
|
|
| |
34
|
|
| |
35
|
|
 |
36
|
|
| |
37
|
|
 |
38
|
|
| |
39
|
|
| |
40
|
[40] S. Adah and X. Qian. (1995) Query Transformation in Heterogeneous Reasoning Systems., Submitted for publication.
|
| |
41
|
[41] S. Chawathe, H. Garcia-Molina, J. Hammer, K. Ireland, Y. Papakonstantinou, J. Ullman, and J. Widom. (1994) The TSIMMIS Project: Integration of Heterogeneous Information Sources., In Proceedings of IPSJ Conference, Tokyo, Japan, October 1994. (Also available via anonymous FTP from host db.stanford.edu, file/pub/chawathe/1994/tsimmis-overview.ps.)
|
| |
42
|
|
| |
43
|
|
| |
44
|
[44] H. Lu and B.-C. Ooi and C.-H. Goh. (1993) Multidatabase Query Optimization: Issues and Solutions., Proc. RIDE-IMS '93, pp. 137-143, 1993.
|
CITED BY 85
|
|
Khaled A. Arisha , Fatma Ozcan , Robert Ross , V. s. Subrahmanian , Thomas Eiter , Sarit Kraus, Impact: A Platform for Collaborating Agents, IEEE Intelligent Systems, v.14 n.2, p.64-72, March 1999
|
|
|
Zaiqing Nie , Subbarao Kambhampati , Ullas Nambiar , Sreelakshmi Vaddi, Mining source coverage statistics for data integration, Proceedings of the 3rd international workshop on Web information and data management, November 09-01, 2001, Atlanta, Georgia, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Diego Calvanese , Giuseppe De Giacomo , Maurizio Lenzerini , Moshe Y. Vardi, Rewriting of regular expressions and regular path queries, Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, p.194-204, May 31-June 03, 1999, Philadelphia, Pennsylvania, United States
|
|
|
Vladimir Zadorozhny , Louiqa Raschid , Maria Esther Vidal , Tolga Urhan , Laura Bright, Efficient evaluation of queries in a mediator for WebSources, Proceedings of the 2002 ACM SIGMOD international conference on Management of data, June 03-06, 2002, Madison, Wisconsin
|
|
|
Todd Millstein , Alon Levy , Marc Friedman, Query containment for data integration systems, Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, p.67-75, May 15-18, 2000, Dallas, Texas, United States
|
|
|
Diego Calvanese , Moshe Y. Vardi , Giuseppe de Giacomo , Maurizio Lenzerini, View-based query processing for regular path queries with inverse, Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, p.58-66, May 15-18, 2000, Dallas, Texas, United States
|
|
|
|
|
|
Louis Degenaro , Arun Iyengar , Ilya Lipkind , Isabelle Rouvellou, A middleware system which intelligently caches query results, IFIP/ACM International Conference on Distributed systems platforms, p.24-44, April 03-07, 2000, New York, New York, United States
|
|
|
|
|
|
|
|
|
Naveen Ashish , Craig A. Knoblock , Cyrus Shahabi, Selectively materializing data in mediators by analyzing source structure, query distribution and maintenance cost, Proceedings of the 2nd international workshop on Web information and data management, p.33-37, November 02-06, 1999, Kansas City, Missouri, United States
|
|
|
|
|
|
|
|
|
Dennis Pedersen , Karsten Riis , Torben Bach Pedersen, Query optimization for OLAP-XML federations, Proceedings of the 5th ACM international workshop on Data Warehousing and OLAP, p.57-64, November 08-08, 2002, McLean, Virginia, USA
|
|
|
Qiong Luo , Sailesh Krishnamurthy , C. Mohan , Hamid Pirahesh , Honguk Woo , Bruce G. Lindsay , Jeffrey F. Naughton, Middle-tier database caching for e-business, Proceedings of the 2002 ACM SIGMOD international conference on Management of data, June 03-06, 2002, Madison, Wisconsin
|
|
|
|
|
|
|
|
|
Diego Calvanese , Giuseppe De Giacomo , Maurizio Lenzerini , Moshe Y. Vardi, View-based query containment, Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, p.56-67, June 09-11, 2003, San Diego, California
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
L. M. Haas , P. M. Schwarz , P. Kodali , E. Kotlar , J. E. Rice , W. C. Swope, DiscoveryLink: a system for integrated access to life sciences data sources, IBM Systems Journal, v.40 n.2, p.489-511, February 2001
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Wen-Syan Li , Daniel C. Zilio , Vishal S. Batra , Calisto Zuzarte , Inderpal Narang, Load balancing and data placement for multi-tiered database systems, Data & Knowledge Engineering, v.62 n.3, p.523-546, September, 2007
|
|
|
|
|
|
Brenton Louie , Peter Mork , Fernando Martin-Sanchez , Alon Halevy , Peter Tarczy-Hornoch, Methodological Review: Data integration and genomic medicine, Journal of Biomedical Informatics, v.40 n.1, p.5-16, February, 2007
|
|
|
Mehmet Altinel , Christof Bornhövd , Sailesh Krishnamurthy , C. Mohan , Hamid Pirahesh , Berthold Reinwald, Cache tables: paving the way for an adaptive database cache, Proceedings of the 29th international conference on Very large data bases, p.718-729, September 09-12, 2003, Berlin, Germany
|
|
|
Alberto Pan , Juan Raposo , Manuel Álvarez , Paula Montoto , Vicente Orjales , Justo Hidalgo , Lucía Ardao , Anastasio Molano , Ángel Viña, The denodo data integration platform, Proceedings of the 28th international conference on Very Large Data Bases, p.986-989, August 20-23, 2002, Hong Kong, China
|
|
|
|
|
|
K. Selçuk Candan , Divyakant Agrawal , Wen-Syan Li , Oliver Po , Wang-Pin Hsiung, View invalidation for dynamic content caching in multitiered architectures, Proceedings of the 28th international conference on Very Large Data Bases, p.562-573, August 20-23, 2002, Hong Kong, China
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Mahmoud El Samad , Julien Gossa , Franck Morvan , Abdelkader Hameurlain , Jean-Marc Pierson , Lionel Brunie, A monitoring service for large-scale dynamic query optimisation in a grid environment, International Journal of Web and Grid Services, v.4 n.2, p.222-246, June 2008
|
|
|
|
|
|
|
|
|
|
|