|
ABSTRACT
This paper describes the techniques used to optimize relational queries in the SDD-1 distributed database system. Queries are submitted to SDD-1 in a high-level procedural language called Datalanguage. Optimization begins by translating each Datalanguage query into a relational calculus form called an envelope, which is essentially an aggregate-free QUEL query. This paper is primarily concerned with the optimization of envelopes.
Envelopes are processed in two phases. The first phase executes relational operations at various sites of the distributed database in order to delimit a subset of the database that contains all data relevant to the envelope. This subset is called a reduction of the database. The second phase transmits the reduction to one designated site, and the query is executed locally at that site.
The critical optimization problem is to perform the reduction phase efficiently. Success depends on designing a good repertoire of operators to use during this phase, and an effective algorithm for deciding which of these operators to use in processing a given envelope against a given database. The principal reduction operator that we employ is called a semijoin. In this paper we define the semijoin operator, explain why semijoin is an effective reduction operator, and present an algorithm that constructs a cost-effective program of semijoins, given an envelope and a database.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
 |
2
|
|
| |
3
|
BERNSTEIN, P.A., AND GOODMAN, N. The power of natural semijoins. SIAM J. Comput. 10, 4 (Nov. 1981).
|
| |
4
|
BERNSTEIN, P.A., AN}) GOODMAN, N. The power of inequality semi-joins. To appear in Inf. Syst.
|
 |
5
|
|
 |
6
|
|
| |
7
|
CHIU, D.M. Optimal query interpretation for distributed databases. Ph.D. Dissertation, Harvard Univ., Cambridge, Mass., Dec. 1979.
|
 |
8
|
|
| |
9
|
|
 |
10
|
|
| |
11
|
GOODMAN, N., BERNSTEIN, P.A., WONG, W., REEVE, C. L., AND ROTHNIE, JR., J.B. Query processing in SDD-1. Tech. Rep. CCA-79-06, Computer Corp. of America, Cambridge, Mass., Oct. 1979.
|
 |
12
|
|
 |
13
|
|
 |
14
|
|
| |
15
|
HELD, G.D., STONEBRAKER, M., AND WONG, W. INGRES--A relational database management system. In Proc. AFIPS 1975 NCC, vol. 44, AFIPS Press, Arlington, Va., pp. 409-416.
|
| |
16
|
|
| |
17
|
HEVNER, A.R., AND YAO, S.B. Query processing in distributed database systems. IEEE Trans. Softw. Eng. SE.5, 3 (May 1979), 177-187.
|
| |
18
|
KARP, R.M., AND MILLER, R.E. Properties of a model for parallel computation: Determinary, termination, queueing. SIAM J. Appl. Math. 14 (Nov. 1966), 1390-1411.
|
| |
19
|
KERSCHBERG, L., TING, P.D., AND YAO, S.B. Query optimization in star computer networks. Unpublished Rep., Bell Laboratories, Holmdel, N.J., 1980.
|
| |
20
|
MARILL, T., AND STERN, D.H. The datacomputer: A network data utility. In Proc. AFIPS 1975 NCC, vol. 44, AFIPS Press, Arlington, Va.
|
 |
21
|
|
| |
22
|
ROTHNIE, J.B. Private communication. See also: A distributed database management system for command and control applications: Semiannual technical report 2. Tech. Rep. CCA-80-03, Computer Corp. of America, Cambridge, Mass., Jan. 1980.
|
 |
23
|
J. B. Rothnie, Jr. , P. A. Bernstein , S. Fox , N. Goodman , M. Hammer , T. A. Landers , C. Reeve , D. W. Shipman , E. Wong, Introduction to a system for distributed databases (SDD-1), ACM Transactions on Database Systems (TODS), v.5 n.1, p.1-17, March 1980
[doi> 10.1145/320128.320129]
|
| |
24
|
ROTHNIE, J.B., AND GOODMAN, N. An overview of the preliminary design of SDD-1. In Proc. Berkeley Workshop Distributed Data Management and Computer Newtorks, May 1977, pp. 39-57.
|
| |
25
|
Ro~ItNIz, J.B., A~D GOOD,N, N. A survey of research and develoment in distributed database systems. In Proc. 3rd Int. Conf. Very Large Databases, Oct. 1977, pp. 48-61.
|
| |
26
|
ROTHNIE, J.B., GOODMAN, N., AND MARILL, T. Database architecture in a network environment. In Protocols and Techniques for Data Communication Networks, F.F. Kuo, Ed. Prentice-Hall, Englewood Cliffs, N.J., 1980.
|
| |
27
|
SELINGER, P.G., AND ADIBA, M. Access path selection in distributed database management systems. In Proc. Int. Conf. Databases, Univ. Aberdeen, Aberdeen, Scotland, July 1980.
|
 |
28
|
P. Griffiths Selinger , M. M. Astrahan , D. D. Chamberlin , R. A. Lorie , T. G. Price, Access path selection in a relational database management system, Proceedings of the 1979 ACM SIGMOD international conference on Management of data, May 30-June 01, 1979, Boston, Massachusetts
[doi> 10.1145/582095.582099]
|
 |
29
|
|
| |
30
|
WoNt, E. Retrieving dispersed data from SDD-1. In Proc. Berkeley Workshop Distributed Data Management and Computer Networks, May 1977, pp. 217-235.
|
 |
31
|
|
 |
32
|
|
| |
33
|
Yu, C.T., AND OZSOYOOLU, M.Z. An algorithm for tree-query membership of a distributed query. In Proc. Compsac79, IEEE Computer Society, Nov. 1979, pp. 306-312.
|
| |
34
|
Yu, C.T., AND OZSOYOGLU, M.Z. On determining tree query membership of a distributed query. Tech. Rep. TR80-1, Dep. Computing Science, Univ. Alberta, Edmonton, Alta., Canada, Jan. 1980.
|
CITED BY 129
|
|
|
|
|
|
|
|
Praveen Seshadri , Joseph M. Hellerstein , Hamid Pirahesh , T. Y. Cliff Leung , Raghu Ramakrishnan , Divesh Srivastava , Peter J. Stuckey , S. Sudarshan, Cost-based optimization for magic: algebra and implementation, ACM SIGMOD Record, v.25 n.2, p.435-446, June 1996
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Diane Jantz , E. A. Unger , R. McBride , Jacob Slonim, Query processing in a distributed data base, Proceedings of the 1983 ACM SIGSMALL symposium on Personal and small computers, p.237-244, December 07-09, 1983, San Diego, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Sunil Choenni , Henk M. Blanken , Thiel Chang, On the automation of physical database design, Proceedings of the 1993 ACM/SIGAPP symposium on Applied computing: states of the art and practice, p.358-367, February 14-16, 1993, Indianapolis, Indiana, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Alon Y. Halevy , Naveen Ashish , Dina Bitton , Michael Carey , Denise Draper , Jeff Pollock , Arnon Rosenthal , Vishal Sikka, Enterprise information integration: successes, challenges and controversies, Proceedings of the 2005 ACM SIGMOD international conference on Management of data, June 14-16, 2005, Baltimore, Maryland
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Michael Stonebraker , Paul M. Aoki , Witold Litwin , Avi Pfeffer , Adam Sah , Jeff Sidell , Carl Staelin , Andrew Yu, Mariposa: a wide-area distributed database system, The VLDB Journal — The International Journal on Very Large Data Bases, v.5 n.1, p.048-063, January 1996
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|