|
ABSTRACT
The join operation is one of the fundamental relational database query operations. It facilitates the retrieval of information from two different relations based on a Cartesian product of the two relations. The join is one of the most diffidult operations to implement efficiently, as no predefined links between relations are required to exist (as they are with network and hierarchical systems). The join is the only relational algebra operation that allows the combining of related tuples from relations on different attribute schemes. Since it is executed frequently and is expensive, much research effort has been applied to the optimization of join processing. In this paper, the different kinds of joins and the various implementation techniques are surveyed. These different methods are classified based on how they partition tuples from different relations. Some require that all tuples from one be compared to all tuples from another; other algorithms only compare some tuples from each. In addition, some techniques perform an explicit partitioning, whereas others are implicit.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
|
| |
3
|
ATKINSON, M., BANCILHON, F., DEWITT, D., DITTRICH, K., MAIER, D., AND ZDOlXIIK, S. 1989 The object-oriented database system manifesto. In Proceedings of the Deductive an d Object Oriented Database8 Conference.
|
| |
4
|
BABA, T., SAITO, H., AND YAO, S. B. 1987. A network algorithm for relational database operations. In International Workshop on Database Machines, pp. 257-270.
|
 |
5
|
|
| |
6
|
BANCILHON, F., RICHARD, P., AND SCHOLL, M 1983. VERSO: The relational database machine. In Advanced Database Machine Architecture, D. Hsiao, Ed., Prentice-Hall, Englewood Cliffs, N.J.
|
| |
7
|
|
| |
8
|
|
| |
9
|
|
| |
10
|
BARU, C. K, FRIEDER, 0., DANDLUR, D., AND SEGAL, M. 1987. Join on a cube: Analysis, simulation and implementation. In Proceedings of International Workshop on Database Machines (Dec.), pp. 74-87.
|
| |
11
|
BEEm, C., AND VARDI, M. Y. 1981. On the properties of join dependencies. In Advances zn Database Theory, vol. 1. Plenum Publishing, New York, pp. 25-72.
|
| |
12
|
|
 |
13
|
|
| |
14
|
BENTLEY, J. L. 1979. Multidimensional binary search trees in database applications. IEEE Trans. Softw. Eng. SE-5, 4 (July).
|
| |
15
|
BENTLEY, J. L., AND KUNG, H. T. 1979. A tree machine for searching problems. IEEE Conference on Parallel Processing, pp. 257-266.
|
 |
16
|
|
| |
17
|
BERNSTmN, P. A., A~D GOODMAN, N. 1979a The theory of semijoins. Computer Corporation of America Rep. 79-27, Cambridge, Mass.
|
| |
18
|
BERNSTmN, P. A., AND GOODMAN, N. 1979b. Inequality semijoins. Computer Corporation of America Rep. 79-28, Cambridge, Mass.
|
| |
19
|
BERNSTEIN, P A, AND GOODMAN, N. 1980. The power of inequality semijolns Aiken Computa~ tion Lab Rep. 12-80, Harvard Umv., Cambridge, Mass.
|
| |
20
|
|
| |
21
|
|
 |
22
|
|
| |
23
|
BLASGEN, M. W., AND ESWARAN, K. P. 1977. Storage and access in relational databases. IBM Syst. J. 16, 4, 363-377
|
 |
24
|
|
| |
25
|
|
| |
26
|
BmTTON LEE, INC. 1981. IDM 500: Intelligent Database Machine Product Description.
|
 |
27
|
|
| |
28
|
Michael Carey , George Lapis , Eugene Shekita , Bruce Lindsay , John McPherson, An incremental join attachment for Starburst, Proceedings of the sixteenth international conference on Very large databases, p.662-673, September 1990, Brisbane, Australia
|
| |
29
|
|
 |
30
|
|
| |
31
|
|
| |
32
|
|
| |
33
|
C~mSTODUL^KIS, S. 1985 Estimating block transfer and join sizes In Proceedings of SIGMOD CODASYL, 1971. Data Base Task Group Report.
|
 |
34
|
|
| |
35
|
ConD, E. F. 1972. Relational completeness of data base sublanguages. In Data Base Systems. Prentice-Hall, Englewood Cliffs, N.J., pp. 65-98.
|
| |
36
|
|
| |
37
|
|
| |
38
|
DATE, C. J. 1983. The outer join. In Proceedings of International Conference on Databases (Cambridge, England), pp. 76-106.
|
| |
39
|
DAYAL, U. 1985. Query processing in a multidatabase system. In Query Processzng in Database Systems, W. Kim, D. S. Reiner, and D. S. Batory, Eds. Springer-Verlag, New York, pp. 81-108.
|
| |
40
|
|
| |
41
|
|
| |
42
|
DESHPANDE, V., LARSON, P.-A., AND MARTIN, T. P. 1990. Parallel join algorithms for nested relations on shared-memory multiprocessors. IEEE Symposium on Parallel and Distributed Processing, pp. 344-351.
|
| |
43
|
DEWITT, D., AND GERBER, R. J. 1985. Multiprocessor hash-based join algorithms. In Proceedings of Conference on Very Large Data Bases, pp. 151-164.
|
 |
44
|
|
 |
45
|
David J DeWitt , Randy H Katz , Frank Olken , Leonard D Shapiro , Michael R Stonebraker , David Wood, Implementation techniques for main memory database systems, Proceedings of the 1984 ACM SIGMOD international conference on Management of data, June 18-21, 1984, Boston, Massachusetts
|
| |
46
|
|
| |
47
|
EHRENSBERGER, M. J. 1984. The DBC/1012 database computer's systems-architecture, components, and performance. In Minnowbrook Workshop on Database Machines.
|
| |
48
|
|
| |
49
|
|
| |
50
|
EPSTEIN, R., AND STONEBRAKER, M. 1980. Analysis of distributed database processing strategies. In Proceedings of Conference on Very Large Data Bases, pp. 92-101.
|
| |
51
|
EPSTEIN, R. 1982. Query Processing Techniques for Distributed, Relational Database Systems. University Microfilms International, Ann Arbor, Mich.
|
 |
52
|
|
| |
53
|
|
 |
54
|
Shinya Fushimi , Masaru Kitsuregawa , Masaya Nakayama , Hidehiko Tanaka , Tohru Moto-oka, Algorithm and performance evaluation of adaptive multidimensional clustering technique, Proceedings of the 1985 ACM SIGMOD international conference on Management of data, p.308-318, May 1985, Austin, Texas, United States
|
| |
55
|
|
| |
56
|
GERBER, R. J. 1986. Datafiow query processing using multiprocessor hash-partitioned algorithms. Computer Sciences Tech. Rep. No. 672, Computer Sciences Dept., Univ. of Wisconsin, Madison, Wisc.
|
| |
57
|
|
| |
58
|
GOODMAN, J. R. 1981. An investigation of multiprocessor structures and algorithms for database management. Tech. Rep. UCB/ERL, M81/33, Univ. oF California, Berkeley.
|
 |
59
|
|
| |
60
|
|
| |
61
|
|
 |
62
|
|
| |
63
|
|
| |
64
|
|
| |
65
|
|
 |
66
|
|
| |
67
|
HS~AO, D. K. (ED.) 1980. Collected Readings on a Database Computer (DBC). The Ohio State University, Columbus, Ohio
|
| |
68
|
|
| |
69
|
HURSC~, J. L. 1989. Relational joins: More than meets the eye. Database Program. Design 2, 12 (Dec.), 64-70.
|
| |
70
|
HURSON, A. R. 1981. An associative backend for data base management. IEEE Workshop on Computer Architecture for Pattern Analysis and Image Data Base Management, pp. 225-230.
|
| |
71
|
HURSON, A. R. 1986. VLSI time/space complexity of an associative parallel join module. In International Conference on Parallel Processing, pp. 379-386.
|
| |
72
|
|
| |
73
|
IBM 1978. IMS/VS General Information Manual GH20-1260. White Plains, New York
|
| |
74
|
INTEL CORPORATION. iDBP DBMS Reference Manual, Order No. 222100.
|
 |
75
|
|
| |
76
|
KAMBAYASm, Y 1985 Processing cyclic queries. In Query Processing in Database Systems, W. Kim, D. S. Reiner, and D. S. Batory, Eds. Sprmger-Verlag, New York, pp. 62-78.
|
| |
77
|
|
| |
78
|
KANG, H., AND ROUSSOPULOS, N. 1987b. On the cost-effectiveness of a semijoin in query processing. In COMPSAC, pp. 531-537.
|
 |
79
|
|
 |
80
|
|
| |
81
|
|
| |
82
|
|
| |
83
|
KITSUREGAWA, M., TANAKA, H., AND MOTO-OKA, T. 1983. Application of hash to database machine and its architecture. New Generation Com~ut. 1, }.
|
| |
84
|
|
| |
85
|
|
| |
86
|
|
| |
87
|
|
 |
88
|
|
 |
89
|
|
| |
90
|
|
| |
91
|
|
| |
92
|
|
 |
93
|
|
 |
94
|
Richard J. Lipton , Jeffrey F. Naughton , Donovan A. Schneider, Practical selectivity estimation through adaptive sampling, Proceedings of the 1990 ACM SIGMOD international conference on Management of data, p.1-11, May 23-26, 1990, Atlantic City, New Jersey, United States
|
| |
95
|
Lu, H , AND CAREY, M. 1985. Some experimental results on distributed join algorithms in a local network. In Proceedings of Conference on Very Large Databases, pp 292-304.
|
| |
96
|
|
| |
97
|
|
| |
98
|
|
| |
99
|
MENEZES, B. L., THADANI, K., DALE, A. G., AND JENEVmN, R 1987. Design of a HyperKYK- LOS-based multiprocessor architecture for high-performance join operations. In International Workshop on Database Machines, pp. 74-87.
|
| |
100
|
|
| |
101
|
|
| |
102
|
|
| |
103
|
|
| |
104
|
|
| |
105
|
|
| |
106
|
|
| |
107
|
0MIECINSKI, E., AND SHONKWILER, R. 1990. Parallel join processing using nonclustered indexes for a shared memory environment. In IEEE Symposium on Parallel and Distributed Processing, pp. 144-159.
|
 |
108
|
|
| |
109
|
|
| |
110
|
|
 |
111
|
|
| |
112
|
|
 |
113
|
|
| |
114
|
|
| |
115
|
PRAMANIK, S., AND FOTOUHI, F. 1985. An index database machine: An efficient m-way join processor. In Proceedings of Hawaii International Conference on System Sciences, vol. 1, pp. 330-338.
|
 |
116
|
|
| |
117
|
|
| |
118
|
|
| |
119
|
|
 |
120
|
James P. Richardson , Hongjun Lu , Krishna Mikkilineni, Design and evaluation of parallel pipelined join algorithms, Proceedings of the 1987 ACM SIGMOD international conference on Management of data, p.399-409, May 27-29, 1987, San Francisco, California, United States
|
 |
121
|
|
 |
122
|
Arnon Rosenthal , Cesar Galindo-Legaria, Query graphs, implementing trees, and freely-reorderable outerjoins, Proceedings of the 1990 ACM SIGMOD international conference on Management of data, p.291-299, May 23-26, 1990, Atlantic City, New Jersey, United States
|
| |
123
|
|
| |
124
|
RUDOLPH, J. A. 1972a. A production implementation of an associative array processor. In Proceedings of Fall ,)o~nt Computer Conference, pp. 229-241.
|
| |
125
|
|
 |
126
|
|
| |
127
|
SAKAI, H., IWATA, K., KAMIYA, S., ABE, M., TANAKA, A., SHIBAYAMA, S., AND MURAKAMI, K. 1984. Design and implementation of the relational database engine. In Proceedings of Conference on Fifth Generation Computer Systems, pp. 419-435.
|
 |
128
|
|
 |
129
|
|
 |
130
|
P. Griffiths Selinger , M. M. Astrahan , D. D. Chamberlin , R. A. Lorie , T. G. Price, Access path selection in a relational database management system, Proceedings of the 1979 ACM SIGMOD international conference on Management of data, May 30-June 01, 1979, Boston, Massachusetts
[doi> 10.1145/582095.582099]
|
 |
131
|
|
 |
132
|
|
| |
133
|
|
| |
134
|
|
 |
135
|
|
| |
136
|
|
| |
137
|
TONG, F., AND YAO, S. B., 1982. Performance analysis of database join processors In National Computer Conference, pp. 627-637.
|
| |
138
|
|
| |
139
|
VALDURmZ, P. 1982. Semijoin algorithms for distributed databases. In 3rd International Symposium o7~ Distributed Databases.
|
| |
140
|
VALDUmEZ, P 1986 Optimization of complex database queries using join indices. Database Eng. 9, 4 (Dec.), 10-16.
|
 |
141
|
|
| |
142
|
VALDURmZ, P., AND BORAL, H. 1986. Evaluation of recurmve queries using join indices. In Proceedings of Conference on Expert Database Systems.
|
| |
143
|
V^LDURIEZ, P, AND GARDAmN, G. 1982. Multiprocessor join algorithms of relations In Improving Usability and Responsiveness. Academic Press, New York, pp. 219-237
|
 |
144
|
|
 |
145
|
|
| |
146
|
|
| |
147
|
|
| |
148
|
|
| |
149
|
WHANG, K.-Y., WIEDERHOLD, G., AND SAGALOWICZ, D. 1985. The property of separability and its application to physical database design. In Query Processing *n Database Systems, W Kim, D. S Reiner, and D S. Batory, Eds. Springer- Verlag, New York, pp. 297-317.
|
| |
150
|
YAO, S B., TONa, F., ANn SHENG Y. Z. 1981 The system architecture of a data base machine (DBM). IEEE Database Eng. Bull. 4 2, 53-62.
|
| |
151
|
|
| |
152
|
|
| |
153
|
|
| |
154
|
C. T. Yu , K.-C. Guh , W. Zhang , M. Templeton , D. Brill , A.L.P. Chen, Algorithms to process distributed queries in fast local networks, IEEE Transactions on Computers, v.36 n.10, p.1153-1164, Oct. 1987
[doi> 10.1109/TC.1987.1676856]
|
| |
155
|
|
 |
156
|
|
 |
157
|
|
| |
158
|
|
| |
159
|
BROWNSMITH, J D., AND SU, S. Y. W. 1980. Performance analysis of the equijoin operation m the MICRONET computer system In Proceedrags oflCC, pp. 264 268.
|
| |
160
|
CHASE, K. 1981. Join graphs and acyclic database schemes. In Proceedings of Conference on Very Large Data Bases, pp 95-100
|
 |
161
|
|
| |
162
|
|
 |
163
|
|
| |
164
|
DEW,T% D. J. 1979. DIRECT: A multiprocessor organization for supporting relational database management systems. IEEE Trans. Comput. C-28, 6, 395-406.
|
| |
165
|
DEWITT, D. J., NAUGHTON, J. F., AND SCHNEIDER, D. A. 1991. An evaluation of non-equijoin algorithms. Tech. Rep. 1011, Univ. of Wisconsin-Madison, Madison, Wlsc.
|
 |
166
|
|
 |
167
|
|
 |
168
|
|
| |
169
|
|
| |
170
|
HONEYMAN, P. 1980. Extension joins. In Proceedings of Conferences on Very Large Data Bases, pp. 239-244.
|
| |
171
|
|
| |
172
|
KAMIBAYASHI, N, AND SEO, K. 1982. SPIRIT-III: An advanced relational database machine introducing a novel data staging architecture with tuple stream filters to preprocess relational algebra. In National Computer Conference Proceedings, pp. 605-616.
|
 |
173
|
Arthur M. Keller, Algorithms for translating view updates to database updates for views involving selections, projections, and joins, Proceedings of the fourth ACM SIGACT-SIGMOD symposium on Principles of database systems, p.154-163, March 25-27, 1985, Portland, Oregon, United States
[doi> 10.1145/325405.325423]
|
| |
174
|
K~NT, W. 1979. The entity join. In Proceedings of Conference on Very large Data Bases, pp. 232 -238.
|
 |
175
|
|
| |
176
|
|
 |
177
|
|
| |
178
|
MENON, M. J. AND HSIAO, D. K. 1983. Design and analysis of join operations of database machines. In Advanced Database Machine Architecture, D. K. Hsiao, Ed. Prentice-Hall, Englewood Cliffs, N.J., pp. 203-255.
|
 |
179
|
|
 |
180
|
|
| |
181
|
MERRETT, T, H., KAMBAYASm, Y., AND YASUURA, H. 1981. Scheduling of page fetches in join operations. In Proceedings of Conference on Very Large Data Bases, pp. 488-497.
|
| |
182
|
|
| |
183
|
|
| |
184
|
QADAH, G. Z. 1984. Evaluation of performance of the equi-join operation on the Michigan relational database machine. In Proceedings of Conference on Parallel Processing, pp. 260-265.
|
| |
185
|
QADAH, G. Z. 1985. The equijoin operation on a multiprocessor database machine. In Proceed~ ings of Interna~ional Workshop on Database Machines, Spriager-Verlag, New York, pp. 35-67.
|
| |
186
|
|
| |
187
|
|
| |
188
|
RISSANEN, J. 1979. Theory of joins for relational databases: A tutorial survey. In Proceedings of Symposium on Mathematical Foundations of Computer Science, Lecture Notes in Computer Science, vol. 64 Springer-Verlag, New York, pp. 537-551.
|
 |
189
|
|
| |
190
|
|
| |
191
|
SCHUSTER, S. A., }~GUYEN, H. B., OZKARAHAN, E. A., AND SMITH, K. C. 1979. RAP.2: An associative processor for databases and its applications. {EEE Trans. Comput. C-28, 6, 446-458.
|
 |
192
|
|
| |
193
|
SHAW, D. E. ET AL. 1981. The NON-VON database machine: A brief overview. Database Eng. 4, 2
|
| |
194
|
|
| |
195
|
Su, S. Y. W., NGUYEN, L. H., EMAN, A., AND LIPOVSKI, G. J. 1979. The architectural features and implementation techniques of multicell CASSM. L~EE Trans. Comput. C-28, 6 (June), 430-445.
|
| |
196
|
|
| |
197
|
VARm, M. Y. 1980. Axiomatization of functional and join dependencies in the relational model. Weizman InstiLute M.Sc. thesis, Rehovot, Israel.
|
| |
198
|
VARDI, M. Y. 1983. Inferring multivalued dependencies from functional and join dependencies. Acta Inf. 19, 2, 305-324.
|
CITED BY 59
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Andrew Lim , Jennifer Lai-Pheng Kwan , Wee-Chong Oon, Page access scheduling in join processing, Proceedings of the eighth international conference on Information and knowledge management, p.276-283, November 02-06, 1999, Kansas City, Missouri, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Yong-Ju Lee , Ho-Hyun Park , Nam-Hee Hong , Chin-Wan Chung, Spatial query processing using object decomposition method, Proceedings of the fifth international conference on Information and knowledge management, p.53-61, November 12-16, 1996, Rockville, Maryland, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
REVIEW
"Christian Mancas : Reviewer"
As the authors note, this overview of join processing in relational
databases simply collates information from the available literature. It
presents all known join types (general, equi, natural, semi, outer, and
self) as well as derived relat
more...
|