ACM Home Page
Please provide us with feedback. Feedback
Join processing in relational databases
Full text PdfPdf (4.42 MB)
Source ACM Computing Surveys (CSUR) archive
Volume 24 ,  Issue 1  (March 1992) table of contents
Pages: 63 - 113  
Year of Publication: 1992
ISSN:0360-0300
Authors
Priti Mishra
Margaret H. Eich  Southern Methodist Univ., Dallas, TX
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 44,   Downloads (12 Months): 416,   Citation Count: 59
Additional Information:

abstract   references   cited by   index terms   review   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/128762.128764
What is a DOI?

ABSTRACT

The join operation is one of the fundamental relational database query operations. It facilitates the retrieval of information from two different relations based on a Cartesian product of the two relations. The join is one of the most diffidult operations to implement efficiently, as no predefined links between relations are required to exist (as they are with network and hierarchical systems). The join is the only relational algebra operation that allows the combining of related tuples from relations on different attribute schemes. Since it is executed frequently and is expensive, much research effort has been applied to the optimization of join processing. In this paper, the different kinds of joins and the various implementation techniques are surveyed. These different methods are classified based on how they partition tuples from different relations. Some require that all tuples from one be compared to all tuples from another; other algorithms only compare some tuples from each. In addition, some techniques perform an explicit partitioning, whereas others are implicit.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
ATKINSON, M., BANCILHON, F., DEWITT, D., DITTRICH, K., MAIER, D., AND ZDOlXIIK, S. 1989 The object-oriented database system manifesto. In Proceedings of the Deductive an d Object Oriented Database8 Conference.
 
4
BABA, T., SAITO, H., AND YAO, S. B. 1987. A network algorithm for relational database operations. In International Workshop on Database Machines, pp. 257-270.
5
 
6
BANCILHON, F., RICHARD, P., AND SCHOLL, M 1983. VERSO: The relational database machine. In Advanced Database Machine Architecture, D. Hsiao, Ed., Prentice-Hall, Englewood Cliffs, N.J.
 
7
 
8
 
9
 
10
BARU, C. K, FRIEDER, 0., DANDLUR, D., AND SEGAL, M. 1987. Join on a cube: Analysis, simulation and implementation. In Proceedings of International Workshop on Database Machines (Dec.), pp. 74-87.
 
11
BEEm, C., AND VARDI, M. Y. 1981. On the properties of join dependencies. In Advances zn Database Theory, vol. 1. Plenum Publishing, New York, pp. 25-72.
 
12
13
 
14
BENTLEY, J. L. 1979. Multidimensional binary search trees in database applications. IEEE Trans. Softw. Eng. SE-5, 4 (July).
 
15
BENTLEY, J. L., AND KUNG, H. T. 1979. A tree machine for searching problems. IEEE Conference on Parallel Processing, pp. 257-266.
16
 
17
BERNSTmN, P. A., A~D GOODMAN, N. 1979a The theory of semijoins. Computer Corporation of America Rep. 79-27, Cambridge, Mass.
 
18
BERNSTmN, P. A., AND GOODMAN, N. 1979b. Inequality semijoins. Computer Corporation of America Rep. 79-28, Cambridge, Mass.
 
19
BERNSTEIN, P A, AND GOODMAN, N. 1980. The power of inequality semijolns Aiken Computa~ tion Lab Rep. 12-80, Harvard Umv., Cambridge, Mass.
 
20
 
21
22
 
23
BLASGEN, M. W., AND ESWARAN, K. P. 1977. Storage and access in relational databases. IBM Syst. J. 16, 4, 363-377
24
 
25
 
26
BmTTON LEE, INC. 1981. IDM 500: Intelligent Database Machine Product Description.
27
 
28
 
29
30
 
31
 
32
 
33
C~mSTODUL^KIS, S. 1985 Estimating block transfer and join sizes In Proceedings of SIGMOD CODASYL, 1971. Data Base Task Group Report.
34
 
35
ConD, E. F. 1972. Relational completeness of data base sublanguages. In Data Base Systems. Prentice-Hall, Englewood Cliffs, N.J., pp. 65-98.
 
36
 
37
 
38
DATE, C. J. 1983. The outer join. In Proceedings of International Conference on Databases (Cambridge, England), pp. 76-106.
 
39
DAYAL, U. 1985. Query processing in a multidatabase system. In Query Processzng in Database Systems, W. Kim, D. S. Reiner, and D. S. Batory, Eds. Springer-Verlag, New York, pp. 81-108.
 
40
 
41
 
42
DESHPANDE, V., LARSON, P.-A., AND MARTIN, T. P. 1990. Parallel join algorithms for nested relations on shared-memory multiprocessors. IEEE Symposium on Parallel and Distributed Processing, pp. 344-351.
 
43
DEWITT, D., AND GERBER, R. J. 1985. Multiprocessor hash-based join algorithms. In Proceedings of Conference on Very Large Data Bases, pp. 151-164.
44
45
 
46
 
47
EHRENSBERGER, M. J. 1984. The DBC/1012 database computer's systems-architecture, components, and performance. In Minnowbrook Workshop on Database Machines.
 
48
 
49
 
50
EPSTEIN, R., AND STONEBRAKER, M. 1980. Analysis of distributed database processing strategies. In Proceedings of Conference on Very Large Data Bases, pp. 92-101.
 
51
EPSTEIN, R. 1982. Query Processing Techniques for Distributed, Relational Database Systems. University Microfilms International, Ann Arbor, Mich.
52
 
53
54
 
55
 
56
GERBER, R. J. 1986. Datafiow query processing using multiprocessor hash-partitioned algorithms. Computer Sciences Tech. Rep. No. 672, Computer Sciences Dept., Univ. of Wisconsin, Madison, Wisc.
 
57
 
58
GOODMAN, J. R. 1981. An investigation of multiprocessor structures and algorithms for database management. Tech. Rep. UCB/ERL, M81/33, Univ. oF California, Berkeley.
59
 
60
 
61
62
 
63
 
64
 
65
66
 
67
HS~AO, D. K. (ED.) 1980. Collected Readings on a Database Computer (DBC). The Ohio State University, Columbus, Ohio
 
68
 
69
HURSC~, J. L. 1989. Relational joins: More than meets the eye. Database Program. Design 2, 12 (Dec.), 64-70.
 
70
HURSON, A. R. 1981. An associative backend for data base management. IEEE Workshop on Computer Architecture for Pattern Analysis and Image Data Base Management, pp. 225-230.
 
71
HURSON, A. R. 1986. VLSI time/space complexity of an associative parallel join module. In International Conference on Parallel Processing, pp. 379-386.
 
72
 
73
IBM 1978. IMS/VS General Information Manual GH20-1260. White Plains, New York
 
74
INTEL CORPORATION. iDBP DBMS Reference Manual, Order No. 222100.
75
 
76
KAMBAYASm, Y 1985 Processing cyclic queries. In Query Processing in Database Systems, W. Kim, D. S. Reiner, and D. S. Batory, Eds. Sprmger-Verlag, New York, pp. 62-78.
 
77
 
78
KANG, H., AND ROUSSOPULOS, N. 1987b. On the cost-effectiveness of a semijoin in query processing. In COMPSAC, pp. 531-537.
79
80
 
81
 
82
 
83
KITSUREGAWA, M., TANAKA, H., AND MOTO-OKA, T. 1983. Application of hash to database machine and its architecture. New Generation Com~ut. 1, }.
 
84
 
85
 
86
 
87
88
89
 
90
 
91
 
92
93
94
 
95
Lu, H , AND CAREY, M. 1985. Some experimental results on distributed join algorithms in a local network. In Proceedings of Conference on Very Large Databases, pp 292-304.
 
96
 
97
 
98
 
99
MENEZES, B. L., THADANI, K., DALE, A. G., AND JENEVmN, R 1987. Design of a HyperKYK- LOS-based multiprocessor architecture for high-performance join operations. In International Workshop on Database Machines, pp. 74-87.
 
100
 
101
 
102
 
103
 
104
 
105
 
106
 
107
0MIECINSKI, E., AND SHONKWILER, R. 1990. Parallel join processing using nonclustered indexes for a shared memory environment. In IEEE Symposium on Parallel and Distributed Processing, pp. 144-159.
108
 
109
 
110
111
 
112
113
 
114
 
115
PRAMANIK, S., AND FOTOUHI, F. 1985. An index database machine: An efficient m-way join processor. In Proceedings of Hawaii International Conference on System Sciences, vol. 1, pp. 330-338.
116
 
117
 
118
 
119
120
121
122
 
123
 
124
RUDOLPH, J. A. 1972a. A production implementation of an associative array processor. In Proceedings of Fall ,)o~nt Computer Conference, pp. 229-241.
 
125
126
 
127
SAKAI, H., IWATA, K., KAMIYA, S., ABE, M., TANAKA, A., SHIBAYAMA, S., AND MURAKAMI, K. 1984. Design and implementation of the relational database engine. In Proceedings of Conference on Fifth Generation Computer Systems, pp. 419-435.
128
129
130
131
132
 
133
 
134
135
 
136
 
137
TONG, F., AND YAO, S. B., 1982. Performance analysis of database join processors In National Computer Conference, pp. 627-637.
 
138
 
139
VALDURmZ, P. 1982. Semijoin algorithms for distributed databases. In 3rd International Symposium o7~ Distributed Databases.
 
140
VALDUmEZ, P 1986 Optimization of complex database queries using join indices. Database Eng. 9, 4 (Dec.), 10-16.
141
 
142
VALDURmZ, P., AND BORAL, H. 1986. Evaluation of recurmve queries using join indices. In Proceedings of Conference on Expert Database Systems.
 
143
V^LDURIEZ, P, AND GARDAmN, G. 1982. Multiprocessor join algorithms of relations In Improving Usability and Responsiveness. Academic Press, New York, pp. 219-237
144
145
 
146
 
147
 
148
 
149
WHANG, K.-Y., WIEDERHOLD, G., AND SAGALOWICZ, D. 1985. The property of separability and its application to physical database design. In Query Processing *n Database Systems, W Kim, D. S Reiner, and D S. Batory, Eds. Springer- Verlag, New York, pp. 297-317.
 
150
YAO, S B., TONa, F., ANn SHENG Y. Z. 1981 The system architecture of a data base machine (DBM). IEEE Database Eng. Bull. 4 2, 53-62.
 
151
 
152
 
153
 
154
 
155
156
157
 
158
 
159
BROWNSMITH, J D., AND SU, S. Y. W. 1980. Performance analysis of the equijoin operation m the MICRONET computer system In Proceedrags oflCC, pp. 264 268.
 
160
CHASE, K. 1981. Join graphs and acyclic database schemes. In Proceedings of Conference on Very Large Data Bases, pp 95-100
161
 
162
163
 
164
DEW,T% D. J. 1979. DIRECT: A multiprocessor organization for supporting relational database management systems. IEEE Trans. Comput. C-28, 6, 395-406.
 
165
DEWITT, D. J., NAUGHTON, J. F., AND SCHNEIDER, D. A. 1991. An evaluation of non-equijoin algorithms. Tech. Rep. 1011, Univ. of Wisconsin-Madison, Madison, Wlsc.
166
167
168
 
169
 
170
HONEYMAN, P. 1980. Extension joins. In Proceedings of Conferences on Very Large Data Bases, pp. 239-244.
 
171
 
172
KAMIBAYASHI, N, AND SEO, K. 1982. SPIRIT-III: An advanced relational database machine introducing a novel data staging architecture with tuple stream filters to preprocess relational algebra. In National Computer Conference Proceedings, pp. 605-616.
173
 
174
K~NT, W. 1979. The entity join. In Proceedings of Conference on Very large Data Bases, pp. 232 -238.
175
 
176
177
 
178
MENON, M. J. AND HSIAO, D. K. 1983. Design and analysis of join operations of database machines. In Advanced Database Machine Architecture, D. K. Hsiao, Ed. Prentice-Hall, Englewood Cliffs, N.J., pp. 203-255.
179
180
 
181
MERRETT, T, H., KAMBAYASm, Y., AND YASUURA, H. 1981. Scheduling of page fetches in join operations. In Proceedings of Conference on Very Large Data Bases, pp. 488-497.
 
182
 
183
 
184
QADAH, G. Z. 1984. Evaluation of performance of the equi-join operation on the Michigan relational database machine. In Proceedings of Conference on Parallel Processing, pp. 260-265.
 
185
QADAH, G. Z. 1985. The equijoin operation on a multiprocessor database machine. In Proceed~ ings of Interna~ional Workshop on Database Machines, Spriager-Verlag, New York, pp. 35-67.
 
186
 
187
 
188
RISSANEN, J. 1979. Theory of joins for relational databases: A tutorial survey. In Proceedings of Symposium on Mathematical Foundations of Computer Science, Lecture Notes in Computer Science, vol. 64 Springer-Verlag, New York, pp. 537-551.
189
 
190
 
191
SCHUSTER, S. A., }~GUYEN, H. B., OZKARAHAN, E. A., AND SMITH, K. C. 1979. RAP.2: An associative processor for databases and its applications. {EEE Trans. Comput. C-28, 6, 446-458.
192
 
193
SHAW, D. E. ET AL. 1981. The NON-VON database machine: A brief overview. Database Eng. 4, 2
 
194
 
195
Su, S. Y. W., NGUYEN, L. H., EMAN, A., AND LIPOVSKI, G. J. 1979. The architectural features and implementation techniques of multicell CASSM. L~EE Trans. Comput. C-28, 6 (June), 430-445.
 
196
 
197
VARm, M. Y. 1980. Axiomatization of functional and join dependencies in the relational model. Weizman InstiLute M.Sc. thesis, Rehovot, Israel.
 
198
VARDI, M. Y. 1983. Inferring multivalued dependencies from functional and join dependencies. Acta Inf. 19, 2, 305-324.

CITED BY  59


REVIEW

"Christian Mancas : Reviewer"

As the authors note, this overview of join processing in relational databases simply collates information from the available literature. It presents all known join types (general, equi, natural, semi, outer, and self) as well as derived relat  more...

Collaborative Colleagues:
Priti Mishra: colleagues
Margaret H. Eich: colleagues