|
ABSTRACT
This paper describes PBSM (Partition Based Spatial-Merge), a new algorithm for performing spatial join operation. This algorithm is especially effective when neither of the inputs to the join have an index on the joining attribute. Such a situation could arise if both inputs to the join are intermediate results in a complex query, or in a parallel environment where the inputs must be dynamically redistributed. The PBSM algorithm partitions the inputs into manageable chunks, and joins them using a computational geometry based plane-sweeping technique. This paper also presents a performance study comparing the the traditional indexed nested loops join algorithm, a spatial join algorithm based on joining spatial indices, and the PBSM algorithm. These comparisons are based on complete implementations of these algorithms in Paradise, a database system for handling GIS applications. Using real data sets, the performance study examines the behavior of these spatial join algorithms in a variety of situations, including the cases when both, one, or none of the inputs to the join have an suitable index. The study also examines the effect of clustering the join inputs on the performance of these join algorithms. The performance comparisons demonstrates the feasibility, and applicability of the PBSM join algorithm.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
Arc95
|
ESRI, Redlands, CA. "ARC/INFO: The World's GIS. An ESRI White Paper", March 1995.
|
 |
Ben75
|
|
| |
Ben79
|
J. L. Bentley. "Multidlmensionat Binary Search Trees in Database Applications". In IEEE Trat~sactions on Software Engineering, volume 5(4), 1979.
|
| |
BHF93
|
|
 |
BKS93
|
Thomas Brinkhoff , Hans-Peter Kriegel , Bernhard Seeger, Efficient processing of spatial joins using R-trees, Proceedings of the 1993 ACM SIGMOD international conference on Management of data, p.237-246, May 25-28, 1993, Washington, D.C., United States
|
 |
BKSS90
|
Norbert Beckmann , Hans-Peter Kriegel , Ralf Schneider , Bernhard Seeger, The R*-tree: an efficient and robust access method for points and rectangles, Proceedings of the 1990 ACM SIGMOD international conference on Management of data, p.322-331, May 23-26, 1990, Atlantic City, New Jersey, United States
|
 |
BKSS94
|
Thomas Brinkhoff , Hans-Peter Kriegel , Ralf Schneider , Bernhard Seeger, Multi-step processing of spatial joins, Proceedings of the 1994 ACM SIGMOD international conference on Management of data, p.197-208, May 24-27, 1994, Minneapolis, Minnesota, United States
|
| |
Bur86
|
R A. Burrough. "Principles of Geographic hzformation Systems for Land Resources Assessment". Oxford University Press, 1986.
|
 |
CDF+94
|
Michael J. Carey , David J. DeWitt , Michael J. Franklin , Nancy E. Hall , Mark L. McAuliffe , Jeffrey F. Naughton , Daniel T. Schuh , Marvin H. Solomon , C. K. Tan , Odysseas G. Tsatalos , Seth J. White , Michael J. Zwilling, Shoring up persistent applications, Proceedings of the 1994 ACM SIGMOD international conference on Management of data, p.383-394, May 24-27, 1994, Minneapolis, Minnesota, United States
|
 |
CFR87
|
Christos Faloutsos , Timos Sellis , Nick Roussopoulos, Analysis of object oriented spatial access methods, Proceedings of the 1987 ACM SIGMOD international conference on Management of data, p.426-439, May 27-29, 1987, San Francisco, California, United States
|
| |
Cor95
|
Intergraph Corporation. "GIS/AM/FM Information". "http://wwm intergraph'c~m/utihnap'shtmt ", 1995.
|
| |
DKL+94
|
|
| |
DLPY93
|
D. J. DeWitt, J. Luo, J. M. PateI, and J. Yu. "Paradise -A Parallel Geographic Information System". In Proceedings of the A CM Workshop on Advance~ in Geographic Information Systems, Arlington, Virginia, November 1993.
|
| |
DNSS92
|
|
| |
GS87
|
|
| |
Gün93
|
|
 |
Gut84
|
|
| |
HNKT90
|
|
| |
HS95
|
|
| |
KHT89
|
|
 |
LR94
|
|
| |
LR95
|
|
 |
LR96
|
|
| |
MC80
|
|
| |
MGR91
|
D.J. Maguire, M. E Goodchild, and D. W. Rhind. "Geographic h~formation Systems" volume 1. Longman Scientific & Technical, copublishect in the US with John Wiley & Sons, Inc. New York, 1991.
|
 |
NHS84
|
|
 |
NS86
|
|
 |
OM84
|
|
| |
OM88
|
|
 |
Ore86
|
|
 |
Ore89
|
|
 |
Ore90
|
|
| |
PD
|
J. M. Patel and D. J. DeWitt. "Partition Based Spatial-Merge Join". http://www.cs.wisc.edu/paradise/paradise.papers.html.
|
| |
PS88
|
|
| |
Rot91
|
D. Rotem. "Spatial Join Indices". in IEEE Transactions on Knowledge and Data Engineering, Kobe, April 1991.
|
 |
SFGM93
|
Michael Stonebraker , Jim Frew , Kenn Gardels , Jeff Meredith, The SEQUOIA 2000 storage benchmark, Proceedings of the 1993 ACM SIGMOD international conference on Management of data, p.2-11, May 25-28, 1993, Washington, D.C., United States
|
| |
Sto86
|
M. Stonebraker. "The Case for Shared Nothing". Database Engineering, 9( 1 ), 1986.
|
| |
Tig
|
U.S. Bureau of the Census, Washington, DC. "TIGER~Line Files(TM). 1992 Technical Documentation".
|
| |
TY95
|
|
 |
Ube94
|
|
 |
Val87
|
|
| |
ZG90
|
|
CITED BY 75
|
|
Ju-Won Song , Kyu-Young Whang , Young-Koo Lee , Min-Jae Lee , Sang-Wook Kim, Transformation-based spatial join, Proceedings of the eighth international conference on Information and knowledge management, p.15-26, November 02-06, 1999, Kansas City, Missouri, United States
|
|
|
|
|
|
Christian Böhm , Bernhard Braunmüller , Markus Breunig , Hans-Peter Kriegel, High performance clustering based on the similarity join, Proceedings of the ninth international conference on Information and knowledge management, p.298-305, November 06-11, 2000, McLean, Virginia, United States
|
|
|
Lars Arge , Octavian Procopiuc , Sridhar Ramaswamy , Torsten Suel , Jeffrey Scott Vitter, Theory and practice of I/O-efficient algorithms for multidimensional batched searching problems, Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms, p.685-694, January 25-27, 1998, San Francisco, California, United States
|
|
|
|
|
|
|
|
|
Jin-Yi Cai , Venkatesan T. Chakaravarthy , Raghav Kaushik , Jeffrey F. Naughton, On the complexity of join predicates, Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, p.207-214, May 2001, Santa Barbara, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jignesh Patel , JieBing Yu , Navin Kabra , Kristin Tufte , Biswadeep Nag , Josef Burger , Nancy Hall , Karthikeyan Ramasamy , Roger Lueder , Curt Ellmann , Jim Kupsch , Shelly Guo , Johan Larson , David De Witt , Jeffrey Naughton, Building a scaleable geo-spatial DBMS: technology, implementation, and evaluation, ACM SIGMOD Record, v.26 n.2, p.336-347, June 1997
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Mohamed F. Mokbel , Walid G. Aref , Susanne E. Hambrusch , Sunil Prabhakar, Towards scalable location-aware services: requirements and research issues, Proceedings of the 11th ACM international symposium on Advances in geographic information systems, p.110-117, November 07-08, 2003, New Orleans, Louisiana, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Shashi Shekhar , Sanjay Chawla , Siva Ravada , Andrew Fetterer , Xuan Liu , Chang-tien Lu, Spatial Databases-Accomplishments and Research Needs, IEEE Transactions on Knowledge and Data Engineering, v.11 n.1, p.45-55, January 1999
|
|
|
|
|
|
Xin Zhang , Nikos Mamoulis , David W. Cheung , Yutao Shou, Fast mining of spatial collocations, Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, August 22-25, 2004, Seattle, WA, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jens-Peter Dittrich , Bernhard Seeger , David Scot Taylor , Peter Widmayer, On producing join results early, Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, p.134-142, June 09-11, 2003, San Diego, California
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Chenyi Xia , Hongjun Lu , Beng Chin Ooi , Jing Hu, Gorder: an efficient method for KNN join processing, Proceedings of the Thirtieth international conference on Very large data bases, p.756-767, August 31-September 03, 2004, Toronto, Canada
|
|
|
Jens-Peter Dittrich , Bernhard Seeger , David Scot Taylor , Peter Widmayer, Progressive merge join: a generic and non-blocking sort-based join algorithm, Proceedings of the 28th international conference on Very Large Data Bases, p.299-310, August 20-23, 2002, Hong Kong, China
|
|
|
|
|
|
|
|
|
Jochen Van den Bercken , Björn Blohsfeld , Jens-Peter Dittrich , Jürgen Krämer , Tobias Schäfer , Martin Schneider , Bernhard Seeger, XXL - A Library Approach to Supporting Efficient Implementations of Advanced Database Queries, Proceedings of the 27th International Conference on Very Large Data Bases, p.39-48, September 11-14, 2001
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|