ACM Home Page
Please provide us with feedback. Feedback
Hashing by proximity to process duplicates in spatial databases
Full text PdfPdf (953 KB)
Source Conference on Information and Knowledge Management archive
Proceedings of the third international conference on Information and knowledge management table of contents
Gaithersburg, Maryland, United States
Pages: 347 - 354  
Year of Publication: 1994
ISBN:0-89791-674-3
Authors
Walid G. Aref  Matsushita Information Technology Laboratory, Two Research Way, Princeton, New Jersey
Hanan Samet  Computer Science Department and Center for Automation Research and Institute for Advanced Computer Studies, The University of Maryland College Park, Maryland
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
NIST : National Institue of Standards & Technology
UMBC : U of MD Baltimore County
SIGART: ACM Special Interest Group on Artificial Intelligence
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 22,   Citation Count: 9
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/191246.191307
What is a DOI?

ABSTRACT

In a spatial database, an object may extend arbitrarily in space. As a result, many spatial data structures (e.g., the quadtree, the cell tree, the R+-tree) represent an object by partitioning it into multiple, yet simple, pieces, each of which is stored separately inside the data structure. Many operations on these data structures are likely to produce duplicate results because of the multiplicity of object pieces. A novel approach for duplicate processing based on proximity of spatial objects is presented. This is different from conventional duplicate elimination in database systems because, with spatial databases, different pieces of the same object can span multiple buckets of the underlying data structure. Example algorithms are presented to perform duplicate processing using proximity for quadtree representation of line segments and arbitrary rectangles. The complexity of the algorithms is seen to depend on a geometric classification of different instances of the spatial objects. By using proximity and the spatial properties of the objects, the number of disk-I/O requests as well as the run-time storage during duplicate processing can be reduced.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
W. G. Aref and H. Samet. Uniquely reporting spatial objects: Yet another operation for comparing spatial data structures. In Proceedings o.f the 5th International Symposium on Spatial Data Handling, pages 178-189, Charleston, SC, August 1992.
 
3
M. B. Dillencourt and H. Samet. Using topological sweep to extract the boundaries of regions in maps represented by region quadtrees. Submitted for publication, 1991.
4
 
5
6
 
7
 
8
9
 
10
A Klinger. Patterns and search statistics. In J. S. Rustagi, editor, Optimizing Methods in Statistics, pages 303-337. Academic Press, New York, 1971.
11
12
 
13
14
 
15
 
16


Collaborative Colleagues:
Walid G. Aref: colleagues
Hanan Samet: colleagues