| Selectivity estimation in spatial databases |
| Full text |
Pdf
(1.45 MB)
|
| Source
|
International Conference on Management of Data
archive
Proceedings of the 1999 ACM SIGMOD international conference on Management of data
table of contents
Philadelphia, Pennsylvania, United States
Pages: 13 - 24
Year of Publication: 1999
ISBN:1-58113-084-8
Also published in ...
|
|
Authors
|
|
Swarup Acharya
|
Information Sciences Research Center, Bell Laboratories, Lucent Technologies, 600, Mountain Avenue, Murray Hill, NJ
|
|
Viswanath Poosala
|
Information Sciences Research Center, Bell Laboratories, Lucent Technologies, 600, Mountain Avenue, Murray Hill, NJ
|
|
Sridhar Ramaswamy
|
Epiphany Inc., 2300 Geng Road, Suite 200, Palo Alto, CA and Information Sciences Research Center, Bell Laboratories, Lucent Technologies, 600, Mountain Avenue, Murray Hill, NJ
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 4, Downloads (12 Months): 58, Citation Count: 54
|
|
|
ABSTRACT
Selectivity estimation of queries is an important and well-studied problem in relational database systems. In this paper, we examine selectivity estimation in the context of Geographic Information Systems, which manage spatial data such as points, lines, poly-lines and polygons. In particular, we focus on point and range queries over two-dimensional rectangular data. We propose several techniques based on using spatial indices, histograms, binary space partitionings (BSPs), and the novel notion of spatial skew. Our techniques carefully partition the input rectangles into subsets and approximate each partition accurately. We present a detailed experimental study comparing the proposed techniques and the best known sampling and parametric techniques. We evaluate them using synthetic as well as real-life TIGER datasets. Based on our experiments, we identify a BSP based partitioning that we call Min-Skew which consistently provides the most accurate selectivity estimates for spatial queries. The Min-Skew partitioning can be constructed efficiently, occupies very little space, and provides accurate selectivity estimates over a broad range of spatial queries.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
Ant92
|
|
| |
Aok97
|
|
 |
APR99
|
Swarup Acharya , Viswanath Poosala , Sridhar Ramaswamy, Selectivity estimation in spatial databases, Proceedings of the 1999 ACM SIGMOD international conference on Management of data, p.13-24, May 31-June 03, 1999, Philadelphia, Pennsylvania, United States
|
| |
ARC93
|
ARC/INFO. Understanding GIS--the ARCflNFO method. ARC/INFO, 1993. Rev. 6 for workstations.
|
| |
BF95
|
|
 |
BKSS90
|
Norbert Beckmann , Hans-Peter Kriegel , Ralf Schneider , Bernhard Seeger, The R*-tree: an efficient and robust access method for points and rectangles, Proceedings of the 1990 ACM SIGMOD international conference on Management of data, p.322-331, May 23-26, 1990, Atlantic City, New Jersey, United States
|
 |
CR94
|
|
| |
DeW94
|
|
| |
GG82
|
|
| |
GRSS97
|
|
 |
Gut85
|
|
| |
HNSS95
|
|
| |
Int97
|
Intergraph Corp. MGE Z O, "hap://www.intergraph.com/iss/products/mge/mge- 7.0.htm", 1997.
|
| |
KMS97
|
|
| |
Koo80
|
|
 |
LNS90
|
Richard J. Lipton , Jeffrey F. Naughton , Donovan A. Schneider, Practical selectivity estimation through adaptive sampling, Proceedings of the 1990 ACM SIGMOD international conference on Management of data, p.1-11, May 23-26, 1990, Atlantic City, New Jersey, United States
|
| |
Map98
|
Maplnfo Corp. The Maplnfo Story, "hap://www.mapinfo.com/mapinfo/mapinfostory.htmr', 1998.
|
| |
MPS99
|
|
| |
PI97
|
|
 |
PIHS96
|
Viswanath Poosala , Peter J. Haas , Yannis E. Ioannidis , Eugene J. Shekita, Improved histograms for selectivity estimation of range predicates, Proceedings of the 1996 ACM SIGMOD international conference on Management of data, p.294-305, June 04-06, 1996, Montreal, Quebec, Canada
|
| |
Poo97
|
|
 |
PSC84
|
|
 |
SAC+79
|
P. Griffiths Selinger , M. M. Astrahan , D. D. Chamberlin , R. A. Lorie , T. G. Price, Access path selection in a relational database management system, Proceedings of the 1979 ACM SIGMOD international conference on Management of data, May 30-June 01, 1979, Boston, Massachusetts
[doi> 10.1145/582095.582099]
|
| |
Sam89a
|
|
| |
Sam89b
|
H. Street. The Design and Analyses of Spatial Data Structures. Addison Wesley, MA, 1989.
|
 |
SFGM93
|
|
| |
SRF87
|
|
| |
tig92
|
Tiger/line files (tm), 1992 technical documentation. Technical report, U. S. Bureau of the Census, 1992.
|
 |
TS96
|
|
 |
Ube94
|
|
| |
Zip49
|
G.K. Zipf. Human behaviour and the principle of least effort. Addison-Wesley, Reading, MA, 1949.
|
CITED BY 54
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jun Zhang , Manli Zhu , Dimitris Papadias , Yufei Tao , Dik Lun Lee, Location-based spatial queries, Proceedings of the 2003 ACM SIGMOD international conference on Management of data, June 09-12, 2003, San Diego, California
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Xuemin Lin , Qing Liu , Yidong Yuan , Xiaofang Zhou, Multiscale histograms: summarizing topological relations in large spatial datasets, Proceedings of the 29th international conference on Very large data bases, p.814-825, September 09-12, 2003, Berlin, Germany
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|