|
ABSTRACT
Many Geographic Information Systems (GIS) handle large geospatial datasets stored in raster representation. Spatial joins over raster data are important queries in GIS for data analysis and decision support. However, evaluating spatial joins can be very time intensive due to the size of these datasets. In this paper we propose a new interactive framework that allows users to get approximate answers in near instantaneous time, thus allowing for truly interactive data exploration. Our method utilizes two proposed statistical approaches: probabilistic join and sampling based join. Our probabilistic join method provides speedup of two orders of magnitude with no correctness guarantee, while our sampling based method provides an order of magnitude improvement over the full quad-tree join and also provides running confidence intervals. We propose a framework that combines the two approaches to allow end users to tradeoff speed versus bounded accuracy. The two approaches are evaluated empirically with real and synthetic datasets.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
|
 |
4
|
Leonardo Guerreiro Azevedo , Ralf Hartmut Güting , Rafael Brand Rodrigues , Geraldo Zimbrão , Jano Moreira de Souza, Filtering with raster signatures, Proceedings of the 14th annual ACM international symposium on Advances in geographic information systems, November 10-11, 2006, Arlington, Virginia, USA
[doi> 10.1145/1183471.1183503]
|
 |
5
|
|
 |
6
|
Reynold Cheng , Sarvjeet Singh , Sunil Prabhakar , Rahul Shah , Jeffrey Scott Vitter , Yuni Xia, Efficient join processing over uncertain data, Proceedings of the 15th ACM international conference on Information and knowledge management, November 06-11, 2006, Arlington, Virginia, USA
[doi> 10.1145/1183614.1183719]
|
| |
7
|
|
| |
8
|
F. Olken. Random Sampling from Databases. PhD thesis, University of California at Berkeley, 1993.
|
| |
9
|
R. J. Serfling. Basic Statistics for Business and Economics. McGraw-Hill, 2002.
|
| |
10
|
H. Tveite. Data Modeling and Database Requirements for Geographical Data. PhD thesis, University of Norway, 1997.
|
| |
11
|
|
| |
12
|
W. D. Bae, S. Alkobaisi, and S. T. Leutenegger. An incremental refinining spatial join algorithm for estimating qeury results in GIS. In Proceedings of DEXA, pages 935--944, 2006.
|
| |
13
|
W. D. Bae, S. Alkobaisi, and S. T. Leutenegger. IRSJ: Incremental refining spatial joins for interactive queries in GIS. In Technical Report DU-CS-07-10. University of Denver, 2007.
|
| |
14
|
USGS. http://tin.er.usgs.gov/, 2001, 2005.
|
|